NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Trees to Flows and Back: Unifying Decision Trees and Diffusion Models (arxiv.org)
niksmather 23 hours ago [-]
Apologies if I didn't understand the paper, but why do you want to apply diffusion models to tabular datasets in the first place?

Do we think they'll be better than decision trees? Is there some tabular problem that can be handled by diffusion but not trees?

henrydark 9 hours ago [-]
First, they give a novel generation algorithm based on combining trees with diffusion, which trees alone just don't give you.

Second, yes, they think some tabular data will be fit better by their combination of trees with diffusion than just with trees.

robotresearcher 18 hours ago [-]
You might not want to make a sword out of iron if steel is available, but understanding the relationship between iron and steel is broadly valuable.
niksmather 11 hours ago [-]
I can see the mathematical results are interesting, I was more wondering if there was a practical utility to this TreeFlow thing they built.
semessier 24 hours ago [-]
this lacks the math for any bold claims
emil-lp 23 hours ago [-]
Did you read the paper? Is there something specifically you're missing? A proof? A theorem statement?
semessier 13 hours ago [-]
this is an empirical engineering paper with theoretical dressing, it would not need to be a theorem paper of course.
henrydark 21 hours ago [-]
Is the code available somewhere?
rsn243 1 days ago [-]
Decision trees and diffusion models are ostensibly disparate model classes, one discrete and hierarchical, the other continuous and dynamic. This work unifies the two by establishing a crisp mathematical correspondence between hierarchical decision trees and diffusion processes in appropriate limiting regimes. Our unification reveals a shared optimization principle: \emph{Global Trajectory Score Matching (GTSM)}, for which gradient boosting (in an idealized version) is asymptotically optimal. We underscore the conceptual value of our work through two key practical instantiations: \treeflow, which achieves competitive generation quality on tabular data with higher fidelity and a 2\times computational speedup, and \dsmtree, a novel distillation method that transfers hierarchical decision logic into neural networks, matching teacher performance within 2\% on many benchmarks.
Jaxan 23 hours ago [-]
You could at least fix the latex commands when copy pasting the abstract. ;-)
gorold 21 hours ago [-]
Figure 1 definitely cleared up any misunderstandings I had about the paper
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 17:56:46 GMT+0000 (Coordinated Universal Time) with Vercel.