Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models
This new search technique could finally make diffusion models competitive for code and math.
Researchers introduced McDiffuSE, a framework using Monte Carlo Tree Search (MCTS) to optimize the generation order in Masked Diffusion Models for reasoning tasks. By simulating look-ahead completions, it systematically explores the best sequence to fill in missing information. The method achieved an average 3.2% improvement over standard autoregressive models and a notable 19.5% gain on the MBPP coding benchmark, showing MCTS can significantly enhance output quality and reduce variance.
Why It Matters
It provides a new path to make diffusion language models more reliable and accurate for complex problem-solving like coding and mathematics.