FLASH replaces discrete action chunks with continuous Legendre polynomial trajectory representation, enabling single-step flow matching inference?

FLASH replaces discrete action chunks with continuous Legendre polynomial trajectory representation, enabling single-step flow matching inference.

Inference time of 31.40ms per episode?

175x faster than diffusion policies and 18x faster than prior flow matching policies.

Achieves ≥92% success rate on all seven tasks (5 simulated, 2 real) with up to 4x faster training convergence than ACT?

Achieves ≥92% success rate on all seven tasks (5 simulated, 2 real) with up to 4x faster training convergence than ACT.

Robotics

FLASH Policy: 175x faster robot control with single-step inference

arXiv cs.RO May 18, 2026

⚡New method slashes inference time to 31ms, boosting robot speed by 175x

Deep Dive

A team of researchers (Jiaqi Bai, Jindou Jia, Yuxuan Hu, Gen Li, Xiangyu Chen, Tuo An, Kuangji Zuo, Jianfei Yang) has published FLASH (Fast Legendre-polynomial Action policy via Sparse History-anchored flow) on arXiv, a new approach to visuomotor policy learning for robotics. The key innovation is replacing the common two-step process of generating discrete action chunks and then denoising them iteratively. Instead, FLASH represents robot actions as continuous Legendre polynomial trajectories derived from expert demonstrations under sparse temporal sampling. This allows a single forward pass of a flow-matching model to cover an extended action horizon, completely sidestepping the slow denoising loop required by diffusion or prior flow-matching policies.

The performance gains are dramatic. In experiments across five simulated and two real-world manipulation tasks, FLASH achieves state-of-the-art success rates of at least 92% across all tasks. The per-episode inference time is just 31.40 milliseconds — that’s up to 175 times faster than diffusion policies and 18 times faster than previous flow-matching methods. Training converges up to 4 times faster than the popular ACT (Action Chunking with Transformers) approach. Furthermore, because FLASH outputs continuous trajectories, the analytic differentiation of the polynomial directly provides velocity feed-forward signals to torque controllers, reducing tracking error by 5x to 7x compared to discrete-action baselines. This work points toward a future where real-time, reactive robotic control can leverage generative models without the latency penalty that has limited their deployment.

Key Points

FLASH replaces discrete action chunks with continuous Legendre polynomial trajectory representation, enabling single-step flow matching inference.
Inference time of 31.40ms per episode: 175x faster than diffusion policies and 18x faster than prior flow matching policies.
Achieves ≥92% success rate on all seven tasks (5 simulated, 2 real) with up to 4x faster training convergence than ACT.

Why It Matters

FLASH makes generative AI practical for real-time robot control by eliminating the latency bottleneck, enabling reactive manipulation at human-like speeds.

Read Original Article

FLASH Policy: 175x faster robot control with single-step inference

Why It Matters

Related Articles

🚀 Stay Ahead in AI