Robotics

STRELGen generates safety-critical driving scenarios with spatio-temporal logic

Diffusion models + formal logic = rare crash scenes without brute-force data collection

Deep Dive

A team led by Lorenzo Bonin, Francesco Giacomarra, Luca Bortolussi, Jyotirmoy V. Deshmukh, and Francesca Cairoli has published STRELGen, a scalable framework that rethinks autonomous vehicle safety validation. Traditional approaches rely on exposing AD systems to millions of real-world traffic scenes—a brute-force method that is both expensive and statistically poor at capturing the rare, safety-critical edge cases needed to ensure robustness. STRELGen tackles this by synergistically combining a multi-agent trajectory-generation diffusion model with Spatio-Temporal Logic (STREL) specifications. STREL encodes complex safety and realism properties in a highly interpretable formalism, and crucially, its monitoring satisfaction levels are differentiable.

At inference time, STRELGen optimizes directly over the diffusion model's latent space to maximize satisfaction of the STREL formula, yielding efficient generation of highly plausible yet safety-critical multi-agent scenarios that remain within the learned data distribution. This gradient-based search enables targeted creation of dangerous but realistic scenarios—such as near-misses or sudden pedestrian crossings—that traditional methods would require enormous datasets to encounter. STRELGen thus provides a flexible, interpretable, and powerful tool for stress-testing autonomous driving systems, moving beyond the limitations of brute-force data collection. The paper is available on arXiv (2605.19038) and represents a significant advance in neuro-symbolic methods for safety validation.

Key Points
  • STRELGen combines multi-agent diffusion trajectory models with Spatio-Temporal Logic (STREL) for interpretable safety specification.
  • Differentiable logic satisfaction enables gradient-based search over the diffusion latent space for targeted scenario generation.
  • Efficiently produces plausible, safety-critical edge cases that traditional brute-force data collection struggles to capture.

Why It Matters

Targeted generation of rare crash scenarios accelerates autonomous driving safety validation, replacing expensive brute-force data collection with efficient, interpretable AI.