AI Safety

Training LLMs for True Randomness: Solving Bias with Distributional Fidelity

LLMs choose Wednesday 80% of the time – new training fixes mode collapse

Deep Dive

Recent work exposes a critical flaw in LLMs: they fail at generating truly random outputs. When asked to pick a random weekday, Qwen3 selects Wednesday about 80% of the time. Gemma-3 names only four cities in 75% of its responses, and multiple-choice questions see models heavily favoring option C for correct answers. This bias stems from training objectives that optimize for maximum likelihood, collapsing probability mass onto narrow modes even when diversity is essential for synthetic data generation, creative tasks, and agentic behavior. Researchers have now proposed addressing this as a training-time problem rather than an inference-time hack.

To solve this, the team trained models on minimal prompts to generate numbers from 30 different distribution families (24 seen during training). They evaluated three dimensions: distributional fidelity (do samples match the target distribution?), transfer to natural-language settings (e.g., random city names, balanced MCQ positions), and retention of other capabilities. Results show the method significantly improves stochastic behavior without major performance degradation, offering a path toward more reliable and diverse LLM outputs for downstream applications like synthetic data pipelines and unbiased agents.

Key Points
  • Qwen3 picks Wednesday 80% of the time when asked to name a random weekday; Gemma-3 gives 75% answers as just four cities.
  • Multiple-choice question models heavily bias option C as the correct answer due to mode collapse.
  • New training method explicitly teaches models to match 30 mathematical distribution families, improving randomness without sacrificing performance.

Why It Matters

Enables reliable synthetic data generation and unbiased agent behavior by fixing hidden randomness biases in LLMs.