Achieves 381.1-fold reduction in LLM calls over full simulation for 10M-agent public-opinion modeling?

Achieves 381.1-fold reduction in LLM calls over full simulation for 10M-agent public-opinion modeling.

Uses adaptive prototype selection, shadow-audit residual correction, and tail-protected routing to manage bias?

Uses adaptive prototype selection, shadow-audit residual correction, and tail-protected routing to manage bias.

Final-round Jensen-Shannon divergence of 0.094 against reference, outperforming same-budget baselines?

Final-round Jensen-Shannon divergence of 0.094 against reference, outperforming same-budget baselines.

Agent Frameworks

New APS method simulates 10M agents with 381x fewer LLM calls

arXiv cs.MA May 28, 2026

⚡381-fold reduction in LLM queries while keeping simulation accuracy high — a game-changer for social modeling.

Deep Dive

Simulating large populations with LLM agents is computationally expensive: each agent queries the model every round, scaling linearly with size and horizon. A new paper from Quan Zheng and collaborators introduces APS (Adaptive Prototype Simulation), which reframes the problem as a recurrent oracle-allocation problem. Instead of querying every agent, APS selects a small set of adaptive prototype agents to generate responses, then propagates those responses to nearby agents via local response surfaces. To control approximation bias, it employs shadow-audit residual correction (estimating and correcting errors) and tail-protected singleton routing (directly querying isolated or high-curvature agents).

In a 10M-agent multi-round public-opinion simulation, APS achieved a 381.1-fold reduction in LLM calls while maintaining a final-round Jensen-Shannon divergence of just 0.094 compared to a full-LLM reference. The framework outperforms scale-oriented and same-budget baselines in distributional discrepancy. By treating bias control as a first-class concern, APS makes population-scale social simulations practical for policy modeling, epidemiology, and market research — without the astronomical costs of naive LLM-based agent simulations.

Key Points

Achieves 381.1-fold reduction in LLM calls over full simulation for 10M-agent public-opinion modeling.
Uses adaptive prototype selection, shadow-audit residual correction, and tail-protected routing to manage bias.
Final-round Jensen-Shannon divergence of 0.094 against reference, outperforming same-budget baselines.

Why It Matters

Enables realistic, population-scale LLM agent simulations at a fraction of the cost — key for policy and social science modeling.

Read Original Article

New APS method simulates 10M agents with 381x fewer LLM calls

Why It Matters

Related Articles

🚀 Stay Ahead in AI