Agent Frameworks

DeepMind's AlphaEvolve uses LLMs to discover new multi-agent learning algorithms

arXiv cs.MA February 20, 2026

⚡An AI agent evolved two novel algorithms that outperform state-of-the-art baselines in game theory.

Deep Dive

Researchers from DeepMind and collaborators built AlphaEvolve, an evolutionary coding agent powered by large language models, to automate the discovery of multi-agent reinforcement learning (MARL) algorithms. It evolved two new algorithms: VAD-CFR for iterative regret minimization and SHOR-PSRO for population-based training. These algorithms introduce non-intuitive mechanisms like volatility-sensitive discounting and hybrid meta-solvers, outperforming established baselines like Discounted Predictive CFR+ in imperfect-information games.

Why It Matters

This automates a core, human-intensive research process, potentially accelerating breakthroughs in game theory, economics, and multi-agent AI systems.

Read Original Article

DeepMind's AlphaEvolve uses LLMs to discover new multi-agent learning algorithms

Why It Matters

Related Articles

🚀 Stay Ahead in AI