
Persona2Web: First benchmark tests AI web agents on personalizing with user history

arXiv cs.CL February 20, 2026

⚡ Researchers create the first real-web benchmark requiring agents to infer preferences from long-term user history.

Deep Dive

Researchers Serin Kim, Sangam Lee, and Dongha Lee built Persona2Web, the first benchmark for evaluating personalized web agents on the real open web. It tests agents' ability to resolve ambiguous queries by inferring implicit user preferences from long-term history, using a 'clarify-to-personalize' principle. The benchmark includes user histories, ambiguous queries, and a reasoning-aware evaluation framework, with code and datasets publicly available for reproducibility.

Why It Matters

The benchmark pushes AI assistants beyond executing explicit commands, toward personalized, context-aware agents that can infer a user's implicit needs from their history.
