OpenAI's GPT-5.2 rivals top human reviewers in Nature peer review study
45 scientists spent 469 hours comparing AI vs human reviews—AI held its own.
Deep Dive
45 scientists spent 469 hours comparing human and AI reviews across 82 papers. AI reviewers held their own against top-rated human reviewers, though with some weaknesses.
Key Points
- GPT-5.2 matched top human reviewers across 82 papers in a Nature study with 469 hours of comparison.
- AI excelled in clarity and methodology but struggled with nuanced judgment and domain expertise.
- Researchers propose AI-assisted peer review to reduce turnaround times by 40-60% without full automation.
Why It Matters
AI peer review could slash publication delays while freeing scientists for deeper analysis.