AI Safety

Study finds mid-2025 LLMs offer only modest help for novice lab work

arXiv cs.CY February 19, 2026

⚡ A randomized trial estimates that LLM assistance gives novices a roughly 1.4-fold boost on a typical lab task, but it did not significantly raise completion of the full procedure.

Deep Dive

A research team led by Shen Zhou Hong published a pre-registered, randomized controlled trial (n=153) testing whether mid-2025 LLMs help novices complete a viral reverse genetics lab workflow. The study found no significant difference in the share of participants who completed the full workflow (5.2% in the LLM arm vs. 6.6% in the Internet arm). However, Bayesian modeling estimated a 1.4-fold increase in success on a typical task, with a notable improvement in cell culture (68.8% vs. 55.3%).
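To make the arithmetic behind these figures concrete, here is a minimal Python sketch. It is not the study's analysis: the authors used Bayesian modeling, while this sketch swaps in a plain Fisher exact test and a raw ratio of percentages. The per-arm counts (4 of 77 and 5 of 76) are an assumption, reverse-engineered from n=153 and the reported percentages; the summary does not state them.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x successes landing in row 1
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum over every table that is no more likely than the observed one
    return sum(prob(x) for x in range(lo, hi + 1)
               if prob(x) <= p_obs * (1 + 1e-9))


# ASSUMED counts (not given in the summary): 4/77 is about 5.2%, 5/76 about 6.6%
llm_done, llm_n = 4, 77
net_done, net_n = 5, 76

p_completion = fisher_exact_two_sided(
    llm_done, llm_n - llm_done, net_done, net_n - net_done
)

# Raw ratio of the reported cell-culture success rates (68.8% vs. 55.3%)
raw_ratio = 68.8 / 55.3

print(f"full-workflow completion, Fisher exact p = {p_completion:.2f}")
print(f"cell culture, raw success ratio = {raw_ratio:.2f}")
```

Under these assumed counts the p-value is far above 0.05, in line with the reported lack of a significant difference. The raw cell-culture ratio comes out near 1.24, which is lower than the 1.4-fold figure; that is expected, since the 1.4-fold number is a model-based estimate for a typical task rather than a ratio for any single task.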

Why It Matters

The results point to a gap between how LLMs perform on benchmarks and how much they actually help in the lab, a distinction that bears directly on biosecurity risk assessments.
