
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

A new open-source medical AI just outperformed top proprietary models on key benchmarks.

Deep Dive

Researchers have introduced MedXIAOHE, a medical vision-language foundation model that achieves state-of-the-art performance across diverse medical benchmarks and surpasses leading closed-source multimodal systems. An entity-aware pretraining framework broadens the model's knowledge coverage, including rare diseases, and reinforcement learning instills medical reasoning patterns for multi-step diagnosis. To make the model more reliable in real-world clinical use, the work also emphasizes evidence-grounded reasoning and low-hallucination report generation.

Why It Matters

This open-source model could accelerate AI adoption in healthcare by providing a powerful, transparent alternative to proprietary systems.