Research & Papers

Dynamic Decision Learning: Test-Time Evolution for Abnormality Grounding in Rare Diseases

New framework improves abnormality grounding in rare diseases without retraining.

Deep Dive

Researchers from multiple institutions have introduced Dynamic Decision Learning (DDL), a novel framework that enables frozen large vision-language models (LVLMs) to improve their performance on clinical abnormality grounding, particularly for rare diseases. The approach works by optimizing model instructions and consolidating predictions under visual perturbations at test time, avoiding the need for costly supervised fine-tuning on scarce medical data. This process not only enhances localization quality but also generates a consensus-based reliability score that quantifies model confidence.

Tested on brain imaging benchmarks including a rare-disease dataset covering 281 pathology types, DDL delivered a 105% improvement in mean average precision at IoU 0.75 (mAP@75) for rare disease cases. The framework outperformed both standard adaptation methods and full supervised fine-tuning across models ranging from 3B to 72B parameters. Notably, DDL maintained strong calibration between its reliability scores and actual localization accuracy even under severe distribution shifts and increasing task difficulty, making it a practical solution for real-world clinical deployment where training data is limited.

Key Points
  • DDL improves mAP@75 by up to 105% on rare-disease cases across 281 pathology types.
  • Works with frozen LVLMs from 3B to 72B parameters, eliminating need for fine-tuning.
  • Generates a reliability score that stays well-calibrated under distribution shifts and harder tasks.

Why It Matters

Enables accurate AI diagnosis for rare diseases without large datasets, critical for clinical adoption.