Research & Papers

Fidelity, Diversity, and Privacy: A Multi-Dimensional LLM Evaluation for Clinical Data Augmentation

arXiv cs.LG May 01, 2026

⚡DeepSeek-R1, Qwen 3.5, and OpenBioLLM generate privacy-safe clinical reports from ICD-10 codes

Deep Dive

A new arXiv paper (2604.27014) by Guillermo Iglesias and colleagues tests three large language models — DeepSeek-R1, OpenBioLLM-Llama3, and Qwen 3.5 — for generating synthetic clinical reports in mental health. The scarcity of annotated medical data (especially under strict privacy regulations) makes synthetic data augmentation attractive. The researchers conditioned text generation on specific ICD-10 diagnostic codes to produce realistic evaluation reports. To avoid mode collapse or memorization of private patient data, they designed a comprehensive evaluation framework across three dimensions: semantic fidelity (does the text clinically match the code?), lexical diversity (is there variety?), and privacy/plagiarism (does it leak real patient info?).

All three LLMs passed all tests, producing clinically coherent, lexically diverse, and privacy-safe synthetic reports. This significantly expands the available training data for clinical natural language processing tasks without compromising confidentiality. The study highlights that even open-weight models like OpenBioLLM can generate high-quality synthetic medical text when properly guided with condition codes. The 9-page paper includes one table and one figure detailing results. This work addresses a critical bottleneck in healthcare AI — data scarcity — while maintaining ethical safeguards.

Key Points

Three LLMs (DeepSeek-R1, OpenBioLLM-Llama3, Qwen 3.5) generated synthetic mental health reports conditioned on ICD-10 codes
Evaluation framework measured semantic fidelity, lexical diversity, and privacy/plagiarism to avoid mode collapse or data leaks
All models produced clinically coherent, diverse, and safe synthetic reports, enabling expanded training data for clinical NLP

Why It Matters

Synthetic clinical data from LLMs could accelerate mental health AI while preserving patient privacy under strict regulations.

Read Original Article

Fidelity, Diversity, and Privacy: A Multi-Dimensional LLM Evaluation for Clinical Data Augmentation

Why It Matters

Stay Ahead in AI