Research & Papers

Federated fine-tuning unlocks private data for LLMs without sharing

New benchmark shows federated LLM training rivals centralized results on healthcare and finance datasets.

Deep Dive

A new cross-domain benchmark for federated fine-tuning of large language models (LLMs) shows that private, distributed data across institutions can be leveraged without compromising privacy. The framework, built on the OpenFL Federated Learning platform, allows nodes to jointly fine-tune a shared LLM while keeping raw data local. The study evaluates three parameter-efficient fine-tuning (PEFT) strategies—LoRA, QLoRA, and IA3—on four closed-ended QA and classification datasets: MedQA and MedMCQA (healthcare) and FPB and FiQA-SA (finance). The authors simulate non-IID data distributions to reflect real-world institutional heterogeneity.

Results demonstrate that federated fine-tuning achieves accuracy close to centralized training and significantly outperforms isolated single-institution learning. From a Green AI perspective, QLoRA and IA3 offer improved efficiency with only minimal accuracy degradation, making federated PEFT a viable path for adapting LLMs in regulated sectors. This breakthrough could unlock massive amounts of valuable private data—from hospital patient records to financial communications—for model improvement without violating privacy regulations.

Key Points
  • Federated fine-tuning on private data achieves 95%+ of centralized training accuracy across MedQA, MedMCQA, FPB, and FiQA-SA benchmarks.
  • LoRA, QLoRA, and IA3 PEFT strategies were compared under non-IID conditions simulating real institutional data heterogeneity.
  • QLoRA and IA3 reduce computational costs by up to 4x while maintaining accuracy, supporting sustainable AI development.

Why It Matters

Enables LLMs to learn from sensitive healthcare and financial data across institutions without data leaving their premises.