Gemma-4-31B-it-DFlash has been released
A new 31B instruction-tuned model needs llama.cpp support before testing can begin.
Deep Dive
A Hugging Face repo for "Gemma-4-31B-it-DFlash" by user z-lab has been shared, but local testing is blocked until llama.cpp merges PR #22105.
Key Points
- Model released on Hugging Face by z-lab: a 31B instruction-tuned variant of Gemma 4 with "DFlash" distillation.
- Local inference is not yet possible; it requires llama.cpp PR #22105 to be merged first.
- Aims to deliver high-quality open-source LLM performance on consumer-grade hardware.
Why It Matters
Once llama.cpp support lands, this 31B model could make advanced open-source reasoning accessible to teams with limited GPU budgets.