Open Source

Google Gemma 4 confirmed with QAT support – developers advised to wait

Omar from Google's Gemma team hints at native quantization-aware training in the next release.

Deep Dive

A comment from Omar on the Gemma team suggests that developers pause quantization testing and wait for refinements.

Key Points
  • Omar from Google’s Gemma team confirmed Gemma 4 will include QAT (Quantization-Aware Training).
  • QAT simulates low-precision arithmetic during training, so the model retains more of its accuracy after quantization. This is critical for local and edge deployment.
  • Developers are urged to delay quantization testing on current Gemma models and wait for Gemma 4.
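
To make the QAT point above concrete, here is a minimal sketch of the "fake quantization" step that QAT schemes typically apply to weights in the forward pass. It is illustrative only: the source does not describe how Gemma implements QAT, and the function names, the symmetric per-tensor scheme, and the sample weights are all assumptions chosen for the example.

```python
def fake_quantize(weights, bits=4):
    """Snap floats to a signed low-bit grid, then map them back to floats.

    Hypothetical helper: symmetric, per-tensor scaling is assumed here.
    """
    qmax = 2 ** (bits - 1) - 1                 # largest integer level, e.g. 7 for 4-bit
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0:
        return list(weights)                   # nothing to scale
    scale = max_abs / qmax                     # float width of one integer step
    quantized = []
    for w in weights:
        q = round(w / scale)                   # nearest integer level
        q = max(-qmax, min(qmax, q))           # clamp to the representable range
        quantized.append(q * scale)            # back to float ("fake" quantization)
    return quantized


def max_rounding_error(weights, bits=4):
    """Largest absolute gap between a weight and its quantized value."""
    quantized = fake_quantize(weights, bits)
    return max(abs(w - q) for w, q in zip(weights, quantized))


# Made-up sample weights, for illustration only.
weights = [0.70, -0.33, 0.12, -0.04]
print(fake_quantize(weights, bits=4))
print(max_rounding_error(weights, bits=4))
print(max_rounding_error(weights, bits=8))
```

During QAT the model trains against these rounded values, commonly passing gradients through the rounding step unchanged, so the weights settle where rounding costs less accuracy. Quantizing only after training gives the model no chance to adapt to that error.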

Why It Matters

Native QAT in Gemma 4 could make high‑accuracy quantized models a reality for local LLM users.