Gemma 4's April launch was a spec sheet. The May multi-token prediction update is what made on-prem inference production-viable for EU CTOs in 2026.
Gemma 4's licence terms, 27B-parameter sweet spot, and EU-data RAG accuracy beat Llama 3.3 for regulated enterprise — the 90-day deployment benchmarks.