LIVE DeepSeek V3 · 50 RPD 免费 · Gemini 2.0 · 15 RPM 免费 API · 硅基流动 · 14 模型免费 · Groq · Llama 3.3 免费推理 · 通义千问 · 100 万 tokens 免费 · Kimi · 无限免费对话 · LIVE DeepSeek V3 · 50 RPD 免费 · Gemini 2.0 · 15 RPM 免费 API · 硅基流动 · 14 模型免费 · Groq · Llama 3.3 免费推理 · 通义千问 · 100 万 tokens 免费 · Kimi · 无限免费对话

Gemma 4

Google's open-source reasoning & agent model, Apache 2.0, four sizes

✅ Free Tier 🇨🇳 China Accessible

What is Gemma 4

Gemma 4 is Google DeepMind's open model family released April 2026, built for reasoning and agentic workflows. Four sizes: 31B (Dense, strongest), 26B-A4B (MoE, efficient), E4B and E2B (lightweight). Apache 2.0 license, no commercial restrictions.

How to Run Locally

Easiest way: ollama run gemma4:31b. The 31B version needs ~20GB VRAM/RAM, 26B-A4B only ~4GB active params. Supports speculative decoding for 29% speedup (50% on code).

Coding Benchmark Comparison

Gemma 4 31B excels at reasoning and agent tasks, but slightly behind Qwen 3.6-35B-A3B on coding benchmarks (64.7% vs 71.4%). Advantages: most permissive Apache 2.0 license, best native function calling support.

🐑 小羊助手

Gemma 4

What is Gemma 4

How to Run Locally

Coding Benchmark Comparison

Sign up free to unlock all content