Gemini 3.5 Flash vs Grok 4

Benchmarks, API pricing and specs, head to head. Data updated 2026-06-10.

Gemini 3.5 Flash

Google · May 2026

Google's I/O 2026 headliner: near-Pro intelligence at Flash speed, 78.8% on SWE-bench Verified with full multimodal input.

Grok 4

xAI · Jul 2025

xAI's mid-2025 flagship, trained on the 200K-GPU Colossus cluster — a math and science reasoning standout of its generation.

The verdict

These two models don't yet share verified results on the benchmarks we track, so judge them on specs, pricing and intended use. Gemini 3.5 Flash is about 1.8x cheaper per blended million tokens (3:1 input:output mix). Gemini 3.5 Flash also takes 1M of context versus 256K for Grok 4.

Specs & pricing

Gemini 3.5 FlashGrok 4
modhub Index
Input price / 1M$1.5$3
Output price / 1M$9$15
Context window1M256K
Max output66K
Open weightsnono
Reasoning modelyesyes
Multimodal inputtext, image, audio, videotext, image
Knowledge cutoffFeb 2026Jul 2025
ReleasedMay 2026Jul 2025
Example monthly cost*$28.50$52.50

* 10M input + 1.5M output tokens per month at list prices, no caching. Green = better value on that row.

Frequently asked questions

Which is better, Gemini 3.5 Flash or Grok 4?
These two models don't yet share verified results on the benchmarks we track, so judge them on specs, pricing and intended use. Gemini 3.5 Flash is about 1.8x cheaper per blended million tokens (3:1 input:output mix). Gemini 3.5 Flash also takes 1M of context versus 256K for Grok 4.
Which is cheaper, Gemini 3.5 Flash or Grok 4?
Gemini 3.5 Flash costs $1.5/$9 per million input/output tokens, while Grok 4 costs $3/$15. For a typical workload of 10M input and 1.5M output tokens per month, that's $28.50 versus $52.50.
Which model is better for coding, Gemini 3.5 Flash or Grok 4?
We don't yet track SWE-bench Verified results for both models; check their individual pages for coding-related scores.

More comparisons