Gemini 2.5 Flash vs Grok 4

Benchmarks, API pricing and specs, head to head. Data updated 2026-06-10.

Gemini 2.5 Flash

Google · Jun 2025

The 2025 fast-tier favorite: hybrid reasoning, full multimodality and a 1M context at $0.30/$2.50.

Grok 4

xAI · Jul 2025

xAI's mid-2025 flagship, trained on the 200K-GPU Colossus cluster — a math and science reasoning standout of its generation.

The verdict

Grok 4 wins 1 of the 1 benchmarks these models share, against 0 for Gemini 2.5 Flash. On price the gap is dramatic: Gemini 2.5 Flash works out roughly 7.1x cheaper per blended million tokens. Gemini 2.5 Flash also takes 1.0M of context versus 256K for Grok 4.

Benchmark head-to-head 01

GPQA Diamond
~78.3%87.5%
Gemini 2.5 FlashGrok 4

Specs & pricing

Gemini 2.5 FlashGrok 4
modhub Index
Input price / 1M$0.3$3
Output price / 1M$2.5$15
Context window1.0M256K
Max output66K
Open weightsnono
Reasoning modelyesyes
Multimodal inputtext, image, audio, videotext, image
Knowledge cutoffJan 2025Jul 2025
ReleasedJun 2025Jul 2025
Example monthly cost*$6.75$52.50

* 10M input + 1.5M output tokens per month at list prices, no caching. Green = better value on that row.

Frequently asked questions

Which is better, Gemini 2.5 Flash or Grok 4?
Grok 4 wins 1 of the 1 benchmarks these models share, against 0 for Gemini 2.5 Flash. On price the gap is dramatic: Gemini 2.5 Flash works out roughly 7.1x cheaper per blended million tokens. Gemini 2.5 Flash also takes 1.0M of context versus 256K for Grok 4.
Which is cheaper, Gemini 2.5 Flash or Grok 4?
Gemini 2.5 Flash costs $0.3/$2.5 per million input/output tokens, while Grok 4 costs $3/$15. For a typical workload of 10M input and 1.5M output tokens per month, that's $6.75 versus $52.50.
Which model is better for coding, Gemini 2.5 Flash or Grok 4?
We don't yet track SWE-bench Verified results for both models; check their individual pages for coding-related scores.

More comparisons