gpt-oss-120b vs OpenAI o4-mini
Benchmarks, API pricing and specs, head to head. Data updated 2026-06-10.
OpenAI · Aug 2025
OpenAI's open-weights MoE reasoning model under Apache 2.0 — near o4-mini quality, runnable on a single 80GB GPU.
OpenAI · Apr 2025
A compact reasoning model that punched far above its price on math — 92.7% on AIME 2025 at $1.10 per million input tokens.
The verdict
OpenAI o4-mini wins both of the benchmarks these models share; gpt-oss-120b wins neither. On price the gap runs the other way: gpt-oss-120b works out roughly 9.6x cheaper per blended million tokens. OpenAI o4-mini also has the larger context window, 200K tokens versus 131K for gpt-oss-120b. gpt-oss-120b is open-weights (Apache 2.0), so it can be self-hosted, a structural advantage if data control or vendor independence matters.
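The page does not say how the "blended" price is weighted. As a rough sketch, assuming a 3:1 input-to-output token mix (an assumption, not something stated above), the list prices from the specs table reproduce a ratio close to the quoted 9.6x. The `blended_price` helper and its parameters are illustrative names only.

```python
# List prices in USD per 1M tokens, copied from the specs table.
PRICES = {
    "gpt-oss-120b": {"input": 0.10, "output": 0.50},
    "OpenAI o4-mini": {"input": 1.10, "output": 4.40},
}


def blended_price(model, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens.

    The default 3:1 input:output weighting is an assumption; the page
    does not state which blend it uses.
    """
    price = PRICES[model]
    total_parts = input_parts + output_parts
    weighted = input_parts * price["input"] + output_parts * price["output"]
    return weighted / total_parts


cheap = blended_price("gpt-oss-120b")
pricey = blended_price("OpenAI o4-mini")
ratio = pricey / cheap
print(f"gpt-oss-120b: ${cheap:.3f}, o4-mini: ${pricey:.3f}, ratio: {ratio:.1f}x")
```

A different mix shifts the ratio: output-heavy workloads narrow the gap toward 8.8x (the output price ratio), while input-heavy ones widen it toward 11x (the input price ratio).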
Benchmark head-to-head 0–2
Specs & pricing
| | gpt-oss-120b | OpenAI o4-mini |
|---|---|---|
| modhub Index | — | 80.9 |
| Input price / 1M tokens | $0.10 | $1.10 |
| Output price / 1M tokens | $0.50 | $4.40 |
| Context window | 131K | 200K |
| Max output | 131K | 100K |
| Open weights | yes (Apache 2.0) | no |
| Reasoning model | yes | yes |
| Multimodal input | text | text, image |
| Knowledge cutoff | Jun 2024 | May 2024 |
| Released | Aug 2025 | Apr 2025 |
| Example monthly cost* | $1.75 | $17.60 |
* 10M input + 1.5M output tokens per month at list prices, no caching.
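The example monthly cost row follows directly from the list prices and the footnote's workload. A minimal sketch of that arithmetic, where `monthly_cost` is an illustrative helper rather than anything the page provides:

```python
def monthly_cost(input_price, output_price, input_millions=10.0, output_millions=1.5):
    """Monthly cost in USD at list prices, with no caching discount.

    Prices are USD per 1M tokens; volumes are in millions of tokens.
    The defaults match the footnote's workload (10M input, 1.5M output).
    """
    return input_price * input_millions + output_price * output_millions


gpt_oss = monthly_cost(0.10, 0.50)   # 10 * 0.10 + 1.5 * 0.50
o4_mini = monthly_cost(1.10, 4.40)   # 10 * 1.10 + 1.5 * 4.40
print(f"gpt-oss-120b: ${gpt_oss:.2f}/month")
print(f"OpenAI o4-mini: ${o4_mini:.2f}/month")
print(f"ratio: {o4_mini / gpt_oss:.1f}x")
```

For this particular workload the ratio comes out near 10x rather than the 9.6x blended figure, because the mix here is more input-heavy than whatever blend the headline number uses.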
Frequently asked questions
- Which is better, gpt-oss-120b or OpenAI o4-mini?
  - OpenAI o4-mini wins both of the benchmarks these models share; gpt-oss-120b wins neither. On price the gap runs the other way: gpt-oss-120b works out roughly 9.6x cheaper per blended million tokens. OpenAI o4-mini also has the larger context window, 200K tokens versus 131K for gpt-oss-120b. gpt-oss-120b is open-weights (Apache 2.0), so it can be self-hosted, a structural advantage if data control or vendor independence matters.
- Which is cheaper, gpt-oss-120b or OpenAI o4-mini?
  - gpt-oss-120b costs $0.10/$0.50 per million input/output tokens, while OpenAI o4-mini costs $1.10/$4.40. For a typical workload of 10M input and 1.5M output tokens per month, that's $1.75 versus $17.60.
- Which model is better for coding, gpt-oss-120b or OpenAI o4-mini?
  - We don't yet track SWE-bench Verified results for both of these models; check their individual pages for coding-related scores.