Claude Opus 4.8 vs DeepSeek V4 Flash

Claude Opus 4.8 holds a wide lead on overall intelligence (RunFree Score 94.7 vs 59.1). Here's how the two models stack up on benchmarks, price, and specs.

Claude Opus 4.8 (Anthropic)
DeepSeek V4 Flash (DeepSeek)

Shared benchmarks (4)

Metric                       Claude Opus 4.8    DeepSeek V4 Flash
RunFree Score                94.7               59.1
Blended price / 1M tokens    $10.00             $0.11
Input / 1M tokens            $5.00              $0.09
Output / 1M tokens           $25.00             $0.18
Context window               1M                 1.0M
Max output                   128K               66K
Reasoning model              Yes                Yes
GPQA Diamond                 93.6%              88.1%
Humanity's Last Exam         49.8%              34.8%
SWE-Bench Verified           88.6%              79%
Terminal-Bench               74.6%              56.9%
Wins                         6                  4

A blank means the metric is tied or one model has no verified data. Blended prices weight input and output prices 3:1 (three parts input to one part output).
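The 3:1 blend can be checked against the listed input and output prices. The sketch below assumes the blend is a simple weighted average, (3 × input + 1 × output) / 4, which reproduces both blended figures shown here; the function name `blended_price` and its weight parameters are illustrative and not part of the source page.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices, in USD per 1M tokens.

    Assumption: a 3:1 input:output blend means a weighted mean with
    weights 3 and 1, not a sum or any other combination.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_per_m + output_weight * output_per_m) / total_weight


# (input, output) prices per 1M tokens, as listed in the comparison
prices = {
    "Claude Opus 4.8": (5.00, 25.00),
    "DeepSeek V4 Flash": (0.09, 0.18),
}

for model, (inp, out) in prices.items():
    print(f"{model}: ${blended_price(inp, out):.2f} per 1M tokens")
```

With these inputs the unrounded values are 10.00 for Claude Opus 4.8 and 0.1125 for DeepSeek V4 Flash, matching the $10.00 and $0.11 listed once rounded to cents.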
