DeepSeek V4 Flash vs Kimi K2 Thinking

DeepSeek V4 Flash edges ahead on overall intelligence (RunFree 59.1 vs 45.2). Here's how they stack up on benchmarks, price and specs.

DeepSeek V4 Flash
DeepSeek
Kimi K2 Thinking
MoonshotAI

Shared benchmarks (6)

Loading chart…
DeepSeek V4 FlashKimi K2 Thinking
RunFree Score59.145.2
Blended price / 1M$0.11$1.07
Input / 1M$0.09$0.60
Output / 1M$0.18$2.50
Context window1.0M262K
Max output66K262K
Reasoning modelYesYes
GPQA Diamond88.1%84.5%
Humanity's Last Exam34.8%23.9%
LiveCodeBench91.6%83.1%
MMLU-Pro86.4%84.6%
SWE-Bench Verified79%71.3%
Terminal-Bench56.9%47.1%
Wins111

A blank means the metric is tied or one model has no verified data. Prices are blended 3:1 input:output.

Compare more: DeepSeek V4 Flash card · Kimi K2 Thinking card · full leaderboard