DeepSeek V4 Flash vs Kimi K2 Thinking

DeepSeek V4 Flash edges ahead on overall intelligence (RunFree 59.1 vs 45.2). Here's how they stack up on benchmarks, price and specs.

DeepSeek V4 Flash

Kimi K2 Thinking

Shared benchmarks (6)

Loading chart…

	DeepSeek V4 Flash	Kimi K2 Thinking
RunFree Score	59.1	45.2
Blended price / 1M	$0.11	$1.07
Input / 1M	$0.09	$0.60
Output / 1M	$0.18	$2.50
Context window	1.0M	262K
Max output	66K	262K
Reasoning model	Yes	Yes
GPQA Diamond	88.1%	84.5%
Humanity's Last Exam	34.8%	23.9%
LiveCodeBench	91.6%	83.1%
MMLU-Pro	86.4%	84.6%
SWE-Bench Verified	79%	71.3%
Terminal-Bench	56.9%	47.1%
Wins	11	1

A blank means the metric is tied or one model has no verified data. Prices are blended 3:1 input:output.

Compare more: DeepSeek V4 Flash card · Kimi K2 Thinking card · full leaderboard