LiveCodeBench

coding

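Contamination-free competitive-programming problems collected continuously from LeetCode, AtCoder, and Codeforces. Because each problem carries a release date, a model can be evaluated only on problems published after its training cutoff, which cannot have appeared in its training data.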
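The date-based filtering described above can be sketched roughly as follows. This is a minimal illustration, not the benchmark's actual code: the `Problem` record, its field names, the `uncontaminated` helper, the sample problems, and the cutoff date are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Problem:
    # Hypothetical record; the real dataset's schema may differ.
    title: str
    platform: str
    released: date


def uncontaminated(problems, training_cutoff):
    """Keep only problems published strictly after the training cutoff."""
    return [p for p in problems if p.released > training_cutoff]


# Made-up sample data, for illustration only.
problems = [
    Problem("example-a", "LeetCode", date(2024, 1, 15)),
    Problem("example-b", "AtCoder", date(2024, 9, 1)),
    Problem("example-c", "Codeforces", date(2025, 2, 20)),
]

# Assumed cutoff for an imaginary model.
eval_set = uncontaminated(problems, training_cutoff=date(2024, 6, 30))
print([p.title for p in eval_set])  # ['example-b', 'example-c']
```

With a filter like this, the evaluation window depends on the model under test, so two models with different cutoffs may be scored on different subsets of problems.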

Official benchmark page

Model rankings on LiveCodeBench

#  Model               Score  As of         Source
1  DeepSeek V4 Pro     93.5%  Apr 24, 2026  cite
2  DeepSeek V4 Flash   91.6%  Apr 24, 2026  cite
3  Qwen3 Max Thinking  91.4%  Nov 12, 2025  cite
4  Kimi K2 Thinking    83.1%  Nov 6, 2025   cite
5  Gemini 2.5 Pro      69%    Jun 27, 2025  cite
6  Llama 4 Maverick    43.4%  Apr 5, 2025   cite

Scores are self-reported or taken from primary evaluations, and each is linked to its source. Test conditions (tools, shots, prompt) vary between labs, so scores are not strictly comparable; see the source for details.
