Skip to content

Updated 3 hours agoSources:Code ArenaLiveBench Coding

/ Live Benchmarks / Coding

Coding benchmarks

Code generation and completion tasks from Code Arena (Elo) and LiveBench.

#ModelScore
1
1567Elo
2
1562Elo
3
1542Elo
4
1541Elo
5
1538Elo
6
1533Elo
7
1523Elo
8
Kimi K2.6Moonshot
1518Elo
9
1508Elo
10
1506Elo
11
1505Elo
12
1490Elo
13
1486Elo
14
1479Elo
15
1471Elo
16
1467Elo
17
1464Elo
18
1460Elo
19
1457Elo
20
1448Elo
21
1444Elo
22
1440Elo
23
Mimo v2.5Xiaomi
1440Elo
24
1438Elo
25
1437Elo
26
1437Elo
27
GLM-5Z.ai
1436Elo
28
1434Elo
29
1431Elo
30
1408Elo
31
1407Elo
32
GPT-5.2OpenAI
1404Elo
33
1402Elo
34
1401Elo
35
1395Elo
36
1394Elo
37
1393Elo
38
1392Elo
39
1391Elo
40
Gpt 5.4OpenAI
1388Elo
41
1388Elo
42
1387Elo
43
1386Elo
44
1386Elo
45
1382Elo
46
1380Elo
47
1377Elo
48
1373Elo
49
1368Elo
50
1365Elo
51
1365Elo
52
1360Elo
53
1358Elo
54
1355Elo
55
GPT-5.1OpenAI
1340Elo
56
1337Elo
57
1335Elo
58
1332Elo
59
1329Elo
60
1329Elo
61
1322Elo
62
MiniMax M2MiniMax
1305Elo
63
1300Elo
64
1287Elo
65
1282Elo
66
1259Elo
67
1249Elo
68
1248Elo
69
1245Elo
70
1240Elo
71
1237Elo
72
1234Elo
73
1223Elo
74
1209Elo
75
1204Elo
76
1202Elo
77
Devstral 2Mistral
1199Elo
78
Mercury 2Inception AI
1165Elo
79
1150Elo
80
1140Elo
81
1091Elo

/ Live Benchmarks

Need help choosing the right AI model for your business?

Benchmarks are a starting point, not an answer. The right model depends on your workload, budget, and integration constraints — let's figure it out together.