LLM Leaderboard

Comprehensive benchmarks for large language models, sorted by intelligence index.

Live data | — models

Metrics Guide

Intelligence: Overall capability index
Coding: Code generation quality
Math: Mathematical reasoning