LLM Leaderboard
Comprehensive benchmarks for large language models, sorted by intelligence index.
Live data
|
— models
Metrics Guide
Intelligence: Overall capability index
Coding: Code generation quality
Math: Mathematical reasoning