Industry Leaderboard
Updates
model
02/17/26Claude Sonnet 4.6 - Anthropic's latest model
Claude Sonnet 4.6 - Anthropic's latest model
View Details
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)
Top performing models from the Vals Index. Includes a range of tasks across finance, coding and law.
All Top Performing Models →Vals Index
2/17/2026Top performing open weight models from the Vals Index. Includes a range of tasks across finance, coding and law.
All Top Open Weight Models →Vals Index
2/17/2026The top performing models from the Vals Index which are cost efficient.
View full Pareto curve →Vals Index
2/17/2026View Details
Benchmarks
Accuracy
Rankings
Model benchmarks are seriously lacking. With Vals AI, we report how language models perform on the industry-specific tasks where they will be used.
By subscribing, I agree to Vals' Privacy Policy.