Industry Leaderboard
Updates
02/05/26: Claude Opus 4.6 is the new SOTA
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)
Top performing models from the Vals Index. The index includes a range of tasks across finance, coding, and law.
All Top Performing Models → Vals Index, 2/5/2026
Top performing open-weight models from the Vals Index. The index includes a range of tasks across finance, coding, and law.
All Top Open Weight Models → Vals Index, 2/5/2026
Top performing models from the Vals Index that are cost efficient.
View full Pareto curve → Vals Index, 2/5/2026
Model benchmarks are seriously lacking. With Vals AI, we report how language models perform on the industry-specific tasks where they will be used.