Industry Leaderboard
Updates
model
02/04/26Qwen 3 Max Thinking Evaluated on Vals Index!
Qwen 3 Max Thinking Evaluated on Vals Index!
View Details
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)
Top performing models from the Vals Index. Includes a range of tasks across finance, coding and law.
All Top Performing Models →Vals Index
2/2/2026Top performing open weight models from the Vals Index. Includes a range of tasks across finance, coding and law.
All Top Open Weight Models →Vals Index
2/2/2026The top performing models from the Vals Index which are cost efficient.
View full Pareto curve →Vals Index
2/2/2026View Details
Benchmarks
Accuracy
Rankings
Model benchmarks are seriously lacking. With Vals AI, we report how language models perform on the industry-specific tasks where they will be used.
By subscribing, I agree to Vals' Privacy Policy.