Independent Evaluation, Unbiased Benchmarks

Testing AI on Real-World Tasks

We benchmark the world's leading AI models on rigorous, domain-specific tasks in finance, law, software, healthcare, and more. We run all of our own evaluations and create many of our benchmarks in-house.

Vals AI Updates

Fresh updates from our testing queue

benchmark
05/13/2026

Vals Index and Multimodal Index v1.1 Released

Vals Index and Multimodal Index v1.1 Released

View Details

System

Accuracy

67.62%

± 1.59

66.10%

± 1.36

60.30%

± 1.62

56.23%

± 1.76

55.55%

± 1.99

53.42%

± 1.67

52.14%

± 1.69

51.42%

± 2.05

49.31%

± 1.63

48.04%

± 1.54
Showing top 10 models from the benchmark. Visit the benchmark page to view more

Industry Leaderboard

Independent benchmarks for industry-specific AI performance.

Industry
Benchmark