Claude Opus 4.6 (Thinking)

Release Date: 2/5/2026

Vals Index

Accuracy (Vals Index)

65.88% ± 1.94

Latency (Vals Index)

334.54s

Cost/Test (Vals Index)

$0.89

Context Window

200k

Max Output Tokens

128k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

128,000

Compute Effort

max

Benchmarks

Accuracy

Rankings

0.0%

± 1.94
3/ 40

0.0%

± 1.53
4/ 28

0.0%

± 0.37
18/ 47

0.0%

± 0.93
3/ 97

0.0%

± 2.78
4/ 45

0.0%

± 2.09
11/ 51

0.0%

± 1.94
3/ 51

0.0%

± 0.91
7/ 69

0.0%

± 5.03
3/ 24

0.0%

± 3.34
5/ 49

0.0%

± 0.83
3/ 104

0.0%

± 4.68
6/ 26

0.0%

± 0.64
9/ 96

0.0%

± 1.19
7/ 99

0.0%

± 1.02
14/ 103

0.0%

± 0.37
7/ 116

0.0%

± 0.19
12/ 95

0.0%

± 0.45
4/ 97

0.0%

± 0.88
10/ 66

0.0%

± 1.85
4/ 41

0.0%

± 5.25
7/ 52
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.