Anthropic's most intelligent model.

Released Date: 2/24/2025

Avg. Accuracy:

75.3%

Latency:

6.14s

Performance by Benchmark

Benchmarks

Accuracy

Rankings

LegalBench

78.1%

( 11 / 43 )

CorpFin

65.5%

( 2 / 19 )

CaseLaw

80.7%

( 12 / 39 )

ContractLaw

68.1%

( 17 / 47 )

TaxEval

75.9%

( 5 / 25 )

MedQA

83.3%

( 11 / 24 )

Academic Benchmarks
Proprietary Benchmarks (contact us to get access)

Cost Analysis

Input Cost

$3.00 / M Tokens

Output Cost

$15.00 / M Tokens

Input Cost (per char)

$0.72 / M chars

Output Cost (per char)

$5.57 / M chars

Overview

Important: This evaluation was performed with Thinking Mode disabled. For results with Thinking Mode enabled, see Claude 3.7 Sonnet (Thinking).

This ensures the model was evaluated under the same conditions as other non-reasoning models.

Claude 3.7 Sonnet is Anthropic’s latest model, succeeding Claude 3.5 Sonnet Latest which was released in October 2024.

What sets Claude 3.7 apart from its predecessors and competitors is its hybrid architecture, which makes thinking capabilities optional and fully configurable. Users can specify the number of thinking tokens independently from output tokens. These thinking tokens are preserved after generation, enabling users to examine and analyze the model’s reasoning process.

Key Specifications

  • Context Window: 200,000 tokens
  • Max Output Tokens: 8,192 tokens
  • Extended Thinking: 64,000 tokens
  • Training Cutoff: October 2024
  • Pricing:
    • Input: $3.00 / 1M tokens
    • Output: $15.00 / 1M tokens
Join our mailing list to receive benchmark updates on

Stay up to date as new benchmarks and models are released.