Claude Sonnet 4

Claude Sonnet 4 hybrid reasoning model with superior intelligence for high-volume use cases, and 200K context window.
TypeChat(Thinking)OrganizationAnthropicRelease Date2025-05-14
Websitehttps://www.anthropic.com/claude/sonnet

Ability Scores

Evaluation Case Report

Indicator NameIndicator WeightEvaluation TypePassed CasesFailed CasesPass RateCase Details
Logical Equivalence3hybrid11285%
Optimization Depth4subjective161355%
Syntax Error Detection2hybrid12192%

Evaluation Process