Qwen3-235B-A22B-Thinking-2507

Qwen3 full series models achieve effective integration of thinking and non-thinking modes, enabling mode switching during conversations.
TypeChat(Thinking)OrganizationAlibabaRelease Date2025-07-25
Websitehttps://tongyi.aliyun.com/welcome

Ability Scores

Evaluation Case Report

Indicator NameIndicator WeightEvaluation TypePassed CasesFailed CasesPass RateCase Details
Logical Equivalence3hybrid7654%
Optimization Depth4subjective181162%
Syntax Error Detection2hybrid12192%

Evaluation Process