gemini-2.5-flash-preview-05-20

Gemini 2.5 Flash is our first Flash model model that features thinking capabilities, which lets you see the thinking process that the model goes through when generating its response.
TypeChatOrganizationGoogleRelease Date2025/5/20
Websitehttps://deepmind.google/models/gemini/

Ability Scores

Evaluation Case Report

Indicator NameIndicator WeightEvaluation TypePassed CasesFailed CasesPass RateCase Details
Logical Equivalence3hybrid8562%
Optimization Depth4subjective161355%
Syntax Error Detection2hybrid130100%

Evaluation Process