Gemini 2.5 Flash
GooglemidGoogle's fast and affordable thinking model with native multimodal support and a 1M token context window. Combines reasoning capabilities with exceptional speed and low cost. One of the best value models available for multimodal and long-context workloads.
Specifications
1.0M tokens
65.5K tokens
$0.150 / 1M tokens
$0.600 / 1M tokens
Ultra Fast (speed score: 9.5/10)
Capability Profile
Feature Support
Best Use Cases
Not Ideal For
Strengths
Weaknesses
Edge Cases & Notes
Provider Notes
The best value multimodal model in the market. Available through Gemini API and Vertex AI. Free tier available. Recommended as the default for cost-sensitive multimodal workloads.
Benchmarks
Benchmark Notes
MMLU-Pro 86.5%. Impressive for its price tier. Multimodal benchmarks are especially strong relative to cost. SWE-bench ~40%.
Research Meta
Last Evaluated
2026-04-01
Source Confidence
88%
Evaluation Method
LMSYS Arena, MMLU-Pro, multimodal evaluations, cost-quality Pareto analysis
Needs Re-evaluation
NoSources
- Google Gemini 2.5 Flash technical report
- LMSYS Chatbot Arena
- Artificial Analysis