Gemini 3.1 Pro
GooglefrontierGoogle's latest reasoning-first frontier model, still in preview. Built from the ground up for agentic workflows with native planning, tool orchestration, and self-verification. Early benchmarks suggest it rivals Claude Opus 4.6 on coding and exceeds Gemini 2.5 Pro on reasoning.
Specifications
1.0M tokens
65.5K tokens
$2.00 / 1M tokens
$12.00 / 1M tokens
Moderate (speed score: 6/10)
Capability Profile
Feature Support
Best Use Cases
Not Ideal For
Strengths
Weaknesses
Edge Cases & Notes
Provider Notes
Available in preview through the Gemini API and Vertex AI. Not recommended for production workloads until GA. Expect API changes. Pricing is preliminary.
Benchmarks
Benchmark Notes
Preliminary benchmarks from Google: MMLU-Pro 93.5%, HumanEval 93%. Independent Arena evaluation places it near GPT-5.4 and Claude Opus 4.6. SWE-bench evaluation pending. Numbers may shift at GA.
Research Meta
Last Evaluated
2026-04-01
Source Confidence
72%
Evaluation Method
Preliminary Google benchmarks, early LMSYS Arena data, limited independent evaluation
Needs Re-evaluation
YesSources
- Google Gemini 3.1 Pro preview announcement (Mar 2026)
- Early LMSYS Chatbot Arena data
- Google I/O 2026 keynote