DeepSeek R1
DeepSeekspecializedDeepSeek's reasoning-specialized model built on the 671B MoE architecture. Rivals OpenAI's o1 on reasoning benchmarks at a tiny fraction of the cost. Uses visible chain-of-thought reasoning (unlike o1/o3 where reasoning is hidden). Open-weight and fully inspectable.
Specifications
128K tokens
32K tokens
$0.550 / 1M tokens
$2.19 / 1M tokens
Slow (speed score: 4/10)
Capability Profile
Feature Support
Best Use Cases
Not Ideal For
Strengths
Weaknesses
Edge Cases & Notes
Provider Notes
Available through DeepSeek's API and third-party providers. Distilled versions available in multiple sizes. Self-hosting the full 671B model requires extensive GPU infrastructure. Data processing considerations apply.
Benchmarks
Benchmark Notes
GSM8K 97.5%. AIME ~79%. Rivaling o1 on most reasoning benchmarks. MMLU-Pro 90.8%. The most cost-efficient reasoning model available. Arena Elo reflects reasoning-specific evaluation.
Research Meta
Last Evaluated
2026-03-01
Source Confidence
88%
Evaluation Method
AIME, GPQA Diamond, GSM8K, MATH, LMSYS Arena (reasoning), cost-reasoning Pareto analysis
Needs Re-evaluation
NoSources
- DeepSeek R1 technical report
- LMSYS Chatbot Arena
- AIME evaluation results
- Open LLM Leaderboard