Llama 3.3 70B
Meta
budget
Meta's 70B dense model from the Llama 3.3 generation. Still widely used for self-hosted deployments thanks to its straightforward dense architecture, strong fine-tuning ecosystem, and proven reliability. No longer the most capable open-weight model, but arguably the most battle-tested, with broad community support.
Specifications
Context Window
128K tokens
Max Output
16.4K tokens
Input Price
$0.180 / 1M tokens
Output Price
$0.180 / 1M tokens
Speed
Fast (speed score: 8/10)
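As a rough illustration of how these figures combine, the sketch below estimates the cost of a single request and checks whether it fits the token limits. It assumes the two token figures are the context window and the output cap, and that the two prices are the input and output rates; the function names `estimate_cost` and `fits_context` are hypothetical, and the rounded "K" figures are taken at face value.

```python
# Figures copied from the specifications above. Their roles (context window,
# output cap, input price, output price) are assumptions, not confirmed labels.
CONTEXT_WINDOW = 128_000     # "128K tokens", taken as 128,000
MAX_OUTPUT = 16_400          # "16.4K tokens", taken as 16,400
INPUT_PRICE_PER_M = 0.18     # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 0.18    # USD per 1M output tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check the output cap and the combined prompt-plus-output budget."""
    within_output_cap = output_tokens <= MAX_OUTPUT
    within_window = input_tokens + output_tokens <= CONTEXT_WINDOW
    return within_output_cap and within_window
```

Under these assumptions, a request with 100,000 input tokens and 10,000 output tokens costs roughly two cents, while a 120,000-token prompt with the same output would exceed the window. Actual provider pricing and limits may differ from the listed figures.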
Capability Profile
Feature Support
Best Use Cases
Not Ideal For
Strengths
Weaknesses
Edge Cases & Notes
Provider Notes
Available through Together AI, Fireworks, Replicate, and Ollama, or self-hosted. One of the most widely deployed open-weight models. For new projects, consider Llama 4 Scout unless a dense architecture is specifically needed.
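For self-hosting, a first sizing question is how much memory the weights alone need. The sketch below is a back-of-the-envelope estimate, assuming exactly 70 billion parameters and common storage precisions; the helper `weight_memory_gb` is hypothetical, and the result excludes the KV cache, activations, and runtime overhead, so real requirements are higher.

```python
# Nominal parameter count: "70B" taken as 70 billion. The exact count may differ.
PARAMS = 70e9

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(precision: str, params: float = PARAMS) -> float:
    """Return weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        print(f"{precision}: {weight_memory_gb(precision):.0f} GB")
```

Because the model is dense, every parameter is used for every token, so this figure is a hard floor on the memory the weights occupy at a given precision.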
Benchmarks
Benchmark Notes
MMLU 86% and HumanEval 84.5% are solid for a 70B dense model, and GSM8K 91% shows good mathematical ability. Newer MoE models outperform it, but it remains competitive given its simplicity and maturity.
Research Meta
Last Evaluated
2026-02-01
Source Confidence
90%
Evaluation Method
Open LLM Leaderboard, LMSYS Arena, community fine-tune evaluations, self-hosting benchmarks
Needs Re-evaluation
No
Sources
- Meta Llama 3.3 technical report
- Open LLM Leaderboard
- LMSYS Chatbot Arena
- Community benchmarks and evaluations