GPT-5.4
OpenAIfrontierOpenAI's flagship model and one of the most capable general-purpose LLMs available. Natively multimodal with vision, audio, reasoning, tool use, computer use, and web search. Excels across virtually every dimension with a 1M token context window and 128K output.
Specifications
1M tokens
128K tokens
$2.50 / 1M tokens
$15.00 / 1M tokens
Fast (speed score: 7/10)
Capability Profile
Feature Support
Best Use Cases
Not Ideal For
Strengths
Weaknesses
Edge Cases & Notes
Provider Notes
Available via the OpenAI API with Tier 4+ access for full 1M context. Batch API at 50% discount. Azure OpenAI Service offers managed deployments with SLA. Rate limits scale with usage tier.
Benchmarks
Benchmark Notes
Top-3 on LMSYS Arena across all categories. MMLU-Pro 92.8%, HumanEval 95.1%. SWE-bench Verified 62.4%. Strong GPQA Diamond scores. Web-search grounding evaluation shows 94%+ factuality on current events.
Research Meta
Last Evaluated
2026-04-01
Source Confidence
93%
Evaluation Method
LMSYS Chatbot Arena, SWE-bench Verified, MMLU-Pro, GPQA Diamond, internal comparative testing
Needs Re-evaluation
NoSources
- OpenAI GPT-5.4 system card (Feb 2026)
- LMSYS Chatbot Arena leaderboard
- SWE-bench Verified leaderboard
- Artificial Analysis quality index