OpenAI GPT-5.4: 1 млн токенов и computer-use

◷ 4 min read 3/5/2026 by: Alexey, VibeCode

Main chat

A chat for vibe coders: news, guides, live cases, marketplace, and finding executors.

OpenAI GPT-5.4: 1 млн токенов и computer-use - обложка

On March 5, 2026, OpenAI officially unveiled the GPT-5.4, the most powerful and efficient frontier model to date. It is immediately available in ChatGPT (as GPT-5.4 Thinking and Pro), API and Codex. The main emphasis is on real professional use: complex code, agent workflow, working with documents, tables and full-fledged computer management.

In short: this is not just a “big model”, but a ready-made tool for developers that really speeds up work and reduces the number of errors.

What's new about GPT-5? 4

Context of up to 1 million tokens is ideal for long-term projects, large codebase analysis and long-term agent tasks.
Native computer-use – the model can manage the browser itself, click on coordinates, fill out forms, send letters and perform complex workflow (the demo with Playwright Interactive is impressive).
Upfront planning + steerability - now ChatGPT builds a response plan first and can adjust it along the way. Plus, preambles and mid-response instructions work much better.
** Improved multimodality** – better recognition of documents, tables, presentations and visuals. ChatGPT for Excel add-in.
Tool search and token efficiency – the model chooses the right tools and spends fewer tokens (faster and cheaper to use).

There are two versions:

*GPT-5.4 Thinking - for normal tasks (replaces GPT-5.2 Thinking).
GPT-5.4 Pro - maximum performance for the most complex cases (Pro/Enterprise only).

Benchmark: a direct comparison with previous models

OpenAI has published an official table comparing GPT-5.4 with GPT-5.3-Codex and GPT-5.2. The numbers speak for themselves – especially in agent tasks, computer-use, and professional work.

Бенчмарк	GPT-5.4	GPT-5.3-Codex	GPT-5.2
GDPval (wins or ties)	83.0%	70.9%	70.9%
SWE-Bench Pro (Public)	57.7%	56.8%	55.6%
OSWorld-Verified	75.0%	74.0%*	47.3%
Toolathlon	54.6%	51.9%	46.3%
BrowseComp	82.7%	77.3%	65.8%

*Note: A new image resolution setting was used for GPT-5.3-Codex.

Additional GPT-5.4 record results:

Spreadsheet modeling: 87.3% (was 68.4%)
Presentations (as measured by people): 68.0% preference vs. GPT-5. 2
WebArena-Verified: **67.3% **
MMMU-Pro: **81.2% **
ARC-AGI-1: 93.7% (was 86.2%)

Especially strongly increased performance in working with tables, presentations and agent tasks. For developers, that means fewer bugs in the code, faster prototyping, and more reliable autonomous agents.

Availability and prices

Now (gradual rollout from March 5):

ChatGPT Plus / Team / Pro – GPT-5.4 Thinking (GPT-5.2 Thinking will remain for another 3 months).
Pro and Enterprise - GPT-5.4 Pro.
API: gpt-5.4 and gpt-5.4-pro (Codex supports 1M context experimentally).

Prices rose (as expected):