Why developers are outraged by Claude Code and switching to OpenAI Codex with GPT-5.5
In 2025 and early 2026, Anthropic’s Claude Code was considered one of the most powerful tools for professional development. The terminal agent, built on Claude Opus/Sonnet, offered a huge context of up to 1 million tokens, a deep understanding of architecture, clean code and good autonomy. Many senior developers moved to it from Cursor and other tools, calling it the best choice for complex tasks.
Since March 2026, however, the situation has changed dramatically. After a series of updates, users began complaining en masse about a drop in quality. Discussions filled Reddit (r/ClaudeAI, r/ClaudeCode), Hacker News, X and GitHub Issues. Many canceled expensive Max subscriptions ($100 to $200 per month) and switched to alternatives, most notably OpenAI Codex, especially after the release of GPT-5.5 on April 23, 2026.
On April 23, Anthropic published a detailed post-mortem in which it acknowledged the problems. Quality had dropped not because of the model itself, but because of three changes in the product and the Agent SDK:
- Lowering the reasoning effort from high to medium to cut latency.
- A caching bug that caused the model to “forget” its previous reasoning on each iteration.
- A system prompt change that limited text between tool calls to 25 words and final answers to 100 words.
All three problems were fixed by April 20 (version v2.1.116), but by then the trust of many users had already been undermined. People felt cheated: for months, complaints had been ignored or blamed on “prompt issues.”
The main reasons for dissatisfaction with Claude Code
**Lower quality and lazy behavior.** The model began to produce superficial solutions, ignore instructions, leave tasks unfinished, add unnecessary code or delete code that was needed. AMD’s Senior Director of AI published an analysis of more than 6,800 sessions showing a 67% drop in thinking depth. Developers complained that the tool had become unsuitable for complex engineering work.
**Rapid consumption of limits and tokens.** Sessions burned through limits much faster than before. This was especially painful for Max users.
**Problems with stability.** The model often repeated steps, forgot context, and handled long sessions and the terminal worse than before.
**Strict safety policies and refusals.** Claude is traditionally more cautious than OpenAI models. In Claude Code, refusals are even more noticeable on relatively harmless tasks.
**Anthropic’s communication.** The company stayed silent for a long time, and when it did respond, many considered the explanations belated and incomplete. This seriously eroded trust.
Why developers are switching to Codex with GPT-5.5
The release of GPT-5.5 (and GPT-5.5 Pro) on April 23, 2026 was a powerful migration catalyst. The new model was immediately available in ChatGPT and Codex.
The main advantages of Codex + GPT-5.5 according to the latest developer reviews:
- High autonomy on long and complex tasks (refactoring large codebases, running tests, creating PRs).
- Significantly fewer refusals on coding tasks.
- GPT-5.5 uses significantly fewer output tokens for similar tasks.
- Strong results in agentic and terminal tasks. On Terminal-Bench 2.0, GPT-5.5 shows 82.7%, while Claude Opus 4.7 shows 69.4% (a gap of 13 points).
- A convenient cloud sandbox, support for running several agents in parallel, and integration with the OpenAI ecosystem.
- More generous and predictable limits compared to the recent experience with Claude Code.
Many developers write things like: “Work that used to take Claude hours and still needed edits gets done quickly and accurately in Codex with GPT-5.5.”
Comparison as of late April 2026
| Aspect | Claude Code (Opus 4.7) | Codex (GPT-5.5) | Who wins more often |
|---|---|---|---|
| Deep reasoning and architecture | Stronger | Good, but sometimes weaker | Claude |
| Autonomous work (execution) | Average, often needs supervision | Excellent, better on long tasks | Codex |
| Terminal and agentic tasks | Good | Significantly better (Terminal-Bench: +13 points) | Codex |
| Solving real GitHub issues (SWE-Bench Pro) | Better (64.3%) | 58.6% | Claude |
| Code cleanliness and safety | Used to be strong, now has problems | More stable right now | Codex (for now) |
| Token efficiency | Higher consumption | Better (savings of up to 40%) | Codex |
| Number of refusals | Stricter | More lenient | Codex |
| Price/value under heavy use | Expensive at high consumption | Often better value | Codex |
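
A note on reading the benchmark figures: a gap in percentage points is not the same as a relative gap in percent. The short sketch below recomputes both from the scores quoted in this article. The helper name `gap` and the dictionary layout are illustrative only and do not come from any benchmark tooling.

```python
# Benchmark scores quoted in this article (percent of tasks solved).
SCORES = {
    "Terminal-Bench 2.0": {"GPT-5.5": 82.7, "Claude Opus 4.7": 69.4},
    "SWE-Bench Pro": {"GPT-5.5": 58.6, "Claude Opus 4.7": 64.3},
}


def gap(scores: dict[str, float]) -> tuple[str, float, float]:
    """Return (leader, gap in percentage points, gap relative to the trailing score in %)."""
    leader = max(scores, key=scores.get)
    trailer = min(scores, key=scores.get)
    points = scores[leader] - scores[trailer]
    relative = points / scores[trailer] * 100
    # Round for display; the raw float difference carries binary noise.
    return leader, round(points, 1), round(relative, 1)


for name, scores in SCORES.items():
    leader, points, relative = gap(scores)
    print(f"{name}: {leader} leads by {points} points ({relative}% relative)")
```

On Terminal-Bench 2.0 the lead works out to about 13.3 points, which is roughly 19% in relative terms, so a "13%" figure for that benchmark is best read as percentage points.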
The most popular approach among experienced developers is hybrid:
- Claude Opus 4.7 for deep architecture planning, codebase analysis and code review.
- Codex with GPT-5.5 for heavy autonomous work, debugging, DevOps and long-running tasks.
- Cursor or a similar IDE as a convenient shell with quick switching between models.
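The hybrid split above can be written down as a simple routing table, which some teams may find useful as a shared convention. This is only a sketch: the category names, the `Tool` enum and `pick_tool` are hypothetical and are not part of Claude Code, Codex or Cursor.

```python
from enum import Enum


class Tool(Enum):
    CLAUDE = "Claude Opus 4.7"
    CODEX = "Codex with GPT-5.5"


# Hypothetical task categories, following the split described in the list above.
ROUTES = {
    "architecture": Tool.CLAUDE,
    "codebase_analysis": Tool.CLAUDE,
    "code_review": Tool.CLAUDE,
    "autonomous_work": Tool.CODEX,
    "debugging": Tool.CODEX,
    "devops": Tool.CODEX,
    "long_running": Tool.CODEX,
}


def pick_tool(task_kind: str) -> Tool:
    """Return the tool for a task category; fail loudly on unknown ones so a human decides."""
    try:
        return ROUTES[task_kind]
    except KeyError:
        raise ValueError(f"no route for task kind: {task_kind!r}") from None


print(pick_tool("code_review").value)
print(pick_tool("devops").value)
```

Raising on an unknown category, rather than falling back to a default tool, keeps the choice explicit for tasks the table does not cover.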
Important details
- Not all tasks are the same. For frontend work and fast iterations, Claude often remains cleaner and more accurate. For backend, legacy code, large migrations and autonomous tasks, Codex with GPT-5.5 is usually more reliable.
- Quality depends heavily on prompting. Structured approaches give the best results: separate plan → implementation → testing → review stages.
- Both tools require mandatory human review. AI still makes mistakes on complex edge cases and subtle architectural decisions.
- The market is changing very quickly. Anthropic has already fixed major issues and promises improvements. OpenAI continues to actively develop Codex.
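The plan → implementation → testing → review sequence mentioned above can be sketched as a pipeline that refuses to move on when a stage fails. Everything here is hypothetical scaffolding: `StageResult`, `run_workflow` and the stub stages stand in for whatever prompts or tool calls a team actually uses, and no real agent API is involved.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class StageResult:
    ok: bool
    notes: str


# A stage takes the task description and reports whether it succeeded.
Stage = Callable[[str], StageResult]

STAGE_ORDER = ("plan", "implementation", "testing", "review")


def run_workflow(task: str, stages: dict[str, Stage]) -> list[tuple[str, StageResult]]:
    """Run the stages in fixed order and stop at the first one that fails."""
    history: list[tuple[str, StageResult]] = []
    for name in STAGE_ORDER:
        result = stages[name](task)
        history.append((name, result))
        if not result.ok:
            break  # later stages are skipped until this one is fixed
    return history


# Stub stages for illustration: testing fails, so review never runs.
def passing(task: str) -> StageResult:
    return StageResult(ok=True, notes=f"done: {task}")


def failing_tests(task: str) -> StageResult:
    return StageResult(ok=False, notes="2 tests failed")


history = run_workflow(
    "migrate auth module",
    {"plan": passing, "implementation": passing, "testing": failing_tests, "review": passing},
)
print([name for name, _ in history])  # ['plan', 'implementation', 'testing']
```

Keeping the stages separate makes it obvious where a run stopped, which is the point at which a human reviewer would normally step in.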
Developer recommendations
- Test on your own real projects. Compare Claude Code and Codex with GPT-5.5 side by side.
- A hybrid approach is currently the most effective option for most senior developers.
- Break big tasks down into clear steps and always check the generated code carefully.
- Keep an eye on model updates: everyone has regressions, but response speed and transparency vary.
Conclusion
The outrage over Claude Code in the spring of 2026 is not just hype but a real loss of trust, caused by accumulated technical problems, strict limits and a lack of timely communication from Anthropic. The release of GPT-5.5 made Codex noticeably stronger in key autonomous-work scenarios, especially terminal and long-running tasks.
There is no universally “best” tool. The best developers today do not pick a single model; they competently combine the strengths of Claude (deep thinking) and Codex with GPT-5.5 (autonomy and efficiency). Those who master this stack will get the most productivity in 2026.