Go back

Anthropic Releases Claude Opus 4.5, Outperforms Humans in Engineering Tests

Anthropic Releases Claude Opus 4.5, Outperforms Humans in Engineering Tests

Anthropic’s Claude Opus 4.5 beats all human candidates in internal engineering tests, escalating the AI capability race with Gemini 3 and others.

Anthropic launched Claude Opus 4.5 in early December 2025, achieving a new benchmark by outperforming all human job candidates in the company’s internal engineering evaluations. This release, highlighted in a December 4 HumAI digest, sets a record in AI capabilities and intensifies pressure in the ‘AI capability race’ following launches like Google’s Gemini 3, Nano Banana Pro, and Flux 2.

The model’s superior performance reframes expectations for top-tier LLMs, challenging regulators, governments, and companies to adapt to accelerating advancements. It contributes to a flurry of December model releases, driving faster innovation cycles across major labs.

Source: etcjournal.com


Share this post on:

Previous Post
MIT's DisCIPL Enables Small LMs to Match GPT-4o in Reasoning Efficiency
Next Post
OpenAI Launches GPT-5.2 After 'Code Red' Sprint

Related Posts