NOW LET US – AI RAG SaaS Studio TP.HCM

Claude Sonnet 5


Anthropic has launched Claude Sonnet 5, its most agentic model yet, offering near-Opus 4.8 performance at a fraction of the cost.

Introducing Claude Sonnet 5

Claude Sonnet 5 is built to be the most agentic Sonnet model yet. It can make plans, use tools like browsers and terminals, and run autonomously at a level that, just a few months ago, required larger and more expensive models.

For many developers, the agentic AI era began with Sonnet-class models: Claude Sonnet 3.5, 3.6, and 3.7 were the first models that showed impressive skills in coding and tool use. More recently, though, the clearest gains in agentic capabilities have been in our Opus-class models.

Sonnet 5 narrows the gap: its performance is close to that of Opus 4.8, but at lower prices. It’s a substantial improvement over its predecessor, Sonnet 4.6, on important aspects of agentic performance like reasoning, tool use, coding, and knowledge work.

Our safety assessments found that Sonnet 5 shows an overall lower rate of undesirable behaviors than Sonnet 4.6, and is generally safer to use in agentic contexts. Evaluations also show that it has a much lower ability to perform cybersecurity tasks than our current Opus models.

From today, Claude Sonnet 5 is available across all plans: it is the default model for Free and Pro plans, and is available to Max, Team, and Enterprise users. It’s also available in Claude Code and on the Claude Platform, where it launches with introductory pricing of $2 per million input tokens and $10 per million output tokens through August 31, 2026, after which it will be priced at $3 per million input tokens and $15 per million output tokens. Developers can use claude-sonnet-5 via the Claude API.
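To make the pricing change concrete, here is a minimal Python sketch that estimates the cost of a workload before and after the introductory period ends. The prices and the August 31, 2026 cutoff come from the announcement above. The function name estimate_cost and the choice to treat August 31 as the last day of introductory pricing are assumptions made for illustration, not part of any official SDK.

```python
from datetime import date

# USD per million tokens, as quoted in the announcement.
INTRO_PRICES = {"input": 2.00, "output": 10.00}
STANDARD_PRICES = {"input": 3.00, "output": 15.00}

# Assumption: "through August 31, 2026" is read as inclusive of that day.
INTRO_END = date(2026, 8, 31)


def estimate_cost(input_tokens: int, output_tokens: int, on: date) -> float:
    """Return the estimated USD cost of a workload billed on the given date.

    Hypothetical helper for illustration only.
    """
    prices = INTRO_PRICES if on <= INTRO_END else STANDARD_PRICES
    total = input_tokens * prices["input"] + output_tokens * prices["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 5M input tokens and 1M output tokens.
    print(estimate_cost(5_000_000, 1_000_000, date(2026, 8, 31)))  # 20.0
    print(estimate_cost(5_000_000, 1_000_000, date(2026, 9, 1)))   # 30.0
```

For this example workload, the same job goes from $20 to $30 once standard pricing applies, a 50% increase that holds for any mix of input and output tokens because both rates rise by the same factor.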

Working with Claude Sonnet 5

The charts below compare the performance of Sonnet 5 with Sonnet 4.6 and Opus 4.8 at different effort levels on the agentic search evaluation BrowseComp and the computer use evaluation OSWorld-Verified. Sonnet 5 (orange line) is a strict improvement over Sonnet 4.6 (gray line) and covers a much wider range of cost-performance options than Opus 4.8 (yellow line). It provides substantially improved cost efficiency at medium effort; its higher-effort performance can match Opus 4.8 on some tasks. Between Sonnet 5 and Opus 4.8, users can adjust the effort level to find the right balance of cost and performance.

Feedback from our early access partners has been consistent: Sonnet 5 is much more agentic than its predecessors. Testers described how it finishes complex tasks where previous Sonnet models would stop short, how it checks its own output without explicitly be

© 2026 Now Let Us. All rights reserved.

Source: Hacker News
