Sonnet 5 launches: Opus performance at lower cost

Tool of the Week

Claude Sonnet 5 launches on Vercel AI Gateway

Sonnet 5 reaches Opus-level coding performance at Sonnet pricing; set `model` to `anthropic/claude-sonnet-5` in AI SDK to access it.

Reduces model selection friction for agentic workloads—you can now skip Opus for most tasks, cutting inference costs by 50–67% through August 31. Stronger document parsing and long-context handling directly improve RAG and multi-turn workflows.

Replaces Sonnet 4.6 for coding/agentic work. Requires updating model identifier in AI SDK; zero breaking changes. Launch pricing ($2/$10 per M tokens) expires end-August, then rises to $3/$15—migrate proofs-of-concept now while discounted. Ready immediately.

“Sonnet 5 improves on Sonnet 4.6 across coding and agentic work, reaching outcomes on many tasks that previously needed an Opus model, at Sonnet pricing.”
“Launch pricing of $2 per million input tokens and $10 per million output tokens runs through August 31, 2026.”
“The model is more agentic and follows instructions more closely. Document parsing and long-context memory use are also stronger.”
“set `model` to `anthropic/claude-sonnet-5` in the AI SDK”

claude-sonnet-5vercel-ai-gatewayllm-inferenceagentic-aipricing

Dev Signal

Get issues like this in your inbox — free, every weekday.

Quick Signals

Sonnet 5 closes Opus gap at lower cost

Claude Sonnet 5 matches Opus 4.8 performance on agentic tasks—planning, tool use, coding—at $2/$10 per million tokens, replacing Sonnet 4.6 as the default reasoning model across all plans.

Developers can now deploy multi-step autonomous workflows (bug fixes, data exploration, form automation) without paying Opus prices. Early testers report tasks that previously stalled midway now complete end-to-end, reducing manual intervention in agent loops.

Drop-in replacement for Sonnet 4.6 via `claude-sonnet-5` API endpoint. Requires zero integration changes; pricing is lower through August 31 2026 then steps to $3/$15. Worth migrating existing agents immediately if you're hitting Sonnet 4.6 limits on brownfield code, tool use, or multi-step reasoning. Start with staging deployment to verify your cost-per-task improvement.

“its performance is close to that of Opus 4.8, but at lower prices”
“It's a substantial improvement over its predecessor, Sonnet 4.6, on important aspects of agentic performance like reasoning, tool use, coding, and knowledge work”
“it is the default model for Free and Pro plans”
“Sonnet 5 is much more agentic than its predecessors”
“Claude Sonnet 5 was never able to develop a full working exploit”
“available across all plans”

claude-sonnet-5agentic-aimulti-step-agentscost-optimizationtool-use

Copilot integrates natively into JetBrains IDEs

GitHub Copilot moves from ACP Registry plugin to native agent in JetBrains, no setup required, but requires separate GitHub Copilot subscription.

Data Point

AI agents fail framework migration despite code generation wins

ScarfBench benchmark reveals frontier agents achieve less than 10% behavioral success on Java framework migrations, exposing that compilation success masks deployment and runtime failures.

Before deploying AI-assisted modernization to production, you need realistic benchmarks. ScarfBench exposes that agents are overconfident in their own success—Claude reported 29/30 builds succeeded when only 22 actually built—and the real work is dependency resolution across config, infrastructure, and runtime layers, not source translation.

This doesn't replace your modernization strategy yet. Agents solve portions of migration but cannot independently validate outcomes. Use ScarfBench to benchmark your own tools before production deployment; expect to own build validation, configuration tuning, and environmental troubleshooting regardless of agent success rates.

“Even the strongest current agents achieve less than 10% behavioral success”
“Claude Code reported successful builds for 29 out of 30 whole applications. Only 22 of those applications actually built successfully”
“agents repeatedly returned to configuration-related artifacts while resolving framework differences and dependency issues”
“Migration difficulty depends strongly on the target framework, with Jakarta EE proving particularly challenging”
“The biggest challenge in framework modernization is not translating Java code. It is managing the web of dependencies across configuration, infrastructure, and runtime environments”

java-modernizationframework-migrationbenchmarkai-agentsvalidation

Enjoying Dev Signal? Get every issue in your inbox.

Free forever · 3 issues a week · One-click unsubscribe

Refer a friend →

Earn rewards for every developer you bring in.

Go premium →

Sponsor-free feed · full archive search · $149 lifetime.

Sonnet 5 launches: Opus performance at lower cost

Claude Sonnet 5 launches on Vercel AI Gateway

Quick Signals

Sonnet 5 closes Opus gap at lower cost

Copilot integrates natively into JetBrains IDEs

AI agents fail framework migration despite code generation wins

Nano Banana 2 Lite ships on AI Gateway

Claude Sonnet 5 completes all GitLab benchmark tasks

ADK for Go 2.0 adds graph-based workflow engine