Models · 28 May 2026 · 2 min read

Anthropic ships Claude Opus 4.8 with new agent controls

Anthropic's new Opus 4.8 model adds fine-grained controls for agentic workflows, giving builders more direct influence over cost, speed, and quality.

Pen-and-ink illustration: a conductor's baton poised over a. For the story "Anthropic ships Claude Opus 4.8 with new agent controls".
— Pen-and-ink illustration: a conductor's baton poised over a. For the story "Anthropic ships Claude Opus 4.8 with new agent controls". —

What happened

Anthropic has released Claude Opus 4.8, an update to its frontier model. According to the company's announcement, covered by AI News, the new model offers improved performance for coding, agent work, and reasoning. It's available through the claude-opus-4-8 API endpoint.

The release introduces new features like user-controlled 'effort' levels to manage token burn, dynamic workflows for large codebases, and the ability to update instructions to the Messages API mid-task. Pricing for standard mode remains the same, with a new, faster mode available at double the cost.

How the room's reading it

Early testers are focusing on the new agentic capabilities. The consensus among developers cited in early reports is that the new controls — like 'effort' settings and live API updates — offer more granular command over complex workflows. Infra teams are noting the new pricing tier for a 'fast mode', which creates an explicit trade-off between speed and cost.

Some early benchmarks from testers like CursorBench suggest the model achieves tasks in fewer steps, a potential efficiency gain. The comparison to a future GPT model on cost parity has also caught the attention of teams running production workloads, who see a path to managing expenses without sacrificing performance.

Sailfish's take

The benchmark improvements are table stakes. We think the real story here is Anthropic giving builders more direct control over the machine. Features like 'effort' control and live instruction updates are a welcome shift away from treating models as opaque black boxes. We've shipped enough agentic systems to know that managing cost and latency is just as important as output quality.

These new levers let you tune that trade-off directly. This isn't just a model update — it’s a sign that the tooling around models is finally starting to mature. We'd prioritise testing these new controls for cost optimisation before chasing raw performance gains.

Our take — your read?

Be the first to weigh in.

Sources
— END OF DISPATCH — Models