Anthropic Turns Agent Infra Into an OS

TL;DR

Anthropic shipped Managed Agents, a hosted service that virtualizes agent sessions, harnesses, and sandboxes into swappable interfaces. The "pet container" problem is dead. p50 time-to-first-token dropped 60%.
UC Berkeley broke every major AI agent benchmark without solving a single task. 100% on SWE-bench Verified with a 10-line conftest.py. The scorecard the industry uses to pick models is exploitable.
AWS Agent Registry (our Tool of the Week) launches in Bedrock AgentCore preview. Think npm registry for AI agents: centralized discovery, versioning, governance, and compliance metadata across your org.

The Big One

Anthropic Managed Agents: Your Agent Infra Is Now an OS

Every team building production agents hits the same wall. You put the model, the harness, and the sandbox in one container. The container becomes a pet. If it dies, the session is gone. If it hangs, you can't debug it because user data lives in the same box. Anthropic just shipped the fix.

Managed Agents virtualizes three agent components into independent interfaces: a session (append-only event log), a harness (the Claude loop that routes tool calls), and a sandbox (code execution). Each can fail or be replaced without disturbing the others. The analogy is deliberate: operating systems virtualized hardware into process and file abstractions. Managed Agents does the same for agent infrastructure.

Read the full issue

Subscribe to The Agentic Engineer to unlock this and every future issue.