ENFR

Tech • IA • Crypto

Today Topics Videos Crypto Archives Favorites

Build Hour: Agents SDK

8/10

AIOpenAIMay 28, 2026 at 08:09 PM47:41

Audio player

0:00 / 0:00

TL;DR

OpenAI’s updated Agents SDK introduces a Codex-style harness, sandboxed execution, and persistent state features to make long-running, production-grade AI agents easier to build and deploy.

KEY POINTS

Agents are becoming long-running systems

AI models are increasingly capable of handling tasks over extended periods, from minutes to days. Internal tools have demonstrated agents running continuously for up to a week, completing complex workflows such as coding, data analysis, and security scanning. This shift is driving demand for infrastructure that supports sustained, autonomous operation.

Production deployment remains complex

Despite improved model capabilities, deploying agents in real-world systems presents challenges. Developers must balance performance with flexibility across model providers, manage state across failures, and handle orchestration logic. Issues like container crashes, state loss, and secure handling of secrets complicate scaling.

Codex-style harness improves orchestration

The SDK now integrates a Codex-inspired harness that automates the agent loop, including tool use, context updates, and task continuation. Features like asynchronous shell execution, command tracking, and automatic context compaction allow agents to operate continuously without manual orchestration.

Separation of harness and compute

A key architectural change splits the agent harness from the execution environment. Instead of coupling logic and compute in a single container, agents can run in ephemeral sandboxes while orchestration persists elsewhere. This enables recovery from failures and avoids state loss when containers terminate.

Sandboxed environments enable safe file operations

Agents can operate inside isolated sandbox environments with access to files, enabling tasks like code editing, document processing, and asset generation. These sandboxes can run locally via Docker or in cloud platforms such as Modal, Cloudflare, Vercel, and others, offering flexibility for development and production.

Persistent state with snapshotting and rehydration

The SDK introduces built-in state management by snapshotting both the file system and conversation history. These snapshots can be stored locally or in cloud storage like R2 and later reloaded, allowing agents to resume tasks seamlessly even after interruptions.

Skills system enables reusable capabilities

A new Skills API allows developers to package domain-specific logic, instructions, and resources into reusable units. Skills can be versioned, stored centrally, and loaded dynamically, enabling agents to perform specialized tasks such as tax preparation or content editing with consistent behavior.

Hosted containers simplify lightweight use cases

The Responses API now includes a hosted shell tool that spins up temporary containers for single tasks. Developers can upload files, execute code, and retrieve outputs in one call, offering a lightweight alternative to full agent deployment.

Custom tools and integrations are supported

Developers can extend agents with custom tools, defined as functions that the model can call automatically. These tools support validation, guardrails, timeouts, and dynamic enablement, enabling integration with external systems such as task trackers or databases.

Flexible storage via manifests and mounts

The SDK introduces a manifest system to define file system structure and data sources. Files can be copied into sandboxes or mounted from external storage like S3-compatible buckets, balancing performance with scalability depending on workload needs.

Multi-agent coordination is possible

While still evolving, the framework supports orchestrating multiple agents working in parallel. Coordination can occur through shared storage, messaging, or supervisory agents, with expectations that large-scale multi-agent systems will become more common.

CONCLUSION

The updated Agents SDK reflects a shift toward persistent, autonomous AI systems by combining structured orchestration, sandboxed execution, and built-in state management into a flexible developer framework.

Full transcript

More from AI