Models & Releases

The next evolution of the Agents SDK

New sandbox execution and model-native harness let developers build persistent agents that safely handle files and tools.

Deep Dive

OpenAI has rolled out a significant upgrade to its Agents SDK, marking a substantial step forward in making autonomous AI agents more practical and secure for real-world deployment. The centerpiece of this update is the introduction of native sandbox execution, a critical security feature that isolates agent operations from the host system. This sandbox environment allows developers to safely grant agents access to files, code execution, and external APIs without risking system integrity or data breaches. The new model-native harness further streamlines development by providing a standardized interface for agents to interact with OpenAI's models, including GPT-4 and the upcoming GPT-4o, ensuring consistent behavior and easier debugging.

Together, these features specifically target the challenge of building 'long-running' agents—AI systems designed to operate persistently over hours or days, rather than completing single-turn tasks. This capability is essential for complex workflows like automated data analysis, customer support bots that maintain context across sessions, or coding assistants that manage entire projects. By combining secure sandboxing with a robust harness, OpenAI is lowering the barrier for developers to create sophisticated agentic applications that can reliably and safely manipulate tools and data, moving AI assistants from simple chatbots toward becoming true autonomous collaborators.

Key Points
  • Adds native sandbox execution for secure isolation of agent operations from the host system.
  • Introduces a model-native harness for consistent interaction with OpenAI models like GPT-4 and GPT-4o.
  • Enables development of secure, persistent 'long-running' agents that can operate over extended periods.

Why It Matters

This makes building production-ready, autonomous AI agents that safely handle real-world tools and data significantly more accessible.