Windows sandbox security for Codex agents

What Windows Sandbox Security Means for Codex

Windows sandbox security for Codex is a layered isolation approach that uses Windows security identifiers, access control lists, restricted tokens, and dedicated accounts to keep autonomous coding agents from escaping their session or hijacking the host system while they control desktop applications. OpenAI built this custom model because Codex needs direct access to a developer’s real tools, source code, and working directories, which rules out disposable virtual machines such as the built-in Windows Sandbox. Instead, the company treats Codex computer use as a controlled Windows security domain: the agent can run local commands, edit files, and drive GUI workflows, but only inside tightly scoped boundaries. This isolation lets developers run AI agent isolation workflows such as GUI tests or installer checks with less constant supervision, while still maintaining meaningful protection against rogue actions, privilege escalation, or lateral movement across the rest of the machine.

How OpenAI's Windows Sandbox Keeps Codex Agents in Check

SIDs, ACLs, and Restricted Tokens: The Unelevated Sandbox

OpenAI’s first Windows security architecture for Codex was an “unelevated sandbox” built from standard operating system primitives: security identifiers (SIDs), access control lists (ACLs), and write-restricted tokens. The team defined a synthetic SID called sandbox-write, which allowed write access only to specific locations such as the active workspace and user-approved directories. Sensitive paths, including Git metadata directories, remained locked behind ACL rules, preventing the agent from rewriting version control history or altering configuration files outside its lane. Restricted tokens further narrowed the agent’s effective rights, stripping unneeded privileges from the process even when it ran under a regular user account. According to OpenAI’s description, this combination gave Codex enough reach to support real development tasks while still stopping it from treating the filesystem like a playground, especially when autonomous coding agents needed to run for more than a few isolated commands.

Elevated Sandbox Accounts and Network Boundaries

To make AI agent isolation more reliable across workflows, OpenAI later redesigned the system into an “elevated sandbox” based on dedicated Windows accounts and stricter token controls. During setup, Codex creates local accounts such as CodexSandboxOffline and CodexSandboxOnline and runs commands inside those identities rather than under the developer’s primary login. Restricted tokens still apply, further trimming available privileges, while firewall rules can be attached to the sandbox accounts to limit or disable network access as needed. This architecture gives OpenAI a way to enforce both filesystem and networking boundaries without breaking common tools or IDEs. It also reduces the blast radius if a coding agent misbehaves: even if a task goes wrong, its actions are tied to isolated user profiles that do not share broad permissions or reusable credentials with the rest of the system.

Why Foreground-Only Execution Matters for Security

One of the most visible constraints in Codex computer use on Windows is that agents run on the active desktop only. The same graphical session that Codex controls cannot be used for normal work in parallel, which forces users to treat the machine as a task surface instead of a second screen for background automation. This rule sharply reduces the chance of hidden background activity: anything Codex does must be visible on the desktop, making it easier for developers to notice unexpected behavior. It also aligns with the intended use cases, such as GUI testing, installer verification, and bug reproduction, which inherently involve foreground interaction. Foreground-only execution, combined with sandbox tokens and SIDs, means an autonomous coding agent can drive real applications while still being constrained to transparent, observable actions rather than silent, long-running jobs that might quietly abuse system resources.

Desktop Control, Phone Oversight, and Safe Autonomy

OpenAI’s Windows security architecture aims to balance autonomy and safety instead of giving Codex raw desktop access. On the one hand, the agent can read the screen, click UI elements, type through workflows, and interact with local tools where the project context already lives. On the other hand, its abilities stop at the sandbox boundary enforced by SIDs, ACLs, restricted tokens, and dedicated accounts. The workflow extends to phones as well: developers can connect their PC from the ChatGPT mobile app, review diffs, test results, and screenshots, then approve or adjust actions without sitting at the desk. Because work still runs on the Windows host, the phone serves as a review surface rather than a separate execution environment. This combination of visible foreground control, remote oversight, and multi-layer isolation lets developers trust Codex computer use without hovering over every individual step.