Windows sandbox security for Codex Computer Use

What Codex Computer Use on Windows Actually Is

Codex Computer Use on Windows is a system where an autonomous AI coding agent can control desktop applications within a tightly restricted sandbox that balances secure code execution with practical developer workflows. Instead of running inside a full virtual machine, Codex operates directly on the user’s Windows desktop, reading the screen, clicking UI elements, and typing through tasks such as GUI testing, installer checks, and bug reproduction. This foreground-only design turns the active Windows session into the “task surface,” meaning you hand over the current desktop while the AI agent works. At the same time, a connected phone running ChatGPT becomes a control panel for approvals, screenshots, and follow-up instructions. The result is a hybrid workflow: AI agent isolation on the PC, with human supervision and guidance happening comfortably from a mobile device.

How OpenAI Built a Secure Sandbox for Windows AI Agents

Why OpenAI Needed Its Own Windows Sandbox

OpenAI found that off‑the‑shelf Windows sandbox security tools did not map cleanly to the needs of autonomous AI agents. Windows Sandbox, for example, gives strong isolation through a disposable virtual machine, but it cannot access the developer’s real tools, repositories, or active desktop, and it is not available on every Windows edition. Mandatory Integrity Control helps limit privilege, but alone it does not describe a full environment where an AI agent can read and edit code while keeping the rest of the system safe. According to an OpenAI LinkedIn post, this work “helps make Codex on Windows both powerful and secure, enabling developers to use coding agents in real-world environments with greater confidence.” That goal forced the team to combine several primitives into a custom architecture rather than rely on a single security feature.

Unelevated Sandbox: SIDs, ACLs, and Restricted Tokens

The first design, called the unelevated sandbox, combined Windows security identifiers (SIDs), access control lists (ACLs), and write‑restricted tokens to build AI agent isolation on top of the normal desktop. OpenAI added a synthetic SID, often described as sandbox‑write, that allowed write access only to explicitly chosen locations such as the current project workspace. Sensitive directories, including Git metadata, remained protected by ACLs so Codex could not silently rewrite internal repository state. Restricted tokens removed dangerous privileges from the agent’s process, reducing what it could do even if it ran within the user’s session. This approach gave fine‑grained control over where the AI could write, but it still shared the same account and broader environment as the developer, making it harder to cleanly separate network policies or completely wall off background resources.

Elevated Sandbox: Dedicated Accounts and Network Control

To improve Windows sandbox security, OpenAI redesigned the system into an elevated sandbox that creates dedicated local accounts, such as CodexSandboxOffline and CodexSandboxOnline, during setup. Commands now run under these isolated accounts using restricted tokens, so the AI’s processes are no longer tied to the developer’s main login. This makes filesystem ACLs easier to enforce and lets OpenAI define narrower permission sets for each sandbox identity. Network access is controlled with firewall rules, so one account can run offline tasks while another is allowed controlled online activity. This dual focus on filesystem and networking boundaries supports secure code execution while keeping compatibility with common development tools. As one commenter observed, “Every other coding agent treats your filesystem like a playground,” but this architecture limits what Codex can touch without constant human micromanagement.

Foreground-Only Control and Phone-Based Oversight

Even with stronger isolation, Codex Computer Use on Windows still runs only on the active desktop. The agent takes over the foreground session, so users cannot keep working normally in the same Windows session while it controls another app. This constraint is deliberate: deliberate runs such as GUI testing and installer checks are better suited to a foreground environment than to quiet background jobs with opaque side effects. The Windows machine remains the execution surface, while the phone becomes the oversight surface. Developers connect a PC from the ChatGPT mobile app, review approvals, diffs, screenshots, and terminal output, then send new instructions without returning to the desk. This workflow lets you start a test run, walk away with your phone, and still keep a human in the loop, while the sandboxed agent stays within well‑defined security boundaries.