10 KiB
Overview of Protocol Defined in protocol.rs and the Codex engine in codex.rs.
The goal of this document is to define terminology used in the system and explain the expected behavior of the system.
NOTE: This document summarizes the protocol at a high level. The Rust types and enums in protocol.rs are the source of truth and may occasionally include additional fields or variants beyond what is covered here.
Entities
These are entities that exist on the Codex backend. The intent of this section is to establish vocabulary and construct a shared mental model for the Codex core system.
Model- In our case, this is the Responses REST API
Codex- The core engine of codex
- Runs locally, either in a background thread or separate process
- Communicated to via a queue pair – SQ (Submission Queue) / EQ (Event Queue)
- Takes user input, makes requests to the
Model, executes commands and applies patches.
Session- The
Codex's current configuration and state Codexstarts with noSession, and it is initialized byOp::ConfigureSession, which should be the first message sent by the UI.- The current
Sessioncan be reconfigured with additionalOp::ConfigureSessioncalls. - Any running execution is aborted when the session is reconfigured.
- The
Task- A
TaskisCodexexecuting work in response to user input. Sessionhas at most oneTaskrunning at a time.- Receiving
Op::UserInputstarts aTask - Consists of a series of
Turns - The
Taskexecutes to until:- The
Modelcompletes the task and there is no output to feed into an additionalTurn - Additional
Op::UserInputaborts the current task and starts a new one - UI interrupts with
Op::Interrupt - Fatal errors are encountered, eg.
Modelconnection exceeding retry limits - Blocked by user approval (executing a command or patch)
- The
- A
Turn- One cycle of iteration in a
Task, consists of:- A request to the
Model- (initially) prompt + (optional)last_response_id, or (in loop) previous turn output - The
Modelstreams responses back in an SSE, which are collected until "completed" message and the SSE terminates Codexthen executes command(s), applies patch(es), and outputs message(s) returned by theModel- Pauses to request approval when necessary
- A request to the
- The output of one
Turnis the input to the nextTurn - A
Turnyielding no output terminates theTask
- One cycle of iteration in a
The term "UI" is used to refer to the application driving Codex. This may be the CLI / TUI chat-like interface that users operate, or it may be a GUI interface like a VSCode extension. The UI is external to Codex, as Codex is intended to be operated by arbitrary UI implementations.
Agent identifiers
Every participant in a session (the root UI thread plus each spawned/forked child) is assigned a monotonically increasing numeric AgentId. Agent 0 is always the root thread. Subagents inherit their parent's AgentId as parent_agent_id so UIs can correlate trees even when conversations are forked or exported. These IDs are surfaced in SubagentSummary payloads and in a dedicated inbox event described below.
When a Turn completes, the response_id from the Model's final response.completed message is stored in the Session state to resume the thread given the next Op::UserInput. The response_id is also returned in the EventMsg::TurnComplete to the UI, which can be used to fork the thread from an earlier point by providing it in the Op::UserInput.
Each Session still runs at most one Task at a time. For parallel work, you can either run multiple Codex sessions or use subagents (via the subagent_* tools) to orchestrate multiple child sessions within a single daemon.
Subagent sessions run in parallel with the root thread, so you scale overlapping conversations without launching new daemons.
Enable the subagent_tools feature flag (see ../../docs/config.md#feature-flags) and tune how many child sessions stay active with max_active_subagents (../../docs/config.md#max_active_subagents).
Interface
Codex- Communicates with UI via a
SQ(Submission Queue) andEQ(Event Queue).
- Communicates with UI via a
Submission- These are messages sent on the
SQ(UI ->Codex) - Has an string ID provided by the UI, referred to as
sub_id Oprefers to the enum of all possibleSubmissionpayloads- This enum is
non_exhaustive; variants can be added at future dates
- This enum is
- These are messages sent on the
Event- These are messages sent on the
EQ(Codex-> UI) - Each
Eventhas a non-unique ID, matching thesub_idfrom theOp::UserInputthat started the current task. EventMsgrefers to the enum of all possibleEventpayloads- This enum is
non_exhaustive; variants can be added at future dates - It should be expected that new
EventMsgvariants will be added over time to expose more detailed information about the model's actions.
- This enum is
- These are messages sent on the
For complete documentation of the Op and EventMsg variants, refer to protocol.rs. Some example payload types:
OpOp::UserInput– Any input from the user to kick off aTaskOp::Interrupt– Interrupts a running taskOp::ExecApproval– Approve or deny code execution
EventMsgEventMsg::AgentMessage– Messages from theModelEventMsg::ExecApprovalRequest– Request approval from user to execute a commandEventMsg::TaskComplete– A task completed successfullyEventMsg::Error– A task stopped with an errorEventMsg::Warning– A non-fatal warning that the client should surface to the userEventMsg::TurnComplete– Contains aresponse_idbookmark for lastresponse_idexecuted by the task. This can be used to continue the task at a later point in time, perhaps with additional user input.
EventMsg::SubagentLifecycle– EmitsSubagentSummarypayloads that describe each child session, including itsagent_id,parent_agent_id, and current pending inbox counts. These lifecycle events are emitted whenever the daemon’s view of a subagent changes (creation, status/reasoning-header updates, or removal). They also persist in rollout files socodex resumecan rebuild prior subagent state—including attachments on spawn/fork and detach on cancel/prune—before replaying model turns.EventMsg::AgentInbox– Notifies the UI when a subagent’s inbox depth changes, for example after the parent sends an interrupt or a watchdog ping arrives. The payload includes the targetagent_id,session_id, and the counts of pending regular vs interrupt messages so UIs can render badges without polling. For example, if the root interrupts child agent3, the UI may receive anAgentInboxevent foragent_id = 3showing one pending interrupt message and zero regular messages.
Subagent tool reminders
subagent_awaitaccepts an optionaltimeout_scapped at 1,800 s (30 minutes). Omit it or pass0to use the 30-minute default. Eachtimeout_smust be at least 300 s (5 minutes); prefer 5–30 minute timeouts and use backoff (for example, 300s → 600s → 1,200s) so you can check on children, log progress, or deliver interrupts instead of parking for the full cap.subagent_logsis read-only and does not change a child’s state; prefer it when you only need to inspect recent activity without advancing the subagent.
The response_id returned from each task matches the OpenAI response_id stored in the API's /responses endpoint. It can be stored and used in future Sessions to resume threads of work.
Transport
Can operate over any transport that supports bi-directional streaming. - cross-thread channels - IPC channels - stdin/stdout - TCP - HTTP2 - gRPC
Non-framed transports, such as stdin/stdout and TCP, should use newline-delimited JSON in sending messages.
Example Flows
Sequence diagram examples of common interactions. In each diagram, some unimportant events may be eliminated for simplicity.
Basic UI Flow
A single user input, followed by a 2-turn task
sequenceDiagram
box UI
participant user as User
end
box Daemon
participant codex as Codex
participant session as Session
participant task as Task
end
box Rest API
participant agent as Model
end
user->>codex: Op::ConfigureSession
codex-->>session: create session
codex->>user: Event::SessionConfigured
user->>session: Op::UserInput
session-->>+task: start task
task->>user: Event::TaskStarted
task->>agent: prompt
agent->>task: response (exec)
task->>-user: Event::ExecApprovalRequest
user->>+task: Op::ExecApproval::Allow
task->>user: Event::ExecStart
task->>task: exec
task->>user: Event::ExecStop
task->>user: Event::TurnComplete
task->>agent: stdout
agent->>task: response (patch)
task->>task: apply patch (auto-approved)
task->>agent: success
agent->>task: response<br/>(msg + completed)
task->>user: Event::AgentMessage
task->>user: Event::TurnComplete
task->>-user: Event::TaskComplete
Task Interrupt
Interrupting a task and continuing with additional user input.
sequenceDiagram
box UI
participant user as User
end
box Daemon
participant session as Session
participant task1 as Task1
participant task2 as Task2
end
box Rest API
participant agent as Model
end
user->>session: Op::UserInput
session-->>+task1: start task
task1->>user: Event::TaskStarted
task1->>agent: prompt
agent->>task1: response (exec)
task1->>task1: exec (auto-approved)
task1->>user: Event::TurnComplete
task1->>agent: stdout
task1->>agent: response (exec)
task1->>task1: exec (auto-approved)
user->>task1: Op::Interrupt
task1->>-user: Event::Error("interrupted")
user->>session: Op::UserInput w/ last_response_id
session-->>+task2: start task
task2->>user: Event::TaskStarted
task2->>agent: prompt + Task1 last_response_id
agent->>task2: response (exec)
task2->>task2: exec (auto-approve)
task2->>user: Event::TurnCompleted
task2->>agent: stdout
agent->>task2: msg + completed
task2->>user: Event::AgentMessage
task2->>user: Event::TurnCompleted
task2->>-user: Event::TaskCompleted