mirror of https://github.com/openai/codex.git synced 2026-05-15 08:42:34 +00:00

Files

Matthew Zeng e15ecc9c35 Add production startup and TTFT telemetry (#22198 )

## Why

While investigating `codex exec hi` startup latency, the useful
questions were not "is startup slow?" but "which durable bucket is slow
in production?"

The path we observed has a few distinct stages:

1. `thread/start` creates the session
2. startup prewarm builds the turn context, tools, and prompt
3. startup prewarm warms the websocket
4. the first real turn resolves the prewarm
5. the model produces the first token

Before this PR, production telemetry had some of the raw measurements
already:

- aggregate startup-prewarm duration / age-at-first-turn metrics
- TTFT as a metric
- websocket request telemetry

But there was no coherent production event stream for the startup
breakdown itself, and TTFT was metric-only. That made it hard to answer
the same latency questions from OpenTelemetry-backed logs without adding
one-off local instrumentation.

## What changed

Add durable production telemetry on the existing `SessionTelemetry`
path:

- new `codex.startup_phase` OTel log/trace events plus
`codex.startup.phase.duration_ms`
- new `codex.turn_ttft` OTel log/trace events while preserving the
existing TTFT metric

The startup phase event is emitted for the coarse buckets we actually
observed while running `exec hi`:

- `thread_start_create_thread`
- `startup_prewarm_total`
- `startup_prewarm_create_turn_context`
- `startup_prewarm_build_tools`
- `startup_prewarm_build_prompt`
- `startup_prewarm_websocket_warmup`
- `startup_prewarm_resolve`

These phases are intentionally low-cardinality so they remain safe as
production telemetry tags.

## Why this shape

This keeps the instrumentation on the same production path as the rest
of the session telemetry instead of adding a local debug-only trace
mode. It also avoids changing startup behavior:

- prewarm still runs
- no control flow changes
- no extra remote calls
- no user-visible behavior changes

One boundary is intentional: very early process bootstrap that happens
before a session exists is not included here, because this PR uses
session-scoped production telemetry. The expensive buckets we were
trying to understand after `thread/start` are now covered durably.

## Verification

- `cargo test -p codex-otel`
- `cargo test -p codex-core turn_timing`
- `cargo test -p codex-core
regular_turn_emits_turn_started_without_waiting_for_startup_prewarm`
- `cargo test -p codex-core
interrupting_regular_turn_waiting_on_startup_prewarm_emits_turn_aborted`
- `cargo test -p codex-app-server thread_start`
- `just fix -p codex-otel -p codex-core -p codex-app-server`

I also ran `cargo test -p codex-core`; it built successfully and then
hit an existing unrelated stack overflow in
`tools::handlers::multi_agents::tests::tool_handlers_cascade_close_and_resume_and_keep_explicitly_closed_subtrees_closed`.

2026-05-11 23:58:36 +00:00

.cargo

fix: cargo deny (#20627 )

2026-05-01 18:15:38 +02:00

.config

…

.github/workflows

fix: cargo deny (#20627 )

2026-05-01 18:15:38 +02:00

agent-graph-store

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

agent-identity

…

analytics

Stop uploading accepted line fingerprints (#22180 )

2026-05-11 15:41:38 -07:00

ansi-escape

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

app-server

Add production startup and TTFT telemetry (#22198 )

2026-05-11 23:58:36 +00:00

app-server-client

[codex] request desktop attestation from app (#20619 )

2026-05-08 12:36:02 -07:00

app-server-daemon

Update codex remote-control to start the daemon (#22218 )

2026-05-11 15:38:30 -07:00

app-server-protocol

Add Windows hook command overrides (#22159 )

2026-05-11 22:22:29 +00:00

app-server-test-client

[codex] request desktop attestation from app (#20619 )

2026-05-08 12:36:02 -07:00

app-server-transport

app-server: remove TCP websocket listener (#21843 )

2026-05-11 10:17:26 -07:00

apply-patch

Support multi-environment apply_patch selection (#21617 )

2026-05-11 16:33:44 -07:00

arg0

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

async-utils

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

aws-auth

feat: enable AWS login credentials for Bedrock auth (#21623 )

2026-05-08 04:07:59 +00:00

backend-client

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

bwrap

fix(bwrap): emit libcap after standalone archive (#21285 )

2026-05-05 22:22:01 -07:00

chatgpt

Using cached connector directory for discoverable tools list (#21497 )

2026-05-08 14:14:11 -07:00

cli

Update codex remote-control to start the daemon (#22218 )

2026-05-11 15:38:30 -07:00

cloud-requirements

feat(connectors): support managed app tool approval requirements (#21061 )

2026-05-11 19:08:26 +00:00

cloud-tasks

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

cloud-tasks-client

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

cloud-tasks-mock-client

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

code-mode

Enable V8 sandboxing for source-built builds (#21146 )

2026-05-05 14:36:37 -07:00

codex-api

api: send hyphenated session and thread headers (#21757 )

2026-05-08 17:11:19 +02:00

codex-backend-openapi-models

Enable --deny-warnings for cargo shear (#21616 )

2026-05-08 20:29:00 +00:00

codex-client

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

codex-experimental-api-macros

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

codex-mcp

[elicitation] Advertise new url elicitation capability when auth_elicitation is enabled. (#22188 )

2026-05-11 12:23:55 -07:00

collaboration-mode-templates

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

config

Add Windows hook command overrides (#22159 )

2026-05-11 22:22:29 +00:00

connectors

Using cached connector directory for discoverable tools list (#21497 )

2026-05-08 14:14:11 -07:00

core

Add production startup and TTFT telemetry (#22198 )

2026-05-11 23:58:36 +00:00

core-api

extension: wire extension registries into sessions (#21737 )

2026-05-11 11:38:18 +02:00

core-plugins

Read cached metadata for installed Git plugins (#20825 )

2026-05-10 16:59:57 -07:00

core-skills

Remove skills list extra roots (#21485 )

2026-05-07 20:56:42 -07:00

debug-client

[codex] request desktop attestation from app (#20619 )

2026-05-08 12:36:02 -07:00

docs

[codex] Fix pathless thread summaries (#21266 )

2026-05-07 11:18:16 -07:00

exec

Add process-scoped SQLite telemetry (#22154 )

2026-05-11 11:32:40 -07:00

exec-server

fix(exec-server): suppress Windows taskkill output (#22058 )

2026-05-11 15:40:56 -03:00

execpolicy

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

execpolicy-legacy

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

ext

feat: wire extension tool bundles into core (#22147 )

2026-05-11 16:42:29 +02:00

external-agent-migration

…

external-agent-sessions

Import external agent sessions in background (#20284 )

2026-04-30 00:00:41 +00:00

features

feat: add network proxy feature flag (#20147 )

2026-05-11 14:12:00 -07:00

feedback

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

file-search

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

file-system

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

file-watcher

Move file watcher out of core (#21290 )

2026-05-08 18:19:23 -07:00

git-utils

Emit accepted line fingerprint analytics (#21601 )

2026-05-08 12:16:24 -07:00

hooks

Add Windows hook command overrides (#22159 )

2026-05-11 22:22:29 +00:00

install-context

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

keyring-store

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

linux-sandbox

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

lmstudio

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

[login] revoke superseded auth tokens on relogin (#21747 )

2026-05-11 13:36:46 -07:00

mcp-server

Support multi-environment apply_patch selection (#21617 )

2026-05-11 16:33:44 -07:00

memories

chore: drop built-in MCPs (#22173 )

2026-05-11 19:45:08 +02:00

message-history

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

model-provider

[codex] Delete function-style apply_patch (#21651 )

2026-05-08 13:00:57 -07:00

model-provider-info

feat: add Bedrock Mantle client agent header (#21840 )

2026-05-08 23:58:41 +00:00

models-manager

Update models.json (#21776 )

2026-05-08 21:37:23 +03:00

network-proxy

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

ollama

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

otel

Add production startup and TTFT telemetry (#22198 )

2026-05-11 23:58:36 +00:00

plugin

Enable --deny-warnings for cargo shear (#21616 )

2026-05-08 20:29:00 +00:00

process-hardening

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

protocol

Reapply "Move skills watcher to app-server" (#21652 )

2026-05-08 17:41:15 -07:00

realtime-webrtc

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

response-debug-context

Add safety check notification and error handling (#19055 )

2026-04-22 22:24:12 -07:00

responses-api-proxy

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

rmcp-client

fix(tui): suppress taskkill output for MCP teardown on Windows (#21759 )

2026-05-10 15:51:26 +00:00

rollout

Add process-scoped SQLite telemetry (#22154 )

2026-05-11 11:32:40 -07:00

rollout-trace

Reapply "Move skills watcher to app-server" (#21652 )

2026-05-08 17:41:15 -07:00

sandboxing

Fix rust-ci-full failures due to missing bwrap (#21604 )

2026-05-08 09:52:19 -07:00

scripts

Upgrade to rust 1.93 (#10080 )

2026-01-28 17:46:18 +00:00

secrets

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

shell-command

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

shell-escalation

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

skills

[codex] Coordinate OpenAI docs sample with API key setup (#21263 )

2026-05-06 13:46:15 -04:00

state

Add process-scoped SQLite telemetry (#22154 )

2026-05-11 11:32:40 -07:00

stdio-to-uds

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

terminal-detection

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

test-binary-support

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

thread-manager-sample

extension: wire extension registries into sessions (#21737 )

2026-05-11 11:38:18 +02:00

thread-store

Use goal preview metadata for goal-first threads (#21981 )

2026-05-11 10:12:46 -07:00

tool-api

feat: wire extension tool bundles into core (#22147 )

2026-05-11 16:42:29 +02:00

tools

feat: wire extension tool bundles into core (#22147 )

2026-05-11 16:42:29 +02:00

tui

Add Windows hook command overrides (#22159 )

2026-05-11 22:22:29 +00:00

uds

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

utils

Support openai library tool (#20293 )

2026-05-08 22:56:13 +00:00

v8-poc

Disable empty Cargo test targets (#21584 )

2026-05-07 15:44:17 -07:00

vendor

vendor: update bubblewrap to 0.11.2 (#21389 )

2026-05-06 18:10:30 +00:00

windows-sandbox-rs

Enable --deny-warnings for cargo shear (#21616 )

2026-05-08 20:29:00 +00:00

.gitignore

…

BUILD.bazel

…

Cargo.lock

daemon: refresh updater after validated binary rollout (#21853 )

2026-05-11 12:37:10 -07:00

Cargo.toml

chore: drop built-in MCPs (#22173 )

2026-05-11 19:45:08 +02:00

clippy.toml

…

config.md

…

default.nix

…

deny.toml

fix: cargo deny (#20627 )

2026-05-01 18:15:38 +02:00

README.md

revert legacy notify deprecation (#21152 )

2026-05-05 10:34:44 -07:00

rust-toolchain.toml

…

rustfmt.toml

…

README.md

Codex CLI (Rust Implementation)

We provide Codex CLI as a standalone executable to ensure a zero-dependency install.

Installing Codex

Today, the easiest way to install Codex is via npm:

npm i -g @openai/codex
codex

You can also install via Homebrew (brew install --cask codex) or download a platform-specific release directly from our GitHub Releases.

Documentation quickstart

First run with Codex? Start with docs/getting-started.md (links to the walkthrough for prompts, keyboard shortcuts, and session management).
Want deeper control? See docs/config.md and docs/install.md.

What's new in the Rust CLI

The Rust implementation is now the maintained Codex CLI and serves as the default experience. It includes a number of features that the legacy TypeScript CLI never supported.

Config

Codex supports a rich set of configuration options. Note that the Rust CLI uses config.toml instead of config.json. See docs/config.md for details.

Model Context Protocol Support

MCP client

Codex CLI functions as an MCP client that allows the Codex CLI and IDE extension to connect to MCP servers on startup. See the configuration documentation for details.

MCP server (experimental)

Codex can be launched as an MCP server by running codex mcp-server. This allows other MCP clients to use Codex as a tool for another agent.

Use the @modelcontextprotocol/inspector to try it out:

npx @modelcontextprotocol/inspector codex mcp-server

Use codex mcp to add/list/get/remove MCP server launchers defined in config.toml, and codex mcp-server to run the MCP server directly.

Notifications

You can enable notifications by configuring a script that is run whenever the agent finishes a turn. The notify documentation includes a detailed example that explains how to get desktop notifications via terminal-notifier on macOS. When Codex detects that it is running under WSL 2 inside Windows Terminal (WT_SESSION is set), the TUI automatically falls back to native Windows toast notifications so approval prompts and completed turns surface even though Windows Terminal does not implement OSC 9.

`codex exec` to run Codex programmatically/non-interactively

To run Codex non-interactively, run codex exec PROMPT (you can also pass the prompt via stdin) and Codex will work on your task until it decides that it is done and exits. If you provide both a prompt argument and piped stdin, Codex appends stdin as a <stdin> block after the prompt so patterns like echo "my output" | codex exec "Summarize this concisely" work naturally. Output is printed to the terminal directly. You can set the RUST_LOG environment variable to see more about what's going on. Use codex exec --ephemeral ... to run without persisting session rollout files to disk.

Experimenting with the Codex Sandbox

To test to see what happens when a command is run under the sandbox provided by Codex, we provide the following subcommands in Codex CLI:

# macOS
codex sandbox macos [--log-denials] [COMMAND]...

# Linux
codex sandbox linux [COMMAND]...

# Windows
codex sandbox windows [COMMAND]...

# Legacy aliases
codex debug seatbelt [--log-denials] [COMMAND]...
codex debug landlock [COMMAND]...

To try a writable legacy sandbox mode with these commands, pass an explicit config override such as -c 'sandbox_mode="workspace-write"'.

Selecting a sandbox policy via `--sandbox`

The Rust CLI exposes a dedicated --sandbox (-s) flag that lets you pick the sandbox policy without having to reach for the generic -c/--config option:

# Run Codex with the default, read-only sandbox
codex --sandbox read-only

# Allow the agent to write within the current workspace while still blocking network access
codex --sandbox workspace-write

# Danger! Disable sandboxing entirely (only do this if you are already running in a container or other isolated env)
codex --sandbox danger-full-access

The same setting can be persisted in ~/.codex/config.toml via the top-level sandbox_mode = "MODE" key, e.g. sandbox_mode = "workspace-write". In workspace-write, Codex also includes ~/.codex/memories in its writable roots so memory maintenance does not require an extra approval.

Code Organization

This folder is the root of a Cargo workspace. It contains quite a bit of experimental code, but here are the key crates:

core/ contains the business logic for Codex. Ultimately, we hope this becomes a library crate that is generally useful for building other Rust/native applications that use Codex.
exec/ "headless" CLI for use in automation.
tui/ CLI that launches a fullscreen TUI built with Ratatui.
cli/ CLI multitool that provides the aforementioned CLIs via subcommands.

If you want to contribute or inspect behavior in detail, start by reading the module-level README.md files under each crate and run the project workspace from the top-level codex-rs directory so shared config, features, and build scripts stay aligned.

README.md

Codex CLI (Rust Implementation)

Installing Codex

Documentation quickstart

What's new in the Rust CLI

Config

Model Context Protocol Support

MCP client

MCP server (experimental)

Notifications

codex exec to run Codex programmatically/non-interactively

Experimenting with the Codex Sandbox

Selecting a sandbox policy via --sandbox

Code Organization

`codex exec` to run Codex programmatically/non-interactively

Selecting a sandbox policy via `--sandbox`