codex

mirror of https://github.com/openai/codex.git synced 2026-04-30 17:36:40 +00:00

Author	SHA1	Message	Date
jif-oai	12f0e0b0eb	chore: merge name and title (#17116 ) Merge title and name concept to leverage the sqlite title column and have more efficient queries --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-09 18:44:26 +01:00
jif-oai	c0b5d8d24a	Skip local shell snapshots for remote unified exec (#17217 ) ## Summary - detect remote exec-server sessions in the unified-exec runtime - bypass the local shell-snapshot bootstrap only for those remote sessions - preserve existing local snapshot wrapping, PowerShell UTF-8 prefixing, sandbox orchestration, and zsh-fork handling ## Why The shell snapshot file is currently captured and stored next to Core. If Core wraps a remote command with `. /path/to/local/snapshot`, the process starts on the executor and tries to source a path from the orchestrator filesystem. This keeps remote commands from receiving that known-local path until shell snapshots are captured/restored on the executor side. ## Validation - `just fmt` - `git diff --check` - `cargo test -p codex-core --lib tools::runtimes::tests` Co-authored-by: Codex <noreply@openai.com>	2026-04-09 17:30:18 +01:00
Eric Traut	598d6ff056	Render statusline context as a meter (#17170 ) Problem: The statusline reported context as an “X% left” value, which could be mistaken for quota, and context usage was included in the default footer. Solution: Render configured context status items as a filling context meter, preserve `context-used` as a legacy alias while hiding it from the setup menu, and remove context from the default statusline. It will still be available as an opt-in option for users who want to see it. <img width="317" height="39" alt="image" src="https://github.com/user-attachments/assets/3aeb39bb-f80d-471f-88fe-d55e25b31491" />	2026-04-09 07:52:07 -07:00
jgershen-oai	8f705b0702	[codex] Defer steering until after sampling the model post-compaction (#17163 ) ## Summary - keep pending steered input buffered until the active user prompt has received a model response - keep steering pending across auto-compact when there is real model/tool continuation to resume - allow queued steering to follow compaction immediately when the prior model response was already final - keep pending-input follow-up owned by `run_turn` instead of folding it into `SamplingRequestResult` - add regression coverage for mid-turn compaction, final-response compaction, and compaction triggered before the next request after tool output ## Root Cause Steered input was drained at the top of every `run_turn` loop. After auto-compaction, the loop continued and immediately appended any pending steer after the compact summary, making a queued prompt look like the newest task instead of letting the model first resume interrupted model/tool work. ## Implementation Notes This patch keeps the follow-up signals separated: - `SamplingRequestResult.needs_follow_up` means model/tool continuation is needed - `sess.has_pending_input().await` means queued user steering exists - `run_turn` computes the combined loop condition from those two signals In `run_turn`: ```rust let has_pending_input = sess.has_pending_input().await; let needs_follow_up = model_needs_follow_up \|\| has_pending_input; ``` After auto-compact we choose whether the next request may drain steering: ```rust can_drain_pending_input = !model_needs_follow_up; ``` That means: - model/tool continuation + pending steer: compact -> resume once without draining steer - completed model answer + pending steer: compact -> drain/send the steer immediately - fresh user prompt: do not drain steering before the model has answered the prompt once The drain is still only `sess.get_pending_input().await`; when `can_drain_pending_input` is false, core uses an empty local vec and leaves the steer pending in session state. ## Validation - PASS `cargo test -p codex-core --test all steered_user_input -- --nocapture` - PASS `just fmt` - PASS `git diff --check` - NOT PASSING HERE `just fix -p codex-core` currently stops before linting this change on an unrelated mainline test-build error: `core/src/tools/spec_tests.rs` initializes `ToolsConfigParams` without `image_generation_tool_auth_allowed`; this PR does not touch that file.	2026-04-09 02:08:41 -07:00
Ahmed Ibrahim	84a24fe333	make webrtc the default experience (#17188 ) ## Summary - make realtime default to the v2 WebRTC path - keep partial realtime config tables inheriting `RealtimeConfig::default()` ## Validation - CI found a stale config-test expectation; fixed in `974ba51bb3` - just fmt - git diff --check --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-08 23:52:32 -07:00
Ahmed Ibrahim	1fdb695e42	Default realtime startup to v2 model (#17183 ) - Default realtime sessions to v2 and gpt-realtime-1.5 when no override is configured. - Add Op::RealtimeConversationStart integration coverage and keep v1-specific tests explicit. --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-08 22:11:30 -07:00
Eric Traut	6dc5391c7c	Add TUI notification condition config (#17175 ) Problem: TUI desktop notifications are hard-gated on terminal focus, so terminal/IDE hosts that want in-focus notifications cannot opt in. Solution: Add a flat `[tui] notification_condition` setting (`unfocused` by default, `always` opt-in), carry grouped TUI notification settings through runtime config, apply method + condition together in the TUI, and regenerate the config schema.	2026-04-08 21:50:02 -07:00
Ahmed Ibrahim	2f9090be62	Add realtime voice selection (#17176 ) - Add realtime voice selection for realtime/start. - Expose the supported v1/v2 voice lists and cover explicit, configured, default, and invalid voice paths.	2026-04-08 20:19:15 -07:00
Ahmed Ibrahim	4c2a1ae31b	Move default realtime prompt into core (#17165 ) - Adds a core-owned realtime backend prompt template and preparation path. - Makes omitted realtime start prompts use the core default, while null or empty prompts intentionally send empty instructions. - Covers the core realtime path and app-server v2 path with integration coverage. --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-08 19:34:40 -07:00
Eric Traut	36586eafed	Fix stale thread-name resume lookups (#16646 ) Addresses #15943 Problem: Name-based resume could stop on a newer session_index entry whose rollout was never persisted, shadowing an older saved thread with the same name. Solution: Materialize rollouts before indexing thread names and make name lookup skip unresolved entries until it finds a persisted rollout.	2026-04-08 18:51:29 -07:00
Leo Shimonaka	01537f0bd2	Auto-approve MCP server elicitations in Full Access mode (#17164 ) Currently, when a MCP server sends an elicitation to Codex running in Full Access (`sandbox_policy: DangerFullAccess` + `approval_policy: Never`), the elicitations are auto-cancelled. This PR updates the automatic handling of MCP elicitations to be consistent with other approvals in full-access, where they are auto-approved. Because MCP elicitations may actually require user input, this mechanism is limited to empty form elicitations. ## Changeset - Add policy helper shared with existing MCP tool call approval auto-approve - Update `ElicitationRequestManager` to auto-approve elicitations in full access when `can_auto_accept_elicitation` is true. - Add tests Co-authored-by: Codex <noreply@openai.com>	2026-04-08 16:41:02 -07:00
maja-openai	dcbc91fd39	Update guardian output schema (#17061 ) ## Summary - Update guardian output schema to separate risk, authorization, outcome, and rationale. - Feed guardian rationale into rejection messages. - Split the guardian policy into template and tenant-config sections. ## Validation - `cargo test -p codex-core mcp_tool_call` - `env -u CODEX_SANDBOX_NETWORK_DISABLED INSTA_UPDATE=always cargo test -p codex-core guardian::` --------- Co-authored-by: Owen Lin <owen@openai.com>	2026-04-08 15:47:29 -07:00
Ahmed Ibrahim	794a0240f9	Attach WebRTC realtime starts to sideband websocket (#17057 ) Summary: - parse the realtime call Location header and join that call over the direct realtime WebSocket - keep WebRTC starts alive on the existing realtime conversation path Validation: - just fmt - git diff --check - cargo check -p codex-api - cargo check -p codex-core --tests - local cargo tests not run; relying on PR CI	2026-04-08 15:25:42 -07:00
canvrno-oai	58ad79b60e	Fix missing fields (#17149 ) Fix missing `image_generation_tool_auth_allowed` in two locations.	2026-04-08 13:53:53 -07:00
pakrym-oai	e4d6702b87	[codex] Support remote exec cwd in TUI startup (#17142 ) When running with remote executor the cwd is the remote path. Today we check for existence of a local directory on startup and attempt to load config from it. For remote executors don't do that.	2026-04-08 13:09:28 -07:00
Won Park	e003f84e1e	release ready, enabling only for siwc users (#17046 ) Disabling Image-Gen for Non-SIWC Codex Users We are only enabling image-gen feature for SIWC Codex users until there comes a fix in ResponsesAPI to omit output from responses.completed, to prevent the following issues: 1. websocket blows up due to heavier load (images) than before (text) 2. http parser streams through n^2 of n-base64 bytes (sum of base64s of all images generated in turn) that causes long delays in turn_completion.	2026-04-08 11:22:39 -07:00
Owen Lin	e794457a59	fix(debug-config, guardian): fix /debug-config rendering and guardian… (#17138 ) ## Description This PR fixes `/debug-config` so it shows more of the active requirements state, including reviewer requirements and managed feature pins. This made it clear that legacy MDM config was setting `approvals_reviewer = "guardian_subagent"` and that we were translating that into a requirements constraint. Also, translate `approvals_reviewer = "guardian_subagent"` (from legacy managed_config.toml) to `allowed_approvals_reviewers: guardian_subagent, user` instead of `allowed_approvals_reviewers: guardian_subagent`. Example `/debug-config`: ``` Config layer stack (lowest precedence first): 1. system (/etc/codex/config.toml) (enabled) 2. user (/Users/owen/.codex/config.toml) (enabled) 3. project (/Users/owen/repos/codex/.codex/config.toml) (enabled) 4. legacy managed_config.toml (MDM) (enabled) MDM value: ... # Enable Guardian Mode features.guardian_approval = true approvals_reviewer = "guardian_subagent" Requirements: - allowed_approvals_reviewers: guardian_subagent, user (source: MDM managed_config.toml (legacy)) - features: apps=true, plugins=true (source: cloud requirements) ``` Before this PR, the `Requirements` section showed None.	2026-04-08 11:08:09 -07:00
pakrym-oai	35b5720e8d	Use AbsolutePathBuf for exec cwd plumbing (#17063 ) ## Summary - Carry `AbsolutePathBuf` through tool cwd parsing/resolution instead of resolving workdirs to raw `PathBuf`s. - Type exec/sandbox request cwd fields as `AbsolutePathBuf` through `ExecParams`, `ExecRequest`, `SandboxCommand`, and unified exec runtime requests. - Keep `PathBuf` conversions at external/event boundaries and update existing tests/fixtures for the typed cwd. ## Validation - `cargo check -p codex-core --tests` - `cargo check -p codex-sandboxing --tests` - `cargo test -p codex-sandboxing` - `cargo test -p codex-core --lib tools::handlers::` - `just fix -p codex-sandboxing` - `just fix -p codex-core` - `just fmt` Full `codex-core` test suite was not run locally; per repo guidance I kept local validation targeted.	2026-04-08 10:54:12 -07:00
Ahmed Ibrahim	06d88b7e81	Add realtime transport config (#17097 ) Adds realtime.transport config with websocket as the default and webrtc wired through the effective config. Co-authored-by: Codex <noreply@openai.com>	2026-04-08 09:53:53 -07:00
Eric Traut	dc5feb916d	Show global AGENTS.md in /status (#17091 ) Addresses #3793 Problem: /status only reported project-level AGENTS files, so sessions with a loaded global $CODEX_HOME/AGENTS.md still showed Agents.md as <none>. Solution: Track the global instructions file loaded during config initialization and prepend that path to the /status Agents.md summary, with coverage for AGENTS.md, AGENTS.override.md, and global-plus-project ordering.	2026-04-08 09:04:32 -07:00
pakrym-oai	4c07dd4d25	Configure multi_agent_v2 spawn agent hints (#17071 ) Allow multi_agent_v2 features to have its own temporary configuration under `[features.multi_agent_v2]` ``` [features.multi_agent_v2] enabled = true usage_hint_enabled = false usage_hint_text = "Custom delegation guidance." hide_spawn_agent_metadata = true ``` Absent `usage_hint_text` means use the default hint. ``` [features] multi_agent_v2 = true ``` still works as the boolean shorthand.	2026-04-08 08:42:18 -07:00
jif-oai	11eff760d1	codex debug 2 (guardian approved) (#17118 ) Removes lines 8-14 from core/templates/agents/orchestrator.md.	2026-04-08 14:14:06 +01:00
jif-oai	2b65f24de6	codex debug 15 (guardian approved) (#17131 ) Removes lines 99-106 from core/templates/agents/orchestrator.md.	2026-04-08 14:11:01 +01:00
jif-oai	95d27bfe8c	codex debug 13 (guardian approved) (#17129 ) Removes lines 85-91 from core/templates/agents/orchestrator.md.	2026-04-08 14:10:54 +01:00
jif-oai	6e9ffa9a1c	codex debug 11 (guardian approved) (#17127 ) Removes lines 71-77 from core/templates/agents/orchestrator.md.	2026-04-08 14:10:47 +01:00
jif-oai	c39477a7d5	codex debug 9 (guardian approved) (#17125 ) Removes lines 57-63 from core/templates/agents/orchestrator.md.	2026-04-08 14:10:41 +01:00
jif-oai	cb77bbfed0	codex debug 7 (guardian approved) (#17123 ) Removes lines 43-49 from core/templates/agents/orchestrator.md.	2026-04-08 14:10:34 +01:00
jif-oai	5f1363d6d0	codex debug 5 (guardian approved) (#17121 ) Removes lines 29-35 from core/templates/agents/orchestrator.md.	2026-04-08 14:10:28 +01:00
jif-oai	8558e8aa51	codex debug 3 (guardian approved) (#17119 ) Removes lines 15-21 from core/templates/agents/orchestrator.md.	2026-04-08 14:10:22 +01:00
jif-oai	22c1fc0131	codex debug 1 (guardian approved) (#17117 ) Removes lines 1-7 from core/templates/agents/orchestrator.md.	2026-04-08 14:10:15 +01:00
Vivian Fang	d47b755aa2	Render namespace description for tools (#16879 )	2026-04-08 02:39:40 -07:00
Vivian Fang	9091999c83	Render function attribute descriptions (#16880 )	2026-04-08 02:10:45 -07:00
Vivian Fang	ea516f9a40	Support anyOf and enum in JsonSchema (#16875 ) This brings us into better alignment with the JSON schema subset that is supported in <https://developers.openai.com/api/docs/guides/structured-outputs#supported-schemas>, and also allows us to render richer function signatures in code mode (e.g., anyOf{null, OtherObjectType})	2026-04-08 01:07:55 -07:00
viyatb-oai	3c1adbabcd	fix: refresh network proxy settings when sandbox mode changes (#17040 ) ## Summary Fix network proxy sessions so changing sandbox mode recomputes the effective managed network policy and applies it to the already-running per-session proxy. ## Root Cause `danger_full_access_denylist_only` injects `"*"` only while building the proxy spec for Full Access. Sessions built that spec once at startup, so a later permission switch to Full Access left the live proxy in its original restricted policy. Switching back needed the same recompute path to remove the synthetic wildcard again. ## What Changed - Preserve the original managed network proxy config/requirements so the effective spec can be recomputed for a new sandbox policy. - Refresh the current session proxy when sandbox settings change, then reapply exec-policy network overlays. - Add an in-place proxy state update path while rejecting listener/port/SOCKS changes that cannot be hot-reloaded. - Keep runtime proxy settings cheap to snapshot and update. - Add regression coverage for workspace-write -> Full Access -> workspace-write.	2026-04-08 03:07:55 +00:00
pash-openai	80ebc80be5	Use model metadata for Fast Mode status (#16949 ) Fast Mode status was still tied to one model name in the TUI and model-list plumbing. This changes the model metadata shape so a model can advertise additional speed tiers, carries that field through the app-server model list, and uses it to decide when to show Fast Mode status. For people using Codex, the behavior is intended to stay the same for existing models. Fast Mode still requires the existing signed-in / feature-gated path; the difference is that the UI can now recognize any model the model list marks as Fast-capable, instead of requiring a new client-side slug check.	2026-04-07 17:55:40 -07:00
pakrym-oai	600c3e49e0	[codex] Apply patches through executor filesystem (#17048 ) ## Summary - run apply_patch through the executor filesystem when a remote environment is present instead of shelling out to the local process - thread the executor FileSystem into apply_patch interception and keep existing local behavior for non-remote turns - make the apply_patch integration harness use the executor filesystem for setup/assertions - add remote-aware skips for turn-diff coverage that still reads the test-runner filesystem ## Why Remote apply_patch needed to mutate the remote workspace instead of the local checkout. The tests also needed to seed and assert workspace state through the same filesystem abstraction so local and remote runs exercise the same behavior. ## Validation - `just fmt` - `git diff --check` - `cargo check -p core_test_support --tests` - `cargo test -p codex-core --test all suite::shell_serialization::apply_patch_custom_tool_call -- --nocapture` - `cargo test -p codex-core --test all suite::apply_patch_cli::apply_patch_cli_updates_file_appends_trailing_newline -- --nocapture` - remote `cargo test -p codex-core --test all apply_patch_cli -- --nocapture` (229 passed)	2026-04-07 16:35:02 -07:00
Ahmed Ibrahim	fb3dcfde1d	Add WebRTC transport to realtime start (#16960 ) Adds WebRTC startup to the experimental app-server `thread/realtime/start` method with an optional transport enum. The websocket path remains the default; WebRTC offers create the realtime session through the shared start flow and emit the answer SDP via `thread/realtime/sdp`. --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-07 15:43:38 -07:00
Dylan Hurd	6c36e7d688	fix(app-server) revert null instructions changes (#17047 )	2026-04-07 15:18:34 -07:00
pakrym-oai	e9702411ab	[codex] Migrate apply_patch to executor filesystem (#17027 ) - Migrate apply-patch verification and application internals to use the async `ExecutorFileSystem` abstraction from `exec-server`. - Convert apply-patch `cwd` handling to `AbsolutePathBuf` through the verifier/parser/handler boundary. Doesn't change how the tool itself works.	2026-04-07 21:20:22 +00:00
Dylan Hurd	d45513ce5a	fix(core) revert Command line in unified exec output (#17031 ) ## Summary https://github.com/openai/codex/pull/13860 changed the serialized output format of Unified Exec. This PR reverts those changes and some related test changes ## Testing - [x] Update tests --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-07 13:35:40 -07:00
pakrym-oai	8614f92fc4	[codex] Fix unified exec test build (#17032 ) ## Summary - Remove the stale `?` after `AbsolutePathBuf::join` in the unified exec integration test helper. ## Root Cause - `AbsolutePathBuf::join` was made infallible, but `core/tests/suite/unified_exec.rs` still treated it as a `Result`, which broke the Windows test build for the `all` integration test target. ## Validation - `just fmt` - `cargo test -p codex-core --test all unified_exec_resolves_relative_workdir`	2026-04-07 12:01:06 -07:00
pakrym-oai	470b3592e6	Add full-ci branch trigger (#16980 ) Allow branches to trigger full ci (helpful to run remote tests)	2026-04-07 11:33:35 -07:00
pakrym-oai	365154d5da	[codex] Make unified exec tests remote aware (#16977 ) ## Summary - Convert unified exec integration tests that can run against the remote executor to use the remote-aware test harness. - Create workspace directories through the executor filesystem for remote runs. - Install `python3` and `zsh` in the remote test container so restored Python/zsh-based test commands work in fresh Ubuntu containers. ## Validation - `just fmt` - `cargo test -p codex-core --test all unified_exec_defaults_to_pipe` - `cargo test -p codex-core --test all unified_exec_can_enable_tty` - `cargo test -p codex-core --test all unified_exec` - Remote on `codex-remote`: `source scripts/test-remote-env.sh && cd codex-rs && cargo test -p codex-core --test all unified_exec` - `just fix -p codex-core`	2026-04-07 10:56:08 -07:00
pakrym-oai	f1a2b920f9	[codex] Make AbsolutePathBuf joins infallible (#16981 ) Having to check for errors every time join is called is painful and unnecessary.	2026-04-07 10:52:08 -07:00
Owen Lin	0b9e42f6f7	fix(guardian): don't throw away transcript when over budget (#16956 ) ## Description This PR changes guardian transcript compaction so oversized conversations no longer collapse into a nearly empty placeholder. Before this change, if the retained user history alone exceeded the message budget, guardian would replace the entire transcript with `<transcript omitted to preserve budget for planned action>`! That meant approvals, especially network approvals, could lose the recent tool call and tool result that explained what guardian was actually reviewing. Now we keep a compact but usable transcript instead of dropping it all. ### Before ``` The following is the Codex agent history whose request action you are assessing... >>> TRANSCRIPT START <transcript omitted to preserve budget for planned action> >>> TRANSCRIPT END Conversation transcript omitted due to size. The Codex agent has requested the following action: >>> APPROVAL REQUEST START Retry reason: Sandbox blocked outbound network access. Assess the exact planned action below. Use read-only tool checks when local state matters. Planned action JSON: { "tool": "network_access", "target": "https://example.com:443", "host": "example.com", "protocol": "https", "port": 443 } >>> APPROVAL REQUEST END ``` ### After ``` The following is the Codex agent history whose request action you are assessing... >>> TRANSCRIPT START [1] user: Please investigate why uploads to example.com are failing and retry if needed. [8] user: If the request looks correct, go ahead and try again with network access. [9] tool shell call: {"command":["curl","-X","POST","https://example.com/upload"],"cwd":"/repo"} [10] tool shell result: sandbox blocked outbound network access >>> TRANSCRIPT END Some conversation entries were omitted. The Codex agent has requested the following action: >>> APPROVAL REQUEST START Retry reason: Sandbox blocked outbound network access. Assess the exact planned action below. Use read-only tool checks when local state matters. Planned action JSON: { "tool": "network_access", "target": "https://example.com:443", "host": "example.com", "protocol": "https", "port": 443 } >>> APPROVAL REQUEST END ```	2026-04-07 10:19:16 -07:00
Owen Lin	5d1671ca70	feat(analytics): generate an installation_id and pass it in responsesapi client_metadata (#16912 ) ## Summary This adds a stable Codex installation ID and includes it on Responses API requests via `x-codex-installation-id` passed in via the `client_metadata` field for analytics/debugging. The main pieces are: - persist a UUID in `$CODEX_HOME/installation_id` - thread the installation ID into `ModelClient` - send it in `client_metadata` on Responses requests so it works consistently across HTTP and WebSocket transports	2026-04-07 09:52:17 -07:00
Ahmed Ibrahim	cd591dc457	Preserve null developer instructions (#16976 ) Preserve explicit null developer-instruction overrides across app-server resume and fork flows.	2026-04-07 09:32:14 -07:00
Eric Traut	feb4f0051a	Fix nested exec thread ID restore (#16882 ) Addresses #15527 Problem: Nested `codex exec` commands could source a shell snapshot that re-exported the parent `CODEX_THREAD_ID`, so commands inside the nested session were attributed to the wrong thread. Solution: Reapply the live command env's `CODEX_THREAD_ID` after sourcing the snapshot.	2026-04-07 09:26:22 -07:00
Eric Traut	82506527f1	Fix read-only apply_patch rejection message (#16885 ) Addresses #15532 Problem: Nested read-only `apply_patch` rejections report in-project files as outside the project. Solution: Choose the rejection message based on sandbox mode so read-only sessions report a read-only-specific reason, and add focused safety coverage.	2026-04-07 09:25:39 -07:00
Eric Traut	3b32de4fab	Stabilize flaky multi-agent followup interrupt test (#16739 ) Problem: The multi-agent followup interrupt test polled history before interrupt cleanup and mailbox wakeup were guaranteed to settle, which made it flaky under CI scheduling variance. Solution: Wait for the child turn's `TurnAborted(Interrupted)` event before asserting that the redirected assistant envelope is recorded and no plain user message is left behind.	2026-04-07 09:24:14 -07:00

1 2 3 4 5 ...

2695 Commits