codex

mirror of https://github.com/openai/codex.git synced 2026-05-23 20:44:50 +00:00

Author	SHA1	Message	Date
Michael Bolin	7364013fe0	tests: mutate spawn-agent permission profile directly	2026-04-30 03:43:57 -07:00
Michael Bolin	c886193921	app-server tests: select turn permission profiles by name	2026-04-30 03:39:50 -07:00
Michael Bolin	b6d55cd9f2	file-system: drop unused legacy sandbox constructor	2026-04-30 03:33:23 -07:00
Michael Bolin	2876493bae	tests: use disabled profile in exec capture check	2026-04-30 03:28:48 -07:00
Michael Bolin	e894ac76f7	tests: use profile constructors in config checks	2026-04-30 03:26:51 -07:00
Michael Bolin	4cf7855a99	tests: use permission profiles in multi-agent config checks	2026-04-30 03:23:18 -07:00
Michael Bolin	52d200da00	tests: remove sandbox policy fixture from rollout trace	2026-04-30 03:20:17 -07:00
Michael Bolin	c48043f4e4	tests: use permission profiles in session network checks	2026-04-30 03:18:22 -07:00
Michael Bolin	8a2144d700	tests: use permission profiles in config loader checks	2026-04-30 03:18:22 -07:00
Michael Bolin	0fc2a7b068	tests: submit websocket turns with permission profiles	2026-04-30 03:08:22 -07:00
Michael Bolin	4f646e0aca	tests: use permission profiles in exec policy checks	2026-04-30 03:04:35 -07:00
Michael Bolin	e28bb5c396	tests: use permission profiles in request permission suite	2026-04-30 03:01:06 -07:00
Michael Bolin	521cf5bdd4	tests: use permission profiles in unified exec suite	2026-04-30 03:01:06 -07:00
Michael Bolin	57094ee86d	core: use permission profiles in small read-only contexts	2026-04-30 03:01:06 -07:00
Michael Bolin	550adee585	exec tests: launch sandbox cases from permission profiles	2026-04-30 02:36:30 -07:00
Michael Bolin	200c83f7d7	tests: use permission profiles in suite turn submits	2026-04-30 02:36:30 -07:00
Michael Bolin	cfeaa5aab1	guardian: configure review session permissions directly	2026-04-30 02:36:30 -07:00
Michael Bolin	75c9c98aed	tests: use permission profiles in small core fixtures	2026-04-30 02:36:30 -07:00
Michael Bolin	d2e3e3613b	exec-server: use permission profiles in file system handler tests	2026-04-30 02:36:30 -07:00
Michael Bolin	57f895a7c0	memories: configure consolidation permissions directly	2026-04-30 02:36:30 -07:00
Michael Bolin	0cc3264ed4	analytics: distinguish custom managed permission profiles	2026-04-30 02:36:30 -07:00
Michael Bolin	05d341f0d4	tests: use permission profiles in guardian config checks	2026-04-30 02:36:30 -07:00
Michael Bolin	d53c86e0da	tests: use permission profiles in unix escalation checks	2026-04-30 02:36:30 -07:00
Michael Bolin	44ec706a44	tests: use permission profiles in patch safety checks	2026-04-30 02:36:30 -07:00
Michael Bolin	a3880e937b	tests: use permission profiles in tool sandbox tests	2026-04-30 02:36:30 -07:00
Michael Bolin	ee05c896f7	tests: use permission profile fixtures in config checks	2026-04-30 02:36:30 -07:00
Michael Bolin	ada7881352	core: build permission instructions from profiles only	2026-04-30 02:36:30 -07:00
Michael Bolin	c4c371f257	utils: summarize permission profiles directly	2026-04-30 02:36:30 -07:00
Michael Bolin	97aaf4cea4	tests: copy plugin stdio server before launch	2026-04-30 02:36:21 -07:00
jif-oai	c37f7434ba	Gate multi-agent v2 tools independently of collab (#20246 ) ## Why `multi_agents_v2` is meant to be independently gated from the older `collab` feature. The tool registry still treated the collaboration-style agent tools as `collab`-only, so enabling `multi_agents_v2` without `collab` omitted the v2 agent tools. Review and guardian sub-sessions also need to keep agent spawning disabled even when the outer session has `multi_agents_v2` enabled. ## What changed - Include the collab-backed agent tools when either `multi_agents_v2` or `collab` is enabled. - Explicitly disable `multi_agents_v2` for review and guardian review sub-sessions, matching the existing `spawn_csv` and `collab` restrictions. - Add a registry test that enables `multi_agents_v2`, disables `collab`, and verifies the v2 agent tools are present while legacy `send_input` and `resume_agent` remain hidden. ## Testing - Added `test_build_specs_multi_agent_v2_does_not_require_collab_feature`.	2026-04-30 10:23:31 +02:00
Eric Traut	a73403a890	Make missing config clears no-ops (#20334 ) ## Why Fixes #20145. `config/value/write` treats a JSON `null` value as a request to clear the config key. Clearing a key that is already absent should be idempotent, but clearing a nested key such as `features.personality` from an empty `config.toml` returned `configPathNotFound` because `clear_path` treated the missing `features` parent table as an error. That makes app-server reset flows brittle because clients have to read first and avoid sending a clear request unless the parent path already exists. ## What Changed - Updated app-server config clearing so missing intermediate tables, or non-table parents, are treated as an unchanged no-op. - Removed the now-unreachable `MergeError::PathNotFound` path from config write merging. - Added a regression test covering `features.personality = null` against an empty user config. ## Verification - `cargo test -p codex-app-server clear_missing_nested_config_is_noop` - `cargo test -p codex-app-server` was run; the config manager unit suite passed, but one unrelated integration test failed because `turn_start_emits_thread_scoped_warning_notification_for_trimmed_skills` expected `7` trimmed skills and observed `8`. - `just fix -p codex-app-server`	2026-04-30 10:13:33 +02:00
xl-openai	87d0cf1a62	feat: Add workspace plugin sharing APIs (#20278 ) 1. Adds v2 plugin/share/save, plugin/share/list, and plugin/share/delete RPCs. 2. Implements save by archiving a local plugin root, enforcing a size limit, uploading through the workspace upload flow, and supporting updates via remotePluginId. 3. Lists created workspace plugins 4. Deletes a previously uploaded/shared plugin.	2026-04-29 23:49:20 -07:00
Michael Bolin	ae863e72a2	ci: increase Windows release workflow timeouts (#20343 ) ## Why #20271 increased the `90`-minute timeout in `rust-release.yml`, but it did not update the reusable Windows workflow in `rust-release-windows.yml`. As a result, the Windows release compile jobs were still capped at `60` minutes and the `windows-x64` primary build could continue timing out. We are keeping the existing `90`-minute timeout in `rust-release.yml`. That increase was still directionally correct because the top-level release build benefits from extra headroom; the mistake was assuming it also covered the reusable Windows jobs. ## What Changed - increase the reusable Windows release workflow timeouts in `rust-release-windows.yml` from `60` minutes to `90` minutes - update the comment in `rust-release.yml` so it no longer implies that the top-level timeout covers the Windows reusable jobs	2026-04-29 23:27:04 -07:00
Abhinav	8f3c06cc97	Add persisted hook enablement state (#19840 ) ## Why After `hooks/list` exposes the hook inventory, clients need a way to persist user hook preferences, make those changes effective in already-open sessions, and distinguish user-controllable hooks from managed requirements without adding another bespoke app-server write API. ## What - Extends `hooks/list` entries with effective `enabled` state. - Persists user-level hook state under `hooks.state.<hook-id>` so the model can grow beyond a single boolean over time. - Uses the existing `config/batchWrite` path for hook state updates instead of introducing a dedicated hook write RPC. - Refreshes live session hook engines after config writes so already-open threads observe updated enablement without a restart. ## Stack 1. openai/codex#19705 2. openai/codex#19778 3. This PR - openai/codex#19840 4. openai/codex#19882 ## Reviewer Notes The generated schema files account for much of the raw diff. The core behavior is in: - `hooks/src/config_rules.rs`, which resolves per-hook user state from the config layer stack. - `hooks/src/engine/discovery.rs`, which projects effective enablement into `hooks/list` from source-derived managedness. - `config/src/hook_config.rs`, which defines the new `hooks.state` representation. - `core/src/session/mod.rs`, which rebuilds live hook state after user config reloads. --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-30 04:46:32 +00:00
Michael Bolin	ac4332c05b	permissions: expose active profile metadata (#20095 )	2026-04-29 20:54:59 -07:00
Matthew Zeng	ebe602d005	[plugins] Allow MSFT curated plugins in tool_suggest (#20304 ) ## Summary - [x] Move the allowlist out of core crate - [x] Add Teams, SharePoint, Outlook Email, and Outlook Calendar to the tool_suggest discoverable plugin allowlist - [x] Add focused coverage for Microsoft curated plugin discovery ## Testing - just fmt - cargo test -p codex-core-plugins - cargo test -p codex-core list_tool_suggest_discoverable_plugins_returns_	2026-04-29 19:45:52 -07:00
pakrym-oai	4e677d62da	app-server: remove dead api version handling from bespoke events (#20291 ) Remove ApiVersion::V1	2026-04-30 01:55:44 +00:00
rhan-oai	bb536d65bd	[codex-analytics] prevent stale guardian events from satisfying reused reviews (#20080 ) ## Why Reused Guardian review trunks can still have older child-turn events queued when a later review starts. The review waiter currently accepts the first terminal event it sees from the shared child session, so a stale `TurnComplete` can be attributed to the new review. That produces impossible analytics combinations such as non-null TTFT with sub-10 ms completion latency and zero token deltas on `trunk_reused` reviews. ## What changed - Preserve the child turn id returned by the Guardian review `Op::UserTurn` submission. - Restrict Guardian review waiting to events correlated with that submitted child turn. - Restrict timeout/abort draining to terminal events for the same child turn. - Add regression coverage for stale prior-turn completions, stale prior-turn errors, and interrupt draining in `codex-rs/core/src/guardian/review_session.rs`. ## Verification - `cargo test -p codex-core guardian::review_session::tests::` - `cargo clippy -p codex-core --tests -- -D warnings`	2026-04-29 18:26:39 -07:00
Alex Zamoshchin	8b07132e09	update codex_plugins_beta_setting (from workspace settings) (#20250 ) update the name after rename internally see https://github.com/openai/openai/pull/871006	2026-04-30 00:40:25 +00:00
Eric Traut	515aa9a4fb	tui: return from side chat on Ctrl-D (#20282 ) ## Why Fixes #20264. Side conversations are an ephemeral layer on top of the main chat. Pressing `Ctrl+D` from an empty side-chat composer should unwind back to the parent thread, matching the existing side-return behavior, instead of falling through to the global quit shortcut and exiting Codex. ## What changed The side-return shortcut matcher now treats `Ctrl+D` the same way it already treats `Esc` and `Ctrl+C`. Because app-level side-return handling runs before the chat widget's global quit handling, this returns from `/side` while preserving normal `Ctrl+D` quit behavior outside side conversations. The existing shortcut coverage was updated to include lowercase and uppercase `Ctrl+D` key events. ## Verification - `cargo test -p codex-tui side_return_shortcuts_match_esc_ctrl_c_and_ctrl_d` - `cargo test -p codex-tui` starts successfully and the new shortcut test passes, but the broader suite later aborts in the unrelated existing test `app::tests::attach_live_thread_for_selection_rejects_unmaterialized_fallback_threads` with a stack overflow.	2026-04-29 17:26:11 -07:00
pakrym-oai	fedcefe9da	Reduce the surface of collaboration modes (#20149 ) Collaboration modes were slightly invasive both into ThreadManager construction and ModelProvider	2026-04-29 17:22:41 -07:00
stefanstokic-oai	c8abcbf925	Import external agent sessions in background (#20284 ) Summary: - Return from external agent import before session history import finishes - Run session import work in the background and emit the existing completion notification when it is done - Serialize session imports so duplicate requests do not create duplicate imported threads Verification: - cargo test -p codex-app-server external_agent_config_ - cargo test -p codex-external-agent-sessions - just fix -p codex-app-server - just fix -p codex-external-agent-sessions - git diff --check	2026-04-30 00:00:41 +00:00
alexsong-oai	7bcd4626c4	Consume ai-title from external sessions and add end marker (#20261 ) ## Summary - Support Claude Code `ai-title` / `aiTitle` records when detecting and importing external agent sessions. - Preserve existing `custom-title` / `customTitle` precedence; only fall back to `aiTitle` when no custom title is present. - Add coverage for both detection and import title selection, including the custom-title-over-ai-title case. ## Testing - `cargo test -p codex-external-agent-sessions` - `just fix -p codex-external-agent-sessions`	2026-04-30 00:00:13 +00:00
Abhinav	8774229a89	Add hooks/list app-server RPC (#19778 ) ## Why We need a way to list the available hooks to expose via the TUI and App so users can view and manage their hooks ## What - Adds `hooks/list` for one or more `cwd` values that returns discovered hook metadata ## Stack 1. openai/codex#19705 2. This PR - openai/codex#19778 3. openai/codex#19840 4. openai/codex#19882 ## Review Notes The generated schema files account for most of the raw diff, these files have the core change: - `hooks/src/engine/discovery.rs` builds the inventory entries during hook discovery while leaving runtime handlers focused on execution. - `app-server/src/codex_message_processor.rs` wires `hooks/list` into the app-server flow for each requested `cwd`. - `app-server-protocol/src/protocol/v2.rs` defines the new v2 request/response payloads exposed on the wire. ### Core Changes `core/src/plugins/manager.rs` adds `plugins_for_layer_stack(...)` so `skills/list` and `hooks/list`can resolve plugin state for each requested `cwd` --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-29 23:39:57 +00:00
Michael Bolin	6eab7519b4	chore: increase release build timeout from 60 min to 90 (#20271 ) Build times are creeping up, so increase the timeout as a precaution.	2026-04-29 16:19:59 -07:00
rafael-jac	98f67b15d3	Update Codex login success page UX (#20136 ) ## Summary update the local login success page to match the Codex desktop auth UX use theme-aware colors and an inline 20px Codex mark keep the actual localhost success page aligned with the browser auth UX PR ## Tests <img width="1728" height="1117" alt="Screenshot 2026-04-29 at 12 00 34 PM" src="https://github.com/user-attachments/assets/76a40c3f-07c3-452c-97da-e7c43717cd2c" />	2026-04-29 19:14:53 -04:00
evawong-oai	74f06dcdfb	Enforce workspace metadata protections in Linux sandbox (#19852 ) ## Summary Enforce FileSystemSandboxPolicy protected metadata names in the Linux bubblewrap adapter so `.git`, `.agents`, and `.codex` remain read only inside writable workspace roots unless the policy grants an explicit write carveout. ## Scope 1. Translate protected metadata names from FileSystemSandboxPolicy into bubblewrap masks for existing metadata paths. 2. Represent missing protected metadata paths as guarded mount targets so agents cannot create `.git`, `.agents`, or `.codex` under writable roots. 3. Preserve normal git discovery for existing repos, worktrees, and parent repos. 4. Keep explicit user write grants working when policy allows a protected metadata path directly. ## Not in scope 1. No shell preflight UX. 2. No TUI runtime profile propagation. 3. No macOS Seatbelt changes in this PR. ## Reviewer focus 1. This should be reviewed as the Linux enforcement adapter for the policy primitive from PR 19846. 2. macOS enforcement already landed in PR 19847. 3. The important invariant is that `FileSystemSandboxPolicy` is the source of truth for `.git`, `.agents`, and `.codex`. ## Validation 1. `git diff` whitespace check passed. 2. `cargo fmt` check passed with the existing stable rustfmt warning about `imports_granularity`. 3. Full Linux sandbox Cargo test suite passed on the devbox. 4. Devbox forty six case suite passed at head `012accb703c13bd28df5b40079a9bf183036336a`. 5. Devbox summary: pass 46, fail 0. 6. The devbox suite was run through `just c sandbox linux`. 7. Focused repo test for Viyat parent repo case passed on the devbox.	2026-04-29 16:14:14 -07:00
iceweasel-oai	13dbcda28f	stop blocking unified_exec on Windows (#19435 ) ## Summary - remove the Windows-specific unified-exec environment block from tool selection - keep `unified_exec` default-off on Windows unless the feature is explicitly enabled - normalize model-provided `shell_type = unified_exec` to `shell_command` when the feature is disabled - drop obsolete tests tied to the removed environment gate and keep the feature-flag regression coverage ## Why Now that the session/long-lived process backend is implemented for the Windows sandbox, we don't need to hard disable it anymore. We will be rolling out slowly using a feature gate. ## Impact This allows manual Windows opt-in in CLI and app-backed flows while preserving the existing default-off behavior for Windows users. --------- Co-authored-by: canvrno-oai <kbond@openai.com> Co-authored-by: Codex <noreply@openai.com>	2026-04-29 16:06:33 -07:00
pakrym-oai	8de2a7a16d	Add codex-core public API listing (#20243 ) Summary: - Add a checked-in codex-core public API listing generated by cargo-public-api. - Add scripts/regen-public-api.sh with an embedded crate list, auto-install for cargo-public-api 0.51.0, pinned nightly, and --check mode. - Add Rust CI jobs on the codex Linux x64 runner pool to verify the listing stays up to date. Testing: - bash -n scripts/regen-public-api.sh - just regen-public-api --check - yq '.' .github/workflows/rust-ci.yml .github/workflows/rust-ci-full.yml - git diff --check	2026-04-29 22:58:08 +00:00
Rasmus Rygaard	782191547c	Add agent graph store interface (#19229 ) ## Summary Persisted subagent parent/child topology currently leaks through `StateRuntime`'s SQLite-specific thread-spawn helpers. This PR introduces a narrow `AgentGraphStore` boundary so follow-up work can route graph operations through a local or remote store without coupling orchestration code directly to the state DB graph API. ## Changes - Adds the new `codex-agent-graph-store` crate. - Defines a flat `AgentGraphStore` trait for the v1 graph surface: upsert edge, set edge status, list direct children, and list descendants. - Adds public graph types for `ThreadSpawnEdgeStatus`, `AgentGraphStoreError`, and `AgentGraphStoreResult`. - Implements `LocalAgentGraphStore` on top of an existing `codex_state::StateRuntime`, preserving today's SQLite-backed `thread_spawn_edges` behavior. - Registers the crate in Cargo/Bazel metadata. This PR only adds the local contract and implementation; call-site migration and the remote gRPC store are left to the follow-up PRs in the stack. ## Testing - `cargo test -p codex-agent-graph-store` The new unit tests cover local parity with the existing `StateRuntime` graph methods, `Open`/`Closed` filtering, status updates, and stable breadth-first descendant ordering.	2026-04-29 22:48:26 +00:00

1 2 3 4 5 ...

6056 Commits