codex

mirror of https://github.com/openai/codex.git synced 2026-05-01 18:06:47 +00:00

Author	SHA1	Message	Date
Eric Traut	59a1d53909	Merge branch 'goal-mode-3-tools' into goal-mode-4-core-runtime	2026-04-15 22:00:04 -07:00
Eric Traut	e5abd13232	codex: address PR review feedback (#18074 )	2026-04-15 21:57:37 -07:00
Eric Traut	aa6847c32b	Add goal mode core runtime	2026-04-15 21:39:01 -07:00
Eric Traut	270c426176	Add goal mode app-server API	2026-04-15 21:28:16 -07:00
pakrym-oai	bd61737e8a	Async config loading (#18022 ) Parts of config will come from executor. Prepare for that by making config loading methods async.	2026-04-15 19:18:38 -07:00
Ruslan Nigmatullin	f948690fc8	[codex] Make command exec delta tests chunk tolerant (#17999 ) ## Summary - Make command/exec output-delta tests accumulate streamed chunks instead of assuming complete logical output in a single notification. - Collect stdout and stderr independently so stream interleaving does not fail the pipe streaming test. ## Why The command/exec protocol exposes output as deltas, so tests should not rely on chunk boundaries being stable. A line like `out-start\n` may arrive split across multiple notifications, and stdout/stderr notifications may interleave. ## Validation - `just fmt` - `git diff --check` - `cargo test -p codex-app-server suite::v2::command_exec`	2026-04-15 17:57:02 -07:00
bxie-openai	c2bdb7812c	Clarify realtime v2 context and handoff messages (#17896 ) ## Summary - wrap realtime startup context in `<startup_context>...</startup_context>` tags - prefix V2 mirrored user text and relayed backend text with `[USER]` / `[BACKEND]` - remove the V2 progress suffix and replace the final V2 handoff output with a short completion acknowledgement while preserving the existing V1 wrapper ## Testing - cargo test -p codex-api realtime_v2_session_update_includes_background_agent_tool_and_handoff_output_item -- --exact - cargo test -p codex-app-server webrtc_v2_background_agent_ - cargo test -p codex-app-server webrtc_v2_text_input_is_ - cargo test -p codex-core conversation_user_text_turn_is_	2026-04-15 16:26:20 -07:00
evawong-oai	17d94bd1e3	[docs] Revert extra changes from PR 17848 (#18003 ) ## Summary 1. Revert https://github.com/openai/codex/pull/17848 so the Bazel and `BUILD` file changes leave `main`. 2. Prepare for a narrower follow up that restores only `SECURITY.md`. ## Validation 1. Reviewed the revert diff against `main`. 2. Ran a clean diff check before push.	2026-04-15 14:43:30 -07:00
Eugene Brevdo	bc969b6516	Dismiss stale app-server requests after remote resolution (#15134 ) Dismiss stale TUI app-server approvals after remote resolution When an approval, user-input prompt, or elicitation request is resolved by another client, the TUI now dismisses the matching local UI instead of leaving stale prompts behind and emitting a misleading local cancellation. This change teaches pending app-server request tracking to map `serverRequest/resolved` notifications back to the concrete request type and stable request key, then propagates that resolved request into TUI prompt state. Approval, request-user-input, and MCP elicitation overlays now drop the resolved current or queued request quietly, advance to the next queued request when present, and avoid emitting abort/cancel events for stale UI. The latest update also retires matching prompts while they are still deferred behind active streaming and suppresses buffered active-thread requests whose app-server request id has already been resolved before drain. `ChatWidget` removes a resolved request from both the deferred interrupt queue and the materialized bottom-pane stack, while active-thread request handling verifies the app-server request is still pending before showing a prompt. Lifecycle events such as exec begin/end remain queued so approved work can still render normally. Tests cover resolved-request mapping, overlay dismissal behavior, deferred prompt pruning for same-turn user input, exec approval IDs, lifecycle-event retention, and the buffered active-thread ordering regression. Validation: - `just fmt` - `git diff --check` - `cargo test -p codex-tui resolved_buffered_approval_does_not_become_actionable_after_drain` - `cargo test -p codex-tui enqueue_primary_thread_session_replays_buffered_approval_after_attach` - `cargo test -p codex-tui chatwidget::interrupts` - `just fix -p codex-tui` --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-15 13:57:41 -07:00
Tom	50d3128269	Migrate archive/unarchive to local ThreadStore (#17892 ) # Summary - implement local ThreadStore archive/unarchive operations - implement local ThreadStore read_thread operation - break up the various ThreadStore local method implementations into separate files - migrate app-server archive/unarchive and core archive fixture to use ThreadStore (but not all read operations yet!) - use the ThreadStore's read operation as a proxy check for thread persistence/existence in the app server code - move all other filesystem operations related to archive (path validation etc) into the local thread store. # Tests - add dedicated local store archive/unarchive tests	2026-04-15 20:48:09 +00:00
evawong-oai	0bb438bca6	[docs] Add security boundaries reference in SECURITY.md (#17848 ) ## Summary 1. Add a Security Boundaries section to `SECURITY.md`. 2. Point readers to the Codex Agent approvals and security documentation for sandboxing, approvals, and network controls. ## Validation 1. Reviewed the `SECURITY.md` diff in a clean worktree. 2. No tests run. Docs only change.	2026-04-15 20:12:46 +00:00
jif-oai	7e7b35b4d2	fix: propagate log db (#17953 ) It restores the TRACE logs in the DB and `/feedback` Fix https://github.com/openai/codex/pull/16184 Result: https://openai.sentry.io/issues/6972946529/?project=4510195390611458&query=019d91e9-f931-7451-8852-c5240514a419&referrer=issue-stream	2026-04-15 20:25:53 +01:00
Ruslan Nigmatullin	83abf67d20	app-server: track remote-control seq IDs per stream (#17902 ) ## Summary - Track outbound remote-control sequence IDs independently for each client stream. - Retain unacked outbound messages per stream using FIFO buffers. - Require stream-scoped acks and update tests for contiguous per-stream sequencing. ## Why The remote-control peer uses outbound sequence gaps to detect lost messages and re-initialize. A single global outbound sequence counter can create apparent gaps on an individual stream when another stream receives an interleaved message. ## Validation - `just fmt` - `cargo test -p codex-app-server remote_control` - `just fix -p codex-app-server` - `git diff --check`	2026-04-15 11:52:53 -07:00
Tom	cdfcd2ca92	[codex] Add local thread store listing (#17824 ) Builds on top of #17659 Move the filesystem + sqlite thread listing-related operations inside of a local ThreadStore implementation and call ThreadStore from the places that used to perform these filesystem/sqlite operations. This is the first of a series of PRs that will implement the rest of the local ThreadStore. Testing: - added unit tests for the thread store implementation - adjusted some unit tests in the realtime + personality packages whose callsites changed. Specifically I'm trying to hide ThreadMetadata inside of the local implementation and make ThreadMetadata a sqlite implementation detail concern rather than a public interface, preferring the more generate StoredThread interface instead - added a corner case test for the personality migration package that wasn't covered by the existing test suite - adjust the behavior of searched thread listing to run the existing local rollout repair/backfill pass _before_ querying SQLite results, so callers using ThreadStore::list_threads do not miss matches after a partial metadata warm-up	2026-04-15 11:34:27 -07:00
Adrian	8e784bba2f	Register agent identities behind use_agent_identity (#17386 ) ## Summary Stack PR 2 of 4 for feature-gated agent identity support. This PR adds agent identity registration behind `features.use_agent_identity`. It keeps the app-server protocol unchanged and starts registration after ChatGPT auth exists rather than requiring a client restart. ## Stack - PR1: https://github.com/openai/codex/pull/17385 - add `features.use_agent_identity` - PR2: https://github.com/openai/codex/pull/17386 - this PR - PR3: https://github.com/openai/codex/pull/17387 - register agent tasks when enabled - PR4: https://github.com/openai/codex/pull/17388 - use `AgentAssertion` downstream when enabled ## Validation Covered as part of the local stack validation pass: - `just fmt` - `cargo test -p codex-core --lib agent_identity` - `cargo test -p codex-core --lib agent_assertion` - `cargo test -p codex-core --lib websocket_agent_task` - `cargo test -p codex-api api_bridge` - `cargo build -p codex-cli --bin codex` ## Notes The full local app-server E2E path is still being debugged after PR creation. The current branch stack is directionally ready for review while that follow-up continues.	2026-04-15 10:08:27 -07:00
jif-oai	ea13527961	nit: doc (#17941 )	2026-04-15 14:51:20 +01:00
sayan-oai	0df7e9a820	register all mcp tools with namespace (#17404 ) stacked on #17402. MCP tools returned by `tool_search` (deferred tools) get registered in our `ToolRegistry` with a different format than directly available tools. this leads to two different ways of accessing MCP tools from our tool catalog, only one of which works for each. fix this by registering all MCP tools with the namespace format, since this info is already available. also, direct MCP tools are registered to responsesapi without a namespace, while deferred MCP tools have a namespace. this means we can receive MCP `FunctionCall`s in both formats from namespaces. fix this by always registering MCP tools with namespace, regardless of deferral status. make code mode track `ToolName` provenance of tools so it can map the literal JS function name string to the correct `ToolName` for invocation, rather than supporting both in core. this lets us unify to a single canonical `ToolName` representation for each MCP tool and force everywhere to use that one, without supporting fallbacks.	2026-04-15 21:02:59 +08:00
jif-oai	5e544be3c9	chore: do not disable memories for past rollouts on reset (#17919 )	2026-04-15 12:05:39 +01:00
jif-oai	7579d5ad75	feat: add endpoint to delete memories (#17913 )	2026-04-15 10:35:06 +01:00
viyatb-oai	e4a3612f11	fix: add websocket capability token hash support (#17871 ) ## Summary - Allow app-server websocket capability auth to accept a precomputed SHA-256 digest via `--ws-token-sha256`. - Keep token-file support and enforce exactly one capability token source. - Document the new auth flag. ## Testing - `just fmt` - `cargo test -p codex-app-server transport::auth::tests` - `cargo test -p codex-app-server websocket_capability_token_sha256_args_parse` - `cargo test -p codex-cli app_server_capability_token_flags_parse` - `cargo clippy -p codex-app-server --all-targets -- -D warnings` - `just fix -p codex-cli` --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-14 22:06:39 -07:00
alexsong-oai	ca650561d6	support plugins in external agent config migration (#17855 )	2026-04-14 19:39:10 -07:00
xli-oai	3cc689fb23	[codex] Support local marketplace sources (#17756 ) ## Summary - Port marketplace source support into the shared core marketplace-add flow - Support local marketplace directory sources - Support direct `marketplace.json` URL sources - Persist the new source types in config/schema and cover them in CLI and app-server tests ## Validation - `cargo test -p codex-core marketplace_add` - `cargo test -p codex-cli marketplace_add` - `cargo test -p codex-app-server marketplace_add` - `just write-config-schema` - `just fmt` - `just fix -p codex-core` - `just fix -p codex-cli` ## Context Current `main` moved marketplace-add behavior into shared core code and still assumed only git-backed sources. This change keeps that structure but restores support for local directories and direct manifest URLs in the shared path.	2026-04-14 15:58:14 -07:00
pakrym-oai	96254a763a	Make skill loading filesystem-aware (#17720 ) Migrates skill loading to support reading repo skills from the remote environment.	2026-04-14 15:40:40 -07:00
jif-oai	42166ba260	fix: apply patch bin refresh (#17808 ) Make sure the link to apply patch binary (i.e. codex) does not die in case of an update Fix this: https://openai.slack.com/archives/C08MGJXUCUQ/p1776183247771849 --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-14 22:27:47 +01:00
pakrym-oai	dd1321d11b	Spread AbsolutePathBuf (#17792 ) Mechanical change to promote absolute paths through code.	2026-04-14 14:26:10 -07:00
rhan-oai	d6b13276c7	[codex-analytics] enable general analytics by default (#17389 ) ## Summary - Make GeneralAnalytics stable and enabled by default. - Update feature tests and app-server lifecycle fixtures for explicit general_analytics=false. - Keep app-server integration tests isolated from host managed config so explicit feature fixtures are deterministic. ## Validation - cargo test -p codex-features - cargo test -p codex-app-server general_analytics (matched 0 tests) - cargo test -p codex-app-server thread_start_ - cargo test -p codex-app-server thread_fork_ - cargo test -p codex-app-server thread_resume_ - cargo test -p codex-app-server config_read_includes_system_layer_and_overrides	2026-04-14 13:20:46 -07:00
Eric Traut	1fd9c33207	[codex] Fix app-server initialized request analytics build (#17830 ) Problem: PR #17372 moved initialized request handling into `dispatch_initialized_client_request`, leaving analytics code that uses `connection_id` without a local binding and breaking `codex-app-server` builds. Solution: Restore the `connection_id` binding from `connection_request_id` before initialized request validation and analytics tracking.	2026-04-14 13:11:04 -07:00
Ruslan Nigmatullin	23d4098c0f	app-server: prepare to run initialized rpcs concurrently (#17372 ) ## Summary - Refactors `MessageProcessor` and per-connection session state so initialized service RPC handling can be moved into spawned tasks in a follow-up PR. - Shares the processor and initialized session data with `Arc`/`OnceLock` instead of mutable borrowed connection state. - Keeps initialized request handling synchronous in this PR; it does not call `tokio::spawn` for service RPCs yet. ## Testing - `just fmt` - `cargo test -p codex-app-server` (fails on existing hardening gaps covered by #17375, #17376, and #17377; the pipelined config regression passed before the unrelated failures) - `just fix -p codex-app-server`	2026-04-14 11:24:34 -07:00
viyatb-oai	81c0bcc921	fix: Revert danger-full-access denylist-only mode (#17732 ) ## Summary - Reverts openai/codex#16946 and removes the danger-full-access denylist-only network mode. - Removes the corresponding config requirements, app-server protocol/schema, config API, TUI debug output, and network proxy behavior. - Drops stale tests that depended on the reverted mode while preserving newer managed allowlist-only coverage. ## Verification - `just write-app-server-schema` - `just fmt` - `cargo test -p codex-config network_requirements` - `cargo test -p codex-core network_proxy_spec` - `cargo test -p codex-core managed_network_proxy_decider_survives_full_access_start` - `cargo test -p codex-app-server map_requirements_toml_to_api` - `cargo test -p codex-tui debug_config_output` - `cargo test -p codex-app-server-protocol` - `just fix -p codex-config -p codex-core -p codex-app-server-protocol -p codex-app-server -p codex-tui` - `git diff --cached --check` Not run: full workspace `cargo test` (repo instructions ask for confirmation before that broader run).	2026-04-14 09:50:14 -07:00
David de Regt	4f2fc3e3fa	Moving updated-at timestamps to unique millisecond times (#17489 ) To allow the ability to have guaranteed-unique cursors, we make two important updates: * Add new updated_at_ms and created_at_ms columns that are in millisecond precision * Guarantee uniqueness -- if multiple items are inserted at the same millisecond, bump the new one by one millisecond until it becomes unique This lets us use single-number cursors for forwards and backwards paging through resultsets and guarantee that the cursor is a fixed point to do (timestamp > cursor) and get new items only. This updated implementation is backwards-compatible since multiple appservers can be running and won't handle the previous method well.	2026-04-14 11:55:34 -04:00
Ahmed Ibrahim	2f6fc7c137	Add realtime output modality and transcript events (#17701 ) - Add outputModality to thread/realtime/start and wire text/audio output selection through app-server, core, API, and TUI.\n- Rename the realtime transcript delta notification and add a separate transcript done notification that forwards final text from item done without correlating it with deltas.	2026-04-14 00:13:13 -07:00
rhan-oai	b704df85b8	[codex-analytics] feature plumbing and emittance (#16640 ) --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/openai/codex/pull/16640). * #16870 * #16706 * #16641 * __->__ #16640	2026-04-13 23:11:49 -07:00
pakrym-oai	3b24a9a532	Refactor plugin loading to async (#17747 ) Simplifies skills migration.	2026-04-13 21:52:56 -07:00
xli-oai	ff584c5a4b	[codex] Refactor marketplace add into shared core flow (#17717 ) ## Summary Move `codex marketplace add` onto a shared core implementation so the CLI and app-server path can use one source of truth. This change: - adds shared marketplace-add orchestration in `codex-core` - switches the CLI command to call that shared implementation - removes duplicated CLI-only marketplace add helpers - preserves focused parser and add-path coverage while moving the shared behavior into core tests ## Why The new `marketplace/add` RPC should reuse the same underlying marketplace-add flow as the CLI. This refactor lands that consolidation first so the follow-up app-server PR can be mostly protocol and handler wiring. ## Validation - `cargo test -p codex-core marketplace_add` - `cargo test -p codex-cli marketplace_cmd` - `just fix -p codex-core` - `just fix -p codex-cli` - `just fmt`	2026-04-13 20:37:11 -07:00
pakrym-oai	f3cbe3d385	[codex] Add symlink flag to fs metadata (#17719 ) Add `is_symlink` to FsMetadata struct.	2026-04-13 17:46:56 -07:00
pakrym-oai	d4be06adea	Add turn item injection API (#17703 ) ## Summary - Add `turn/inject_items` app-server v2 request support for appending raw Responses API items to a loaded thread history without starting a turn. - Generate JSON schema and TypeScript protocol artifacts for the new params and empty response. - Document the new endpoint and include a request/response example. - Preserve compatibility with the typo alias `turn/injet_items` while returning the canonical method name. ## Testing - Not run (not requested)	2026-04-13 16:11:05 -07:00
Ahmed Ibrahim	0e31dc0d4a	change realtime tool description (#17699 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2026-04-13 14:31:31 -07:00
Ruslan Nigmatullin	a5507b59c4	app-server: Only unload threads which were unused for some time (#17398 ) Currently app-server may unload actively running threads once the last connection disconnects, which is not expected. Instead track when was the last active turn & when there were any subscribers the last time, also add 30 minute idleness/no subscribers timer to reduce the churn.	2026-04-13 12:25:26 -07:00
jif-oai	46a266cd6a	feat: disable memory endpoint (#17626 )	2026-04-13 18:29:49 +01:00
pakrym-oai	ac82443d07	Use AbsolutePathBuf in skill loading and codex_home (#17407 ) Helps with FS migration later	2026-04-13 10:26:51 -07:00
Eric Traut	d25a9822a7	Do not fail thread start when trust persistence fails (#17595 ) Addresses #17593 Problem: A regression introduced in https://github.com/openai/codex/pull/16492 made thread/start fail when Codex could not persist trusted project state, which crashes startup for users with read-only config.toml. Solution: Treat trusted project persistence as best effort and keep the current thread's config trusted in memory when writing config.toml fails.	2026-04-13 10:03:21 -07:00
Eric Traut	7c797c6544	Suppress duplicate compaction and terminal wait events (#17601 ) Addresses #17514 Problem: PR #16966 made the TUI render the deprecated context-compaction notification, while v2 could also receive legacy unified-exec interaction items alongside terminal-interaction notifications, causing duplicate "Context compacted" and "Waited for background terminal" messages. Solution: Suppress deprecated context-compaction notifications and legacy unified-exec interaction command items from the app-server v2 projection, and render canonical context-compaction items through the existing TUI info-event path.	2026-04-13 08:59:19 -07:00
starr-openai	d626dc3895	Run exec-server fs operations through sandbox helper (#17294 ) ## Summary - run exec-server filesystem RPCs requiring sandboxing through a `codex-fs` arg0 helper over stdin/stdout - keep direct local filesystem execution for `DangerFullAccess` and external sandbox policies - remove the standalone exec-server binary path in favor of top-level arg0 dispatch/runtime paths - add sandbox escape regression coverage for local and remote filesystem paths ## Validation - `just fmt` - `git diff --check` - remote devbox: `cd codex-rs && bazel test --bes_backend= --bes_results_url= //codex-rs/exec-server:all` (6/6 passed) --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-12 18:36:03 -07:00
pakrym-oai	7c1e41c8b6	Add MCP tool wall time to model output (#17406 ) Include MCP wall time in the output so the model is aware of how long it's calls are taking.	2026-04-12 18:26:15 -07:00
Eric Traut	46ab9974dc	Expose instruction sources (AGENTS.md) via app server (#17506 ) Addresses #17498 Problem: The TUI derived /status instruction source paths from the local client environment, which could show stale <none> output or incorrect paths when connected to a remote app server. Solution: Add an app-server v2 instructionSources snapshot to thread start/resume/fork responses, default it to an empty list when older servers omit it, and render TUI /status from that server-provided session data. Additional context: The app-server field is intentionally named instructionSources rather than AGENTS.md-specific terminology because the loaded instruction sources can include global instructions, project AGENTS.md files, AGENTS.override.md, user-defined instruction files, and future dynamic sources.	2026-04-12 15:50:12 -07:00
Ahmed Ibrahim	d840b247d7	Mirror user text into realtime (#17520 ) - Let typed user messages submit while realtime is active and mirror accepted text into the realtime text stream. - Add integration coverage and snapshot for outbound realtime text.	2026-04-12 15:03:14 -07:00
Adrian	39cc85310f	Add use_agent_identity feature flag (#17385 )	2026-04-11 09:52:06 -07:00
ningyi-oai	be13f03c39	Pass turn id with feedback uploads (#17314 ) ## Summary - Add an optional `tags` dictionary to feedback upload params. - Capture the active app-server turn id in the TUI and submit it as `tags.turn_id` with `/feedback` uploads. - Merge client-provided feedback tags into Sentry feedback tags while preserving reserved system fields like `thread_id`, `classification`, `cli_version`, `session_source`, and `reason`. ## Behavior / impact Existing feedback upload callers remain compatible because `tags` is optional and nullable. The wire shape is still a normal JSON object / TypeScript dictionary, so adding future feedback metadata will not require a new top-level protocol field each time. This change only adds feedback metadata for Codex CLI/TUI uploads; it does not affect existing pipelines, DAGs, exports, or downstream consumers unless they choose to read the new `turn_id` feedback tag. ## Tests - `cargo fmt -- --config imports_granularity=Item` passed; stable rustfmt warned that `imports_granularity` is nightly-only. - `cargo run -p codex-app-server-protocol --bin write_schema_fixtures` - `cargo test -p codex-feedback upload_tags_include_client_tags_and_preserve_reserved_fields` - `cargo test -p codex-app-server-protocol schema_fixtures_match_generated` - `cargo test -p codex-tui build_feedback_upload_params` - `cargo test -p codex-tui live_app_server_turn_started_sets_feedback_turn_id` - `cargo check -p codex-app-server --tests` - `git diff --check` --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-11 00:23:50 -07:00
Eric Traut	e9e7ef3d36	Fix thread/list cwd filtering for Windows verbatim paths (#17414 ) Addresses #17302 Problem: `thread/list` compared cwd filters with raw path equality, so `resume --last` could miss Windows sessions when the saved cwd used a verbatim path form and the current cwd did not. Solution: Normalize cwd comparisons through the existing path comparison utilities before falling back to direct equality, and add Windows regression coverage for verbatim paths. I made this a general utility function and replaced all of the duplicated instance of it across the code base.	2026-04-10 23:08:02 -07:00
Matthew Zeng	b7139a7e8f	[mcp] Support MCP Apps part 3 - Add mcp tool call support. (#17364 ) - [x] Add a new app-server method so that MCP Apps can call their own MCP server directly.	2026-04-11 04:39:19 +00:00

1 2 3 4 5 ...

782 Commits