codex

mirror of https://github.com/openai/codex.git synced 2026-05-24 04:54:52 +00:00

Author	SHA1	Message	Date
Michael Bolin	64eb789b69	Merge `f965330129` into sapling-pr-archive-bolinfest	2026-05-12 20:49:05 -07:00
Michael Bolin	574c05092e	merge commit for archive created by Sapling	2026-05-12 20:44:12 -07:00
Michael Bolin	f965330129	app-server: select permission profiles by id	2026-05-12 20:43:36 -07:00
Michael Bolin	bb7678219f	permissions: move workspace roots onto thread state	2026-05-12 20:43:36 -07:00
Michael Bolin	e724623361	Merge `24f01e177a` into sapling-pr-archive-bolinfest	2026-05-12 20:23:59 -07:00
Michael Bolin	24f01e177a	app-server: select permission profiles by id	2026-05-12 20:23:47 -07:00
Michael Bolin	7336736f44	permissions: move workspace roots onto thread state	2026-05-12 20:15:26 -07:00
Michael Bolin	71ad17cbba	Merge `03e72f10d8` into sapling-pr-archive-bolinfest	2026-05-12 20:03:20 -07:00
Michael Bolin	82e4981273	merge commit for archive created by Sapling	2026-05-12 20:02:52 -07:00
Michael Bolin	03e72f10d8	app-server: select permission profiles by id	2026-05-12 20:02:33 -07:00
Michael Bolin	e929d21081	permissions: move workspace roots onto thread state	2026-05-12 20:01:38 -07:00
Michael Bolin	b0134794d8	Merge `73d36f01e9` into sapling-pr-archive-bolinfest	2026-05-12 19:27:10 -07:00
Michael Bolin	38c1741025	merge commit for archive created by Sapling	2026-05-12 19:26:42 -07:00
Michael Bolin	73d36f01e9	app-server: select permission profiles by id	2026-05-12 19:26:18 -07:00
Michael Bolin	42689ffc1e	permissions: move workspace roots onto thread state	2026-05-12 19:24:18 -07:00
Andrei Eternal	fbfbfe5fc5	hooks: use new session IDs instead of thread IDs for hooks, apply parent's session ID to subagents' hooks (#22268 ) ## Why hook semantics treat `session_id` as shared across a root session and its subagents. Codex hooks were still emitting the current thread ID, which made spawned agents look like independent sessions and made it harder for hook integrations to correlate work across a root thread and its spawned helpers This change makes hooks use Codex's existing shared session identity so hook `session_id` matches the root-thread session across spawned subagents. ## What Changed - switch hook payloads to use the existing shared session identity from core instead of the current thread ID - cover all hook surfaces that expose `session_id`, including `SessionStart`, tool hooks, compact hooks, prompt-submit hooks, stop hooks, and legacy after-agent dispatch	2026-05-12 19:05:10 -07:00
Celia Chen	e2eb7c30fe	feat: route guardian review model selection through providers (#22258 ) ## Why Guardian review selection was hard-coded in `core`, which worked for the default OpenAI path but did not give provider implementations a way to choose backend-specific reviewer model IDs. That matters for Amazon Bedrock: guardian review should run through the Bedrock/Mantle provider using Bedrock's `openai.gpt-5.4` model ID, instead of accidentally selecting a reviewer model that implies the OpenAI backend. ## What Changed - Added provider-owned approval review model selection via `ModelProvider::approval_review_model_selection`. - Moved the existing default selection policy into the provider abstraction: prefer the requested reviewer model when it is available, otherwise fall back to the active turn model, preferring `Low` reasoning when supported. - Added an Amazon Bedrock override that pins guardian review to `openai.gpt-5.4` with `Low` reasoning.	2026-05-13 01:55:46 +00:00
Eric Traut	51bfb5f3b1	Restore app-server websocket listener with auth guard (#22404 ) ## Why PR #21843 removed the TCP websocket app-server listener, but that also removed functionality that still needs to exist. Restoring it as-is would reopen the old remote exposure problem, so this keeps the restored listener while making remote and non-loopback usage require explicit auth. ## What Changed - Mostly reverts #21843 and reapplies the small merge-conflict resolutions needed on top of current main. - Restores ws://IP:PORT parsing, the app-server TCP websocket acceptor, websocket auth CLI flags, and the associated tests. - The only intentional behavior change from the restored code is that non-loopback websocket listeners now fail startup unless --ws-auth capability-token or --ws-auth signed-bearer-token is configured. Loopback listeners remain available for local and SSH-forwarding workflows. ## Reviewer Focus Please focus review on the small auth-enforcement delta layered on top of the revert: - codex-rs/app-server-transport/src/transport/websocket.rs: start_websocket_acceptor now rejects unauthenticated non-loopback websocket binds before accepting connections. - codex-rs/app-server-transport/src/transport/auth.rs: helper logic classifies unauthenticated non-loopback listeners. - codex-rs/app-server/tests/suite/v2/connection_handling_websocket.rs: tests cover unauthenticated ws://0.0.0.0 startup rejection and authenticated non-loopback capability-token startup. Everything else is intended to be revert/merge-conflict restoration rather than new product behavior. ## Verification - Manually verified that TUI remoting is restored and that auth is enforced for non-localhost urls.	2026-05-12 18:40:53 -07:00
Michael Bolin	8a75001bfb	merge commit for archive created by Sapling	2026-05-12 18:28:43 -07:00
Michael Bolin	4d1ad0b7f9	config: add strict config parsing	2026-05-12 18:28:35 -07:00
Michael Bolin	2dcae67ab3	merge commit for archive created by Sapling	2026-05-12 18:06:50 -07:00
Michael Bolin	e5a8668c86	config: add strict config parsing	2026-05-12 18:06:41 -07:00
Michael Bolin	351f911c4c	Merge `68a66c514b` into sapling-pr-archive-bolinfest	2026-05-12 17:57:19 -07:00
Michael Bolin	68a66c514b	config: add strict config parsing	2026-05-12 17:57:13 -07:00
Michael Bolin	4f1c824d55	Merge `ad0b1199f0` into sapling-pr-archive-bolinfest	2026-05-12 17:56:53 -07:00
Michael Bolin	ad0b1199f0	config: add strict config parsing	2026-05-12 17:56:47 -07:00
xl-openai	d1430fd61e	feat: Expose plugin versions and gate plugin sharing (#22397 ) - Adds localVersion to plugin summaries and remoteVersion to share context, including generated API schemas. - Hydrates local and remote plugin versions from manifests and remote release metadata. - Adds default-on plugin_sharing gate for shared-with-me listing and plugin/share/save, with disabled-path errors and focused coverage.	2026-05-12 17:56:30 -07:00
efrazer-oai	01b4817bac	docs(skills): simplify plugin creator deeplink shape (#22240 ) ## Summary Plugin Creator now documents the shorter local-plugin handoff URL that the app can interpret directly. [#22221](https://github.com/openai/codex/pull/22221) teaches the skill to end marketplace-backed creation flows with named View and Share links; this follow-up updates those examples so the skill only emits the normalized plugin name, the absolute marketplace path, and optional share mode. The documented shape is: ```txt codex://plugins/<normalized-plugin-name>?marketplacePath=<absolute-marketplace-json-path> codex://plugins/<normalized-plugin-name>?marketplacePath=<absolute-marketplace-json-path>&mode=share ``` The skill text now states exactly where the normalized plugin name belongs, exactly where the absolute marketplace path belongs, and that it should not add `pluginName` or `hostId` query parameters. ## Testing Tests: plugin-creator skill validation.	2026-05-13 00:54:52 +00:00
Owen Lin	2237a13cf1	mark Feature::RemoteControl as removed (#22386 ) ## Why `remote_control` can appear in `config.toml`, CLI feature overrides, and the app-server config APIs. Before this PR, app-server startup treated `config.features.enabled(Feature::RemoteControl)` as the signal to start remote control ([base code](`5e3ee5eddf/codex-rs/app-server/src/lib.rs (L678-L680)`)). That meant a user with: ```toml [features] remote_control = true ``` would accidentally opt every app-server process into remote control. Remote-control startup should instead be a per-process launch decision made by CLI flags. ## What Changed - Marks `Feature::RemoteControl` as `Stage::Removed`, keeping `remote_control` as a known compatibility key while making it config-inert. - Adds a hidden `--remote-control` process flag to `codex app-server` and standalone `codex-app-server`. - Plumbs that flag through `AppServerRuntimeOptions.remote_control_enabled` and makes app-server startup use only that runtime option to decide whether to start remote control. - Removes the app-server config mutation hook that reloaded config and toggled remote control at runtime. - Updates managed daemon spawning to use `codex app-server --remote-control --listen unix://` instead of `--enable remote_control`. Config APIs can still list, read, write, and set `remote_control`; those operations just no longer affect remote-control process enrollment.	2026-05-13 00:52:45 +00:00
sayan-oai	1ae9867296	[codex] Remove tool search bucket limit override (#22381 ) ## Why `tool_search` still carries the server-specific result-cap path added in #17684 for `computer-use`: when the model omitted `limit`, a matching result expanded the search to 20 and then `limit_results_by_bucket` applied per-bucket caps. That makes default result handling depend on a one-off server exception instead of the single `TOOL_SEARCH_DEFAULT_LIMIT` path. This PR removes that custom branch so omitted `limit` values use the ordinary global default consistently. The implementation being retired is the pre-change bucketed search path in [`tool_search.rs`](`5e3ee5eddf/codex-rs/core/src/tools/handlers/tool_search.rs (L121-L190)`). ## What changed - Collapse `ToolSearchHandler::search` back to one BM25 search with the resolved limit. - Remove `limit_results_by_bucket`, the `computer-use` constants, and the omitted-limit plumbing that only existed for the override. - Drop dead `ToolSearchEntry::limit_bucket` metadata from deferred MCP and dynamic search entries. - Remove tests and helpers that only asserted the deleted override behavior. - Add direct handler-level unit coverage for omitted/default and explicit `tool_search` result limits. ## Validation - `cargo test -p codex-core tool_search` - The matching unit tests passed, including the new omitted/default and explicit result-limit coverage. - The broader `--test all` search-tool fixture phase then failed before sending mocked response requests in `tool_search_indexes_only_enabled_non_app_mcp_tools` and `tool_search_uses_non_app_mcp_server_instructions_as_namespace_description`. - `cargo test -p codex-core` - The touched tool-search coverage passed before the run later aborted in `tools::handlers::multi_agents::tests::tool_handlers_cascade_close_and_resume_and_keep_explicitly_closed_subtrees_closed` with a stack overflow.	2026-05-13 00:46:07 +00:00
Michael Bolin	35a82ef83e	merge commit for archive created by Sapling	2026-05-12 17:41:42 -07:00
Michael Bolin	5a0813d812	app-server: select permission profiles by id	2026-05-12 17:40:17 -07:00
Michael Bolin	8bafd4f6a5	permissions: move workspace roots onto thread state	2026-05-12 17:40:17 -07:00
Eric Traut	92930a8d40	Refactor chatwidget state into modules (#22269 ) ## Why `chatwidget.rs` is still carrying too many unrelated responsibilities in one file. After #21866 consolidated some of the state it tracks, this starts the next phase by moving coherent state/helper clusters out of the main module without changing behavior. This PR is intentionally mechanical: it only moves existing functions, structs, and helpers into focused modules so the boundaries are easier to review before the less mechanical refactors that should follow. ## What Changed - Moved user-message, composer, queue, pending steer, and merge/remap helpers into `codex-rs/tui/src/chatwidget/user_messages.rs`. - Added `codex-rs/tui/src/chatwidget/exec_state.rs` for unified exec bookkeeping helpers. - Added `codex-rs/tui/src/chatwidget/rate_limits.rs` for rate-limit warning, prompt, and error classification state. - Moved plugin list fetch and install auth-flow state into `codex-rs/tui/src/chatwidget/plugins.rs`. - Made a couple of test-only `VecDeque` imports explicit now that those tests no longer inherit the parent module import. ## Verification - `cargo test -p codex-tui` was run ## Follow-On Refactor Phases This PR is phase 1: mechanical helper and state moves. Planned follow-up PRs: - Phase 2: extract input and submission flow, including queued user messages, shell prompt submission, pending steer restoration, and thread input snapshot/restore behavior. - Phase 3: extract protocol, replay, streaming, and tool lifecycle handling, while preserving active-cell grouping, transcript invalidation, interrupt deferral, and final-message separator behavior. - Phase 4: extract settings, popups, and status surfaces, including model/reasoning/collaboration/personality popups, permission prompts, rate-limit UI, and connectors helpers. - Phase 5: clean up the remaining constructor and orchestration code once the larger behavior domains have moved out, leaving `chatwidget.rs` as the composition layer.	2026-05-12 17:33:33 -07:00
Michael Bolin	273135dc31	Merge `dc0f61af1f` into sapling-pr-archive-bolinfest	2026-05-12 17:30:38 -07:00
Tom	c51c65ad09	Unify thread metadata updates above store (#22236 ) - make ThreadStore::update_thread_metadata accept a broad range of metadata patches - keep ThreadStore::append_items as raw canonical history append (no metadata side effects) - in the local store, write these metadata updates to a combination of sqlite and rollout jsonl files for backwards-compat. It special cases which fields need to go into jsonl vs sqlite vs whatever, confining the awkwardness to just this implementation - in remote stores we can simply persist the metadata directly to a database, no special casing required. - move the "implicit metadata updates triggered by appending rollout items" from the RolloutRecorder (which is local-threadstore-specific) to the LiveThread layer above the ThreadStore, inside of a private helper utility called ThreadMetadataSync. LiveThread calls ThreadStore append_items and update_metadata separately. - Add a generic update metadata method to ThreadManager that works on both live threads and "cold" threads - Call that ThreadManager method from app server code, so app server doesn't need to worry about whether the thread is live or not	2026-05-13 00:28:15 +00:00
Michael Bolin	251d79117e	Merge `8153d12590` into sapling-pr-archive-bolinfest	2026-05-12 17:24:10 -07:00
pakrym-oai	f11ad1eacb	[codex] Add search term coverage for tool_search (#22398 ) ## Why `tool_search` already had solid end-to-end coverage for discovery and follow-up execution, but it did not prove that distinct pieces of indexed search text actually work in integration. In particular, we were not exercising whether unique tool names, descriptions, namespaces, underscore-expanded dynamic names, and schema-property terms were sufficient to surface the expected deferred tools. This change adds focused integration coverage for those term sources so regressions in search text construction are caught by a real `TestCodex` flow instead of only by lower-level unit tests. ## What changed - added a small helper in `core/tests/suite/search_tool.rs` to assert that a `tool_search_output` contains an expected namespace child tool - added an MCP integration test that issues several `tool_search_call`s and verifies distinct query terms match the expected app tools: - exact tool name: `calendar_timezone_option_99` - tool description phrase: `uploaded document` - top-level schema property: `starts_at` - added a dynamic-tool integration test that verifies distinct query terms match the expected deferred dynamic tool: - exact name: `quasar_ping_beacon` - underscore-expanded name: `quasar ping beacon` - description phrase: `saffron metronome` - namespace: `orbit_ops` - schema property: `chrono_spec` ## Validation - `cargo test -p codex-core tool_search_matches_` ## Docs No documentation update needed.	2026-05-13 00:24:07 +00:00
Michael Bolin	8153d12590	config: add strict config parsing	2026-05-12 17:23:56 -07:00
Michael Bolin	9e7cdbd0d2	core: box multi-agent handler futures (#22266 ) ## Why This is the base PR in the split stack for the permissions migration. It isolates stack-safety work that had been mixed into the larger permissions PR, so reviewers can evaluate the async-future changes separately from the permissions model changes in #22267. The main risk this addresses is large or recursive multi-agent futures overflowing smaller runner stacks. A follow-up review also called out that `shutdown_live_agent` must remain quiescent: callers should not remove a live agent from tracking or release its spawn slot until the worker loop has actually terminated. ## What Changed - Boxes the large async futures in the multi-agent spawn, resume, and close tool handlers. - Boxes the `AgentControl` spawn and recursive close/shutdown paths that can otherwise build very deep futures. - Keeps `shutdown_live_agent` waiting for thread termination before removing/releasing the live agent, preserving the previous shutdown ordering while still boxing the recursive close path. ## Verification Strategy The focused local coverage was `cargo test -p codex-core multi_agents`, which exercises the multi-agent spawn/resume/close handlers, cascade close/resume behavior, and the shutdown path touched by this PR. --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/openai/codex/pull/22266). * #22330 * #22329 * #22328 * #22327 * __->__ #22266	2026-05-12 17:22:25 -07:00
Michael Bolin	dc0f61af1f	app-server: select permission profiles by id	2026-05-12 17:18:17 -07:00
Michael Bolin	f68960302a	permissions: move workspace roots onto thread state	2026-05-12 17:18:17 -07:00
Channing Conger	589b820d6e	code-mode: Add pending-aware code mode execution (#22280 ) Introduce execute_to_pending and wait_to_pending APIs that freeze pending-mode runtimes until an explicit resume, while preserving the existing continuously-running execute path. Add runtime and service coverage for pending, resume, completion, and freeze behavior.	2026-05-12 17:16:57 -07:00
pakrym-oai	0173f71143	Refactor namespaced tool spec registration (#22256 ) ## Summary This refactor makes tool handlers the owner of the specs they can publish, so registry construction can register handlers once and separately publish only the specs that should be model-visible. The main motivation is deferred tools: MCP and dynamic tools still need handlers registered up front, but deferred tools should be discoverable through `tool_search` rather than emitted in the initial tool spec list. ## What changed - `McpHandler` and `DynamicToolHandler` can return their own `ToolSpec`. - `build_tool_registry_builder` now collects handlers, registers them through the no-spec path, and publishes only non-deferred handler specs. - Deferred MCP and dynamic tool names are combined into one `all_deferred_tools` set that drives spec filtering, code-mode deferred-tool signaling, and `tool_search` registration. - `tool_search` registration now requires both deferred tools and `namespace_tools`. - Namespace specs are merged in `spec_plan`, preserving top-level spec order, sorting tools within each namespace, and backfilling empty namespace descriptions. - Hosted web search and image-generation specs are included in the collected spec vector before namespace merge/publication, and tool-name tests that should not care about hosted relative order now compare sets. ## Testing - `cargo test -p codex-core tools::spec::tests:: -- --nocapture` - `cargo test -p codex-core tools::spec_plan::tests:: -- --nocapture` - `cargo test -p codex-core tools::router::tests::specs_filter_deferred_dynamic_tools -- --nocapture` - `cargo test -p codex-core suite::prompt_caching::prompt_tools_are_consistent_across_requests -- --nocapture` - `just fmt` - `just fix -p codex-core` - `cargo test -p codex-core -- --skip tools::handlers::multi_agents::tests::tool_handlers_cascade_close_and_resume_and_keep_explicitly_closed_subtrees_closed` passed the library suite after skipping the known stack-overflowing unit test. Full `cargo test -p codex-core` currently hits a stack overflow in `tools::handlers::multi_agents::tests::tool_handlers_cascade_close_and_resume_and_keep_explicitly_closed_subtrees_closed`; the same focused test reproduces on `origin/main`.	2026-05-12 17:09:14 -07:00
richardopenai	b6e718591b	[codex] Remove workspace owner usage nudge gate (#20509 ) ## Summary - make workspace owner nudge handling unconditional in the TUI now that it is fully rolled out - keep `workspace_owner_usage_nudge` as a removed no-op compatibility flag so old configs/app overrides remain accepted during rollout - remove flag-disabled test setup ## Companion PR - https://github.com/openai/openai/pull/876351 removes the Codex Apps Statsig rollout gate override after this change is available to the app/runtime path ## Validation - `just write-config-schema` - `just fmt` - `cargo test -p codex-features` - `cargo test -p codex-tui status_and_layout`	2026-05-12 17:07:33 -07:00
Michael Bolin	3d9af7a5b5	Merge `1f12d07fd1` into sapling-pr-archive-bolinfest	2026-05-12 16:51:01 -07:00
Michael Bolin	1b233d3714	core: box multi-agent handler futures	2026-05-12 16:50:48 -07:00
Michael Bolin	1f12d07fd1	docs: clarify permissions thread lifecycle API	2026-05-12 16:50:48 -07:00
Michael Bolin	9d5c8bfd88	app-server: test empty workspace roots roundtrip	2026-05-12 16:50:48 -07:00
Michael Bolin	9f75b476b1	app-server: test persisted active permission profile	2026-05-12 16:50:48 -07:00

1 2 3 4 5 ...

15387 Commits