codex

mirror of https://github.com/openai/codex.git synced 2026-05-01 01:47:18 +00:00

Author	SHA1	Message	Date
Ahmed Ibrahim	9179c9deac	Merge Modelfamily into modelinfo (#8763 ) - Merge ModelFamily into ModelInfo - Remove logic for adding instructions to apply patch - Add compaction limit and visible context window to `ModelInfo`	2026-01-07 10:35:09 -08:00
jif-oai	116059c3a0	chore: unify conversation with thread name (#8830 ) Done and verified by Codex + refactor feature of RustRover	2026-01-07 17:04:53 +00:00
jif-oai	4cef89a122	chore: rename unified exec sessions (#8822 ) Renaming done by Codex	2026-01-07 16:12:47 +00:00
Thibault Sottiaux	230a045ac9	chore: stabilize core tool parallelism test (#8805 ) Set login=false for the shell tool in the timing-based parallelism test so it does not depend on slow user login shells, making the test deterministic without user-facing changes. This prevents occasional flakes when running locally.	2026-01-07 09:26:47 +00:00
charley-oai	3389465c8d	Enable model upgrade popup even when selected model is no longer in picker (#8802 ) With `config.toml`: ``` model = "gpt-5.1-codex" ``` (where `gpt-5.1-codex` has `show_in_picker: false` in [`model_presets.rs`](https://github.com/openai/codex/blob/main/codex-rs/core/src/models_manager/model_presets.rs); this happens if the user hasn't used codex in a while so they didn't see the popup before their model was changed to `show_in_picker: false`) The upgrade picker used to not show (because `gpt-5.1-codex` was filtered out of the model list in code). Now, the filtering is done downstream in tui and app-server, so the model upgrade popup shows: <img width="1503" height="227" alt="Screenshot 2026-01-06 at 5 04 37 PM" src="https://github.com/user-attachments/assets/26144cc2-0b3f-4674-ac17-e476781ec548" />	2026-01-06 19:32:27 -08:00
sayan-oai	54ded1a3c0	add web_search_cached flag (#8795 ) Add `web_search_cached` feature to config. Enables `web_search` tool with access only to cached/indexed results (see [docs](https://platform.openai.com/docs/guides/tools-web-search#live-internet-access)). This takes precedence over the existing `web_search_request`, which continues to enable `web_search` over live results as it did before. `web_search_cached` is disabled for review mode, as `web_search_request` is.	2026-01-06 14:53:59 -08:00
jif-oai	1dd1355df3	feat: agent controller (#8783 ) Added an agent control plane that lets sessions spawn or message other conversations via `AgentControl`. `AgentBus` (core/src/agent/bus.rs) keeps track of the last known status of a conversation. ConversationManager now holds shared state behind an Arc so AgentControl keeps only a weak back-reference, the goal is just to avoid explicit cycle reference. Follow-ups: * Build a small tool in the TUI to be able to see every agent and send manual message to each of them * Handle approval requests in this TUI * Add tools to spawn/communicate between agents (see related design) * Define agent types	2026-01-06 19:08:02 +00:00
Michael Bolin	cafb07fe6e	feat: add justification arg to prefix_rule() in *.rules (#8751 ) Adds an optional `justification` parameter to the `prefix_rule()` execpolicy DSL so policy authors can attach human-readable rationale to a rule. That justification is propagated through parsing/matching and can be surfaced to the model (or approval UI) when a command is blocked or requires approval. When a command is rejected (or gated behind approval) due to policy, a generic message makes it hard for the model/user to understand what went wrong and what to do instead. Allowing policy authors to supply a short justification improves debuggability and helps guide the model toward compliant alternatives. Example: ```python prefix_rule( pattern = ["git", "push"], decision = "forbidden", justification = "pushing is blocked in this repo", ) ``` If Codex tried to run `git push origin main`, now the failure would include: ``` `git push origin main` rejected: pushing is blocked in this repo ``` whereas previously, all it was told was: ``` execpolicy forbids this command ```	2026-01-05 21:24:48 +00:00
Anton Panasenko	807f8a43c2	feat: expose outputSchema to user_turn/turn_start app_server API (#8377 ) What changed - Added `outputSchema` support to the app-server APIs, mirroring `codex exec --output-schema` behavior. - V1 `sendUserTurn` now accepts `outputSchema` and constrains the final assistant message for that turn. - V2 `turn/start` now accepts `outputSchema` and constrains the final assistant message for that turn (explicitly per-turn only). Core behavior - `Op::UserTurn` already supported `final_output_json_schema`; now V1 `sendUserTurn` forwards `outputSchema` into that field. - `Op::UserInput` now carries `final_output_json_schema` for per-turn settings updates; core maps it into `SessionSettingsUpdate.final_output_json_schema` so it applies to the created turn context. - V2 `turn/start` does NOT persist the schema via `OverrideTurnContext` (it’s applied only for the current turn). Other overrides (cwd/model/etc) keep their existing persistent behavior. API / docs - `codex-rs/app-server-protocol/src/protocol/v1.rs`: add `output_schema: Option<serde_json::Value>` to `SendUserTurnParams` (serialized as `outputSchema`). - `codex-rs/app-server-protocol/src/protocol/v2.rs`: add `output_schema: Option<JsonValue>` to `TurnStartParams` (serialized as `outputSchema`). - `codex-rs/app-server/README.md`: document `outputSchema` for `turn/start` and clarify it applies only to the current turn. - `codex-rs/docs/codex_mcp_interface.md`: document `outputSchema` for v1 `sendUserTurn` and v2 `turn/start`. Tests added/updated - New app-server integration tests asserting `outputSchema` is forwarded into outbound `/responses` requests as `text.format`: - `codex-rs/app-server/tests/suite/output_schema.rs` - `codex-rs/app-server/tests/suite/v2/output_schema.rs` - Added per-turn semantics tests (schema does not leak to the next turn): - `send_user_turn_output_schema_is_per_turn_v1` - `turn_start_output_schema_is_per_turn_v2` - Added protocol wire-compat tests for the merged op: - serialize omits `final_output_json_schema` when `None` - deserialize works when field is missing - serialize includes `final_output_json_schema` when `Some(schema)` Call site updates (high level) - Updated all `Op::UserInput { .. }` constructions to include `final_output_json_schema`: - `codex-rs/app-server/src/codex_message_processor.rs` - `codex-rs/core/src/codex_delegate.rs` - `codex-rs/mcp-server/src/codex_tool_runner.rs` - `codex-rs/tui/src/chatwidget.rs` - `codex-rs/tui2/src/chatwidget.rs` - plus impacted core tests. Validation - `just fmt` - `cargo test -p codex-core` - `cargo test -p codex-app-server` - `cargo test -p codex-mcp-server` - `cargo test -p codex-tui` - `cargo test -p codex-tui2` - `cargo test -p codex-protocol` - `cargo clippy --all-features --tests --profile dev --fix -- -D warnings`	2026-01-05 10:27:00 -08:00
Thibault Sottiaux	0b53aed2d0	fix: /review to respect session cwd (#8738 ) Fixes /review base-branch prompt resolution to use the session/turn cwd (respecting runtime cwd overrides) so merge-base/diff guidance is computed from the intended repo; adds a regression test for cwd overrides; tested with cargo test -p codex-core --test all review_uses_overridden_cwd_for_base_branch_merge_base.	2026-01-05 12:11:20 +00:00
Ahmed Ibrahim	efd2d76484	Account for last token count on resume (#8677 ) last token count in context manager is initialized to 0. Gets populated only on events from server. This PR populates it on resume so we can decide if we need to compact or not.	2026-01-02 23:20:20 +00:00
Ahmed Ibrahim	66b7c673e9	Refresh on models etag mismatch (#8491 ) - Send models etag - Refresh models on 412 - This wires `ModelsManager` to `ModelFamily` so we don't mutate it mid-turn	2026-01-01 11:41:16 -08:00
Michael Bolin	e61bae12e3	feat: introduce codex-utils-cargo-bin as an alternative to assert_cmd::Command (#8496 ) This PR introduces a `codex-utils-cargo-bin` utility crate that wraps/replaces our use of `assert_cmd::Command` and `escargot::CargoBuild`. As you can infer from the introduction of `buck_project_root()` in this PR, I am attempting to make it possible to build Codex under [Buck2](https://buck2.build) as well as `cargo`. With Buck2, I hope to achieve faster incremental local builds (largely due to Buck2's [dice](https://buck2.build/docs/insights_and_knowledge/modern_dice/) build strategy, as well as benefits from its local build daemon) as well as faster CI builds if we invest in remote execution and caching. See https://buck2.build/docs/getting_started/what_is_buck2/#why-use-buck2-key-advantages for more details about the performance advantages of Buck2. Buck2 enforces stronger requirements in terms of build and test isolation. It discourages assumptions about absolute paths (which is key to enabling remote execution). Because the `CARGO_BIN_EXE_` environment variables that Cargo provides are absolute paths (which `assert_cmd::Command` reads), this is a problem for Buck2, which is why we need this `codex-utils-cargo-bin` utility. My WIP-Buck2 setup sets the `CARGO_BIN_EXE_` environment variables passed to a `rust_test()` build rule as relative paths. `codex-utils-cargo-bin` will resolve these values to absolute paths, when necessary. --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/openai/codex/pull/8496). * #8498 * __->__ #8496	2025-12-23 19:29:32 -08:00
Ahmed Ibrahim	40de81e7af	Remove reasoning format (#8484 ) This isn't very useful parameter. logic: ``` if model puts `**` in their reasoning, trim it and visualize the header. if couldn't trim: don't render if model doesn't support: don't render ``` We can simplify to: ``` if could trim, visualize header. if not, don't render ```	2025-12-23 16:01:46 -08:00
Anton Panasenko	cbc5fb9acf	chore: save more about turn context in rollout log file (#8458 ) ### Motivation - Persist richer per-turn configuration in rollouts so resumed/forked sessions and tooling can reason about the exact instruction inputs and output constraints used for a turn. ### Description - Extend `TurnContextItem` to include optional `base_instructions`, `user_instructions`, and `developer_instructions`. - Record the optional `final_output_json_schema` associated with a turn. - Add an optional `truncation_policy` to `TurnContextItem` and populate it when writing turn-context rollout items. - Introduce a protocol-level `TruncationPolicy` representation and convert from core truncation policy when recording. ### Testing - `cargo test -p codex-protocol` (pass)	2025-12-22 19:51:07 -08:00
Ahmed Ibrahim	6b2ef216f1	remove minimal client version (#8447 ) This isn't needed value by client	2025-12-22 12:52:24 -08:00
Dylan Hurd	33e1d0844a	feat(windows) start powershell in utf-8 mode (#7902 ) ## Summary Adds a FeatureFlag to enforce UTF8 encoding in powershell, particularly Windows Powershell v5. This should help address issues like #7290. Notably, this PR does not include the ability to parse `apply_patch` invocations within UTF8 shell commands (calls to the freeform tool should not be impacted). I am leaving this out of scope for now. We should address before this feature becomes Stable, but those cases are not the default behavior at this time so we're okay for experimentation phase. We should continue cleaning up the `apply_patch::invocation` logic and then can handle it more cleanly. ## Testing - [x] Adds additional testing	2025-12-22 09:36:44 -08:00
Gav Verma	fab1ded484	Remove plan from system skills (#8374 ) Removes plan from system skills. It has been rewritten into `create-plan` for evaluation and feedback: https://github.com/openai/skills/pull/22	2025-12-20 19:42:53 -08:00
Shijie Rao	987dd7fde3	Chore: remove rmcp feature and exp flag usages (#8087 ) ### Summary With codesigning on Mac, Windows and Linux, we should be able to safely remove `features.rmcp_client` and `use_experimental_use_rmcp_client` check from the codebase now.	2025-12-20 14:18:00 -08:00
Ahmed Ibrahim	f0dc6fd3c7	Rename OpenAI models to models manager (#8346 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2025-12-19 16:20:05 -08:00
Michael Bolin	dc61fc5f50	feat: support allowed_sandbox_modes in requirements.toml (#8298 ) This adds support for `allowed_sandbox_modes` in `requirements.toml` and provides legacy support for constraining sandbox modes in `managed_config.toml`. This is converted to `Constrained<SandboxPolicy>` in `ConfigRequirements` and applied to `Config` such that constraints are enforced throughout the harness. Note that, because `managed_config.toml` is deprecated, we do not add support for the new `external-sandbox` variant recently introduced in https://github.com/openai/codex/pull/8290. As noted, that variant is not supported in `config.toml` today, but can be configured programmatically via app server.	2025-12-19 21:09:20 +00:00
Michael Bolin	0a7021de72	fix: enable resume_warning that was missing from mod.rs (#8333 ) This test was introduced in https://github.com/openai/codex/pull/6507, but was not included in `mod.rs`. It does not appear that it was getting compiled?	2025-12-19 19:21:47 +00:00
GalaxyDetective	014235f533	Fix: /undo destructively interacts with git staging (#8214 ) (#8303 ) Fixes #8214 by removing the '--staged' flag from the undo git restore command. This ensures that while the working tree is reverted to the snapshot state, the user's staged changes (index) are preserved, preventing data loss. Also adds a regression test.	2025-12-19 10:07:41 -08:00
Michael Bolin	3d4ced3ff5	chore: migrate from Config::load_from_base_config_with_overrides to ConfigBuilder (#8276 ) https://github.com/openai/codex/pull/8235 introduced `ConfigBuilder` and this PR updates all call non-test call sites to use it instead of `Config::load_from_base_config_with_overrides()`. This is important because `load_from_base_config_with_overrides()` uses an empty `ConfigRequirements`, which is a reasonable default for testing so the tests are not influenced by the settings on the host. This method is now guarded by `#[cfg(test)]` so it cannot be used by business logic. Because `ConfigBuilder::build()` is `async`, many of the test methods had to be migrated to be `async`, as well. On the bright side, this made it possible to eliminate a bunch of `block_on_future()` stuff.	2025-12-18 16:12:52 -08:00
jif-oai	87abf06e78	fix: flaky tests 5 (#8282 )	2025-12-18 21:08:43 +00:00
Ahmed Ibrahim	7157421daa	splash screen (#8270 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2025-12-18 10:59:53 -08:00
Ahmed Ibrahim	f084e5264b	caribou (#8265 ) Welcome caribou <img width="1536" height="1024" alt="image" src="https://github.com/user-attachments/assets/2a67b21f-40cf-4518-aee4-691af331ab50" />	2025-12-18 08:58:44 -08:00
Ahmed Ibrahim	374d591311	chores: clean picker (#8232 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2025-12-18 08:41:34 -08:00
xl-openai	5c8d22138a	Reintroduce feature flags for skills. (#8244 ) 1. Reintroduce feature flags for skills; 2. UI tweaks (truncate descriptions, better validation error display).	2025-12-18 01:14:11 -08:00
xl-openai	da3869eeb6	Support SYSTEM skills. (#8220 ) 1. Remove PUBLIC skills and introduce SYSTEM skills embedded in the binary and installed into $CODEX_HOME/skills/.system at startup. 2. Skills are now always enabled (feature flag removed). 3. Update skills/list to accept forceReload and plumb it through (not used by clients yet).	2025-12-17 18:48:28 -08:00
Ahmed Ibrahim	6f102e18c4	Show migration link (#8228 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2025-12-18 02:03:40 +00:00
Ahmed Ibrahim	774bd9e432	feat: model picker (#8209 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2025-12-17 16:12:35 -08:00
Ahmed Ibrahim	927a6acbea	Load models from static file (#8153 ) - Load models from static file as a fallback - Make API users use this file directly - Add tests to make sure updates to the file always serialize	2025-12-17 14:34:13 -08:00
jif-oai	167553f00d	fix: session downgrade (#8196 ) The problem is that the `tokio` task own an `Arc` reference of the session and that this task only exit with the broadcast channel get closed. But this never get closed if the session is not dropped. So it's a snake biting his tail basically The most notable result was that non of the `Drop` implementation were triggered (temporary files, shell snapshots, session cleaning etc etc) when closing the session (through a `/new` for example) The fix is just to weaken the `Arc` and upgrade it on the fly	2025-12-17 10:44:39 -08:00
gt-oai	9352c6b235	feat: Constrain values for approval_policy (#7778 ) Constrain `approval_policy` through new `admin_policy` config. This PR will: 1. Add a `admin_policy` section to config, with a single field (for now) `allowed_approval_policies`. This list constrains the set of user-settable `approval_policy`s. 2. Introduce a new `Constrained<T>` type, which combines a current value and a validator function. The validator function ensures disallowed values are not set. 3. Change the type of `approval_policy` on `Config` and `SessionConfiguration` from `AskForApproval` to `Constrained<AskForApproval>`. The validator function is set by the values passed into `allowed_approval_policies`. 4. `GenericDisplayRow`: add a `disabled_reason: Option<String>`. When set, it disables selection of the value and indicates as such in the menu. This also makes it unselectable with arrow keys or numbers. This is used in the `/approvals` menu. Follow ups are: 1. Do the same thing to `sandbox_policy`. 2. Propagate the allowed set of values through app-server for the extension (though already this should prevent app-server from setting this values, it's just that we want to disable UI elements that are unsettable). Happy to split this PR up if you prefer, into the logical numbered areas above. Especially if there are parts we want to gavel on separately (e.g. admin_policy). Disabled full access: <img width="1680" height="380" alt="image" src="https://github.com/user-attachments/assets/1fb61c8c-1fcb-4dc4-8355-2293edb52ba0" /> Disabled `--yolo` on startup: <img width="749" height="76" alt="image" src="https://github.com/user-attachments/assets/0a1211a0-6eb1-40d6-a1d7-439c41e94ddb" /> CODEX-4087	2025-12-17 16:19:27 +00:00
jif-oai	2bf57674d6	fix: flaky test 6 (#8175 )	2025-12-17 11:59:13 +00:00
xl-openai	4897efcced	Add public skills + improve repo skill discovery and error UX (#8098 ) 1. Adds SkillScope::Public end-to-end (core + protocol) and loads skills from the public cache directory 2. Improves repo skill discovery by searching upward for the nearest .codex/skills within a git repo 3. Deduplicates skills by name with deterministic ordering to avoid duplicates across sources 4. Fixes garbled “Skill errors” overlay rendering by preventing pending history lines from being injected during the modal 5. Updates the project docs “Skills” intro wording to avoid hardcoded paths	2025-12-17 01:35:49 -08:00
Eric Traut	bbc5675974	Revert "chore: review in read-only (#7593 )" (#8127 ) This reverts commit `291b54a762`. This commit was intended to prevent the model from making code changes during `/review`, which is sometimes does. Unfortunately, it has other unintended side effects that cause `/review` to fail in a variety of ways. See #8115 and #7815. We've therefore decided to revert this change.	2025-12-16 12:01:54 -08:00
jif-oai	d7482510b1	nit: trace span for regular task (#8053 ) Logs are too spammy --------- Co-authored-by: Anton Panasenko <apanasenko@openai.com>	2025-12-16 16:53:15 +00:00
jif-oai	c9f5b9a6df	feat: do not compact on last user turn (#8060 )	2025-12-16 15:36:33 +00:00
jif-oai	ae57e18947	feat: close unified_exec at end of turn (#8052 )	2025-12-16 12:16:43 +00:00
Dylan Hurd	b9d1a087ee	chore(shell_command) fix freeform timeout output (#7791 ) ## Summary Adding an additional integration test for timeout_ms ## Testing - [x] these are tests	2025-12-15 19:26:39 -08:00
Ahmed Ibrahim	c0a12b3952	feat: merge remote models instead of destructing (#7997 ) - merge remote models instead of destructing - make config values have more precedent over remote values	2025-12-15 18:02:35 -08:00
Ahmed Ibrahim	d802b18716	fix parallel tool calls (#7956 )	2025-12-16 01:28:27 +00:00
xl-openai	5d77d4db6b	Reimplement skills loading using SkillsManager + skills/list op. (#7914 ) refactor the way we load and manage skills: 1. Move skill discovery/caching into SkillsManager and reuse it across sessions. 2. Add the skills/list API (Op::ListSkills/SkillsListResponse) to fetch skills for one or more cwds. Also update app-server for VSCE/App; 3. Trigger skills/list during session startup so UIs preload skills and handle errors immediately.	2025-12-14 09:58:17 -08:00
Anton Panasenko	ad7b9d63c3	[codex] add otel tracing (#7844 )	2025-12-12 17:07:17 -08:00
Michael Bolin	642b7566df	fix: introduce AbsolutePathBuf as part of sandbox config (#7856 ) Changes the `writable_roots` field of the `WorkspaceWrite` variant of the `SandboxPolicy` enum from `Vec<PathBuf>` to `Vec<AbsolutePathBuf>`. This is helpful because now callers can be sure the value is an absolute path rather than a relative one. (Though when using an absolute path in a Seatbelt config policy, we still have to _canonicalize_ it first.) Because `writable_roots` can be read from a config file, it is important that we are able to resolve relative paths properly using the parent folder of the config file as the base path.	2025-12-12 15:25:22 -08:00
pakrym-oai	b3ddd50eee	Remote compact for API-key users (#7835 )	2025-12-12 10:05:02 -08:00
Dylan Hurd	9e91e49edb	Revert "fix(apply-patch): preserve CRLF line endings on Windows" (#7903 ) Reverts openai/codex#7515	2025-12-11 17:58:35 -08:00
Victor Vannara	95f7d37ec6	Fix misleading 'maximize' high effort description on xhigh models (#7874 ) ## Notes - switch misleading High reasoning effort descriptions from "Maximizes reasoning depth" to "Higher reasoning depth" across models with xhigh reasoning. Affects GPT-5.1 Codex Max and Robin - refresh model list fixtures and chatwidget snapshots to match new copy ## Revision - R2: Change 'Higher' to 'Greater' - R1: Initial ## Testing <img width="583" height="142" alt="image" src="https://github.com/user-attachments/assets/1ddd8971-7841-4cb3-b9ba-91095a7435d2" /> <img width="838" height="142" alt="image" src="https://github.com/user-attachments/assets/79aaedbf-7624-4695-b822-93dea7d6a800" />	2025-12-11 16:38:52 -08:00

... 10 11 12 13 14 ...

885 Commits