codex

mirror of https://github.com/openai/codex.git synced 2026-04-30 09:26:44 +00:00

Author	SHA1	Message	Date
Jeremy Rose	77e8a45684	Merge branch 'main' into nornagon/fix-openpty-test	2025-12-11 15:12:10 -08:00
Michael Bolin	e0d7ac51d3	fix: policy/.codexpolicy -> rules/.rules (#7888 ) We decided that `.rules` is a more fitting (and concise) file extension than `.codexpolicy`, so we are changing the file extension for the "execpolicy" effort. We are also changing the subfolder of `$CODEX_HOME` from `policy` to `rules` to match. This PR updates the in-repo docs and we will update the public docs once the next CLI release goes out. Locally, I created `~/.codex/rules/default.rules` with the following contents: ``` prefix_rule(pattern=["gh", "pr", "view"]) ``` And then I asked Codex to run: ``` gh pr view 7888 --json title,body,comments ``` and it was able to!	2025-12-11 14:46:00 -08:00
Ahmed Ibrahim	b7fa7ca8e9	Update Model Info (#7853 )	2025-12-11 14:06:07 -08:00
Anton Panasenko	0af7e4a195	fix: omit reasoning summary when ReasoningSummary::None (#7845 ) ``` { "error": { "message": "Invalid value: 'none'. Supported values are: 'concise', 'detailed', and 'auto'.", "type": "invalid_request_error", "param": "reasoning.summary", "code": "invalid_value" } } ```	2025-12-11 11:59:40 -08:00
Ahmed Ibrahim	238ce7dfad	feat: robin (#7882 ) <img width="554" height="554" alt="image" src="https://github.com/user-attachments/assets/aa86f4c8-fb34-4b0e-8b03-3a9980dfdb08" /> --------- Co-authored-by: Dylan Hurd <dylan.hurd@openai.com>	2025-12-11 09:04:08 -08:00
jif-oai	d4554ce6c8	fix: flaky tests 4 (#7875 )	2025-12-11 14:26:27 +00:00
jif-oai	29381ba5c2	feat: add shell snapshot for shell command (#7786 )	2025-12-11 13:46:43 +00:00
Dylan Hurd	dca7f4cb60	fix(stuff) (#7855 ) Co-authored-by: Ahmed Ibrahim <aibrahim@openai.com>	2025-12-11 00:39:47 -08:00
xl-openai	b36ecb6c32	Inject SKILL.md when it's explicitly mentioned. (#7763 ) 1. Skills load once in core at session start; the cached outcome is reused across core and surfaced to TUI via SessionConfigured. 2. TUI detects explicit skill selections, and core injects the matching SKILL.md content into the turn when a selected skill is present.	2025-12-10 13:59:17 -08:00
Ahmed Ibrahim	cb9a189857	make `model` optional in config (#7769 ) - Make Config.model optional and centralize default-selection logic in ModelsManager, including a default_model helper (with codex-auto-balanced when available) so sessions now carry an explicit chosen model separate from the base config. - Resolve `model` once in `core` and `tui` from config. Then store the state of it on other structs. - Move refreshing models to be before resolving the default model	2025-12-10 11:19:00 -08:00
jif-oai	f677d05871	fix: flaky tests 3 (#7826 )	2025-12-10 17:57:53 +00:00
zhao-oai	e0fb3ca1db	refactoring with_escalated_permissions to use SandboxPermissions instead (#7750 ) helpful in the future if we want more granularity for requesting escalated permissions: e.g when running in readonly sandbox, model can request to escalate to a sandbox that allows writes	2025-12-10 17:18:48 +00:00
jif-oai	463249eff3	fix: flaky test 2 (#7818 )	2025-12-10 16:35:28 +00:00
jif-oai	0ad54982ae	chore: rework unified exec events (#7775 )	2025-12-10 10:30:38 +00:00
jif-oai	7836aeddae	feat: shell snapshotting (#7641 )	2025-12-09 18:36:58 +00:00
Ahmed Ibrahim	cacfd003ac	override instructions using `ModelInfo` (#7754 ) Making sure we can override base instructions	2025-12-08 17:30:42 -08:00
pakrym-oai	ac5fa6baf8	Do not emit start/end events for write stdin (#7561 )	2025-12-08 15:23:02 -08:00
Ahmed Ibrahim	222a491570	load models from disk and set a ttl and etag (#7722 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2025-12-08 13:43:04 -08:00
gameofby	98923654d0	fix: refine the warning message and docs for deprecated tools config (#7685 ) Issue #7661 revealed that users are confused by deprecation warnings like: > `tools.web_search` is deprecated. Use `web_search_request` instead. This message misleadingly suggests renaming the config key from `web_search` to `web_search_request`, when the actual required change is to move and rename the configuration from the `[tools]` section to the `[features]` section. This PR clarifies the warning messages and documentation to make it clear that deprecated `[tools]` configurations should be moved to `[features]`. Changes made: - Updated deprecation warning format in `codex-rs/core/src/codex.rs:520` to include `[features].` prefix - Updated corresponding test expectations in `codex-rs/core/tests/suite/deprecation_notice.rs:39` - Improved documentation in `docs/config.md` to clarify upfront that `[tools]` options are deprecated in favor of `[features]`	2025-12-08 01:23:21 -08:00
Ahmed Ibrahim	53a486f7ea	Add remote models feature flag (#7648 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2025-12-07 09:47:48 -08:00
Dylan Hurd	6c9c563faf	fix(apply-patch): preserve CRLF line endings on Windows (#7515 ) ## Summary This PR is heavily based on #4017, which contains the core logic for the fix. To reduce the risk, we are first introducing it only on windows. We can then expand to wsl / other environments as needed, and then tackle net new files. ## Testing - [x] added unit tests in apply-patch - [x] add integration tests to apply_patch_cli.rs --------- Co-authored-by: Chase Naples <Cnaples79@gmail.com>	2025-12-05 16:43:27 -08:00
Pavel Krymets	f48d88067e	Fix unified_exec on windows (#7620 ) Fix unified_exec on windows Requires removal of PSUEDOCONSOLE_INHERIT_CURSOR flag so child processed don't attempt to wait for cursor position response (and timeout). https://github.com/wezterm/wezterm/compare/main...pakrym:wezterm:PSUEDOCONSOLE_INHERIT_CURSOR?expand=1 --------- Co-authored-by: pakrym-oai <pakrym@openai.com>	2025-12-05 20:09:43 +00:00
Dylan Hurd	a8cbbdbc6e	feat(core) Add login to shell_command tool (#6846 ) ## Summary Adds the `login` parameter to the `shell_command` tool - optional, defaults to true. ## Testing - [x] Tested locally	2025-12-05 11:03:25 -08:00
Ahmed Ibrahim	d08efb1743	Wire `with_remote_overrides` to construct model families (#7621 ) - This PR wires `with_remote_overrides` and make the `construct_model_families` an async function - Moves getting model family a level above to keep the function `sync` - Updates the tests to local, offline, and `sync` helper for model families	2025-12-05 10:40:15 -08:00
zhao-oai	b8eab7ce90	fix: taking plan type from usage endpoint instead of thru auth token (#7610 ) pull plan type from the usage endpoint, persist it in session state / tui state, and propagate through rate limit snapshots	2025-12-04 23:34:13 -08:00
Ahmed Ibrahim	7b359c9c8e	Call models endpoint in models manager (#7616 ) - Introduce `with_remote_overrides` and update `refresh_available_models` - Put `auth_manager` instead of `auth_mode` on `models_manager` - Remove `ShellType` and `ReasoningLevel` to use already existing structs	2025-12-04 18:28:03 -08:00
Ahmed Ibrahim	903b7774bc	Add models endpoint (#7603 ) - Use the codex-api crate to introduce models endpoint. - Add `models` to codex core tests helpers - Add `ModelsInfo` for the endpoint return type	2025-12-04 12:57:54 -08:00
Ahmed Ibrahim	9b2055586d	remove `model_family` from `config (#7571 ) - Remove `model_family` from `config` - Make sure to still override config elements related to `model_family` like supporting reasoning	2025-12-04 11:57:58 -08:00
Dylan Hurd	37c36024c7	chore(core): test apply_patch_cli on Windows (#7554 ) ## Summary These tests pass on windows, let's enable them. ## Testing - [x] These are more tests	2025-12-04 10:39:45 -08:00
jif-oai	291b54a762	chore: review in read-only (#7593 )	2025-12-04 10:01:12 -08:00
jif-oai	2b5d0b2935	feat: update sandbox policy to allow TTY (#7580 ) Change: Seatbelt now allows file-ioctl on /dev/ttys[0-9]+ even without the sandbox extension so pre-created PTYs remain interactive (Python REPL, shells). Risk: A seatbelted process that already holds a PTY fd (including one it shouldn’t) could issue tty ioctls like TIOCSTI or termios changes on that fd. This doesn’t allow opening new PTYs or reading/writing them; it only broadens ioctl capability on existing fds. Why acceptable: We already hand the child its PTY for interactive use; restoring ioctls is required for isatty() and prompts to work. The attack requires being given or inheriting a sensitive PTY fd; by design we don’t hand untrusted processes other users’ PTYs (we don't hand them any PTYs actually), so the practical exposure is limited to the PTY intentionally allocated for the session. Validation: Running ``` start a python interpreter and keep it running ``` Followed by: * `calculate 1+1 using it` -> works as expected * `Use this Python session to run the command just fix in /Users/jif/code/codex/codex-rs` -> does not work as expected	2025-12-04 17:58:58 +00:00
zhao-oai	3d35cb4619	Refactor execpolicy fallback evaluation (#7544 ) ## Refactor of the `execpolicy` crate To illustrate why we need this refactor, consider an agent attempting to run `apple \| rm -rf ./`. Suppose `apple` is allowed by `execpolicy`. Before this PR, `execpolicy` would consider `apple` and `pear` and only render one rule match: `Allow`. We would skip any heuristics checks on `rm -rf ./` and immediately approve `apple \| rm -rf ./` to run. To fix this, we now thread a `fallback` evaluation function into `execpolicy` that runs when no `execpolicy` rules match a given command. In our example, we would run `fallback` on `rm -rf ./` and prevent `apple \| rm -rf ./` from being run without approval.	2025-12-03 23:39:48 -08:00
zhao-oai	e925a380dc	whitelist command prefix integration in core and tui (#7033 ) this PR enables TUI to approve commands and add their prefixes to an allowlist: <img width="708" height="605" alt="Screenshot 2025-11-21 at 4 18 07 PM" src="https://github.com/user-attachments/assets/56a19893-4553-4770-a881-becf79eeda32" /> note: we only show the option to whitelist the command when 1) command is not multi-part (e.g `git add -A && git commit -m 'hello world'`) 2) command is not already matched by an existing rule	2025-12-03 23:17:02 -08:00
Ahmed Ibrahim	67e67e054f	Migrate codex max (#7566 ) - make codex max the default - fix: we were doing some async work in sync function which caused tui to panic	2025-12-03 20:54:48 -08:00
Ahmed Ibrahim	cee37a32b2	Migrate model family to models manager (#7565 ) This PR moves `ModelsFamily` to `openai_models`. It also propagates `ModelsManager` to session services and use it to drive model family. We also make `derive_default_model_family` private because it's a step towards what we want: one place that gives model configuration. This is a second step at having one source of truth for models information and config: `ModelsManager`. Next steps would be to remove `ModelsFamily` from config. That's massive because it's being used in 41 occasions mostly pre launching `codex`. Also, we need to make `find_family_for_model` private. It's also big because it's being used in 21 occasions ~ all tests.	2025-12-03 18:49:47 -08:00
Ahmed Ibrahim	00cc00ead8	Introduce `ModelsManager` and migrate `app-server` to use it. (#7552 )	2025-12-03 17:17:56 -08:00
Ahmed Ibrahim	71504325d3	Migrate model preset (#7542 ) - Introduce `openai_models` in `/core` - Move `PRESETS` under it - Move `ModelPreset`, `ModelUpgrade`, `ReasoningEffortPreset`, `ReasoningEffortPreset`, and `ReasoningEffortPreset` to `protocol` - Introduce `Op::ListModels` and `EventMsg::AvailableModels` Next steps: - migrate `app-server` and `tui` to use the introduced Operation	2025-12-03 20:30:43 +00:00
Jeremy Rose	2d50c0a865	core(sandbox): fix openpty test	2025-12-03 10:21:31 -08:00
Jeremy Rose	9b3251f28f	seatbelt: allow openpty() (#7507 ) This allows `openpty(3)` to run in the default sandbox. Also permit reading `kern.argmax`, which is the maximum number of arguments to exec().	2025-12-03 09:15:38 -08:00
jif-oai	51307eaf07	feat: retroactive image placeholder to prevent poisoning (#6774 ) If an image can't be read by the API, it will poison the entire history, preventing any new turn on the conversation. This detect such cases and replace the image by a placeholder	2025-12-03 11:35:56 +00:00
Robby He	f3989f6092	fix(unified_exec): use platform default shell when unified_exec shell… (#7486 ) # Unified Exec Shell Selection on Windows ## Problem reference issue #7466 The `unified_exec` handler currently deserializes model-provided tool calls into the `ExecCommandArgs` struct: ```rust #[derive(Debug, Deserialize)] struct ExecCommandArgs { cmd: String, #[serde(default)] workdir: Option<String>, #[serde(default = "default_shell")] shell: String, #[serde(default = "default_login")] login: bool, #[serde(default = "default_exec_yield_time_ms")] yield_time_ms: u64, #[serde(default)] max_output_tokens: Option<usize>, #[serde(default)] with_escalated_permissions: Option<bool>, #[serde(default)] justification: Option<String>, } ``` The `shell` field uses a hard-coded default: ```rust fn default_shell() -> String { "/bin/bash".to_string() } ``` When the model returns a tool call JSON that only contains `cmd` (which is the common case), Serde fills in `shell` with this default value. Later, `get_command` uses that value as if it were a model-provided shell path: ```rust fn get_command(args: &ExecCommandArgs) -> Vec<String> { let shell = get_shell_by_model_provided_path(&PathBuf::from(args.shell.clone())); shell.derive_exec_args(&args.cmd, args.login) } ``` On Unix, this usually resolves to `/bin/bash` and works as expected. However, on Windows this behavior is problematic: - The hard-coded `"/bin/bash"` is not a valid Windows path. - `get_shell_by_model_provided_path` treats this as a model-specified shell, and tries to resolve it (e.g. via `which::which("bash")`), which may or may not exist and may not behave as intended. - In practice, this leads to commands being executed under a non-default or non-existent shell on Windows (for example, WSL bash), instead of the expected Windows PowerShell or `cmd.exe`. The core of the issue is that "model did not specify `shell`" is currently interpreted as "the model explicitly requested `/bin/bash`", which is both Unix-specific and wrong on Windows. ## Proposed Solution Instead of hard-coding `"/bin/bash"` into `ExecCommandArgs`, we should distinguish between: 1. The model explicitly specifying a shell, e.g.: ```json { "cmd": "echo hello", "shell": "pwsh" } ``` In this case, we do want to respect the model’s choice and use `get_shell_by_model_provided_path`. 2. The model omitting the `shell` field entirely, e.g.: ```json { "cmd": "echo hello" } ``` In this case, we should not assume `/bin/bash`. Instead, we should use `default_user_shell()` and let the platform decide. To express this distinction, we can: 1. Change `shell` to be optional in `ExecCommandArgs`: ```rust #[derive(Debug, Deserialize)] struct ExecCommandArgs { cmd: String, #[serde(default)] workdir: Option<String>, #[serde(default)] shell: Option<String>, #[serde(default = "default_login")] login: bool, #[serde(default = "default_exec_yield_time_ms")] yield_time_ms: u64, #[serde(default)] max_output_tokens: Option<usize>, #[serde(default)] with_escalated_permissions: Option<bool>, #[serde(default)] justification: Option<String>, } ``` Here, the absence of `shell` in the JSON is represented as `shell: None`, rather than a hard-coded string value.	2025-12-02 21:49:25 -08:00
jif-oai	72b95db12f	feat: intercept apply_patch for unified_exec (#7446 )	2025-12-02 17:54:02 +00:00
jif-oai	4b78e2ab09	chore: review everywhere (#7444 )	2025-12-02 11:26:27 +00:00
Thibault Sottiaux	a8d5ad37b8	feat: experimental support for skills.md (#7412 ) This change prototypes support for Skills with the CLI. This is an experimental feature for internal testing. --------- Co-authored-by: Gav Verma <gverma@openai.com>	2025-12-01 20:22:35 -08:00
Dylan Hurd	5b25915d7e	fix(apply_patch) tests for shell_command (#7307 ) ## Summary Adds test coverage for invocations of apply_patch via shell_command with heredoc, to validate behavior. ## Testing - [x] These are tests	2025-12-01 15:09:22 -08:00
jif-oai	a421eba31f	fix: disable review rollout filtering (#7371 )	2025-12-01 09:04:13 +00:00
jif-oai	aaec8abf58	feat: detached review (#7292 )	2025-11-28 11:34:57 +00:00
jif-oai	28ff364c3a	feat: update process ID for event handling (#7261 )	2025-11-25 14:21:05 -08:00
jif-oai	9ba27cfa0a	feat: add compaction event (#7289 )	2025-11-25 16:12:14 +00:00
jif-oai	fc2ff624ac	fix: don't store early exit sessions (#7263 )	2025-11-24 21:14:24 +00:00

1 2 3 4 5 ...

379 Commits