codex

mirror of https://github.com/openai/codex.git synced 2026-05-28 15:00:16 +00:00

Author	SHA1	Message	Date
Ahmed Ibrahim	c7e6666337	auto-picker	2025-11-18 15:37:25 -08:00
Ahmed Ibrahim	e7227ffec8	tests	2025-11-18 14:49:32 -08:00
Ahmed Ibrahim	f0cc2db4c3	tests	2025-11-18 14:48:57 -08:00
Ahmed Ibrahim	c03d43be6f	all models	2025-11-18 14:47:12 -08:00
Ahmed Ibrahim	44058a2d27	all models	2025-11-18 14:43:03 -08:00
Ahmed Ibrahim	90fdb118f4	Merge branch 'auto-picker' of github.com:openai/codex into auto-picker	2025-11-18 14:38:42 -08:00
Ahmed Ibrahim	0afeea3b5e	tests	2025-11-18 14:34:14 -08:00
Ahmed Ibrahim	e3809caaaf	Merge branch 'main' into auto-picker	2025-11-18 13:50:14 -08:00
Dylan Hurd	29ca89c414	chore(config) enable shell_command (#6843 ) ## Summary Enables shell_command as default for `gpt-5` and `codex-` models. ## Testing - [x] Updated unit tests	2025-11-18 12:46:02 -08:00
Ahmed Ibrahim	1fafbb0fb8	tests	2025-11-18 12:33:54 -08:00
Ahmed Ibrahim	c10bcc6780	tests	2025-11-18 12:29:51 -08:00
Ahmed Ibrahim	77bc873059	tests	2025-11-18 12:28:56 -08:00
Ahmed Ibrahim	c1b7e6f9a8	auto-picker	2025-11-18 12:13:12 -08:00
Ahmed Ibrahim	0ce1ed78e4	tests	2025-11-18 12:00:05 -08:00
Ahmed Ibrahim	c36291f233	tests	2025-11-18 11:43:59 -08:00
iceweasel-oai	4bada5a84d	Prompt to turn on windows sandbox when auto mode selected. (#6618 ) - stop prompting users to install WSL - prompt users to turn on Windows sandbox when auto mode requested. <img width="1660" height="195" alt="Screenshot 2025-11-17 110612" src="https://github.com/user-attachments/assets/c67fc239-a227-417e-94bb-599a8ed8f11e" /> <img width="1684" height="168" alt="Screenshot 2025-11-17 110637" src="https://github.com/user-attachments/assets/d18c3370-830d-4971-8746-04757ae2f709" /> <img width="1655" height="293" alt="Screenshot 2025-11-17 110719" src="https://github.com/user-attachments/assets/d21f6ce9-c23e-4842-baf6-8938b77c16db" />	2025-11-18 11:38:18 -08:00
Ahmed Ibrahim	3de8790714	Add the utility to truncate by tokens (#6746 ) - This PR is to make it on path for truncating by tokens. This path will be initially used by unified exec and context manager (responsible for MCP calls mainly). - We are exposing new config `calls_output_max_tokens` - Use `tokens` as the main budget unit but truncate based on the model family by Introducing `TruncationPolicy`. - Introduce `truncate_text` as a router for truncation based on the mode. In next PRs: - remove truncate_with_line_bytes_budget - Add the ability to the model to override the token budget.	2025-11-18 11:36:23 -08:00
Ahmed Ibrahim	9435d39d4c	tests	2025-11-18 11:34:19 -08:00
Ahmed Ibrahim	c025309d32	tests	2025-11-18 11:19:38 -08:00
Ahmed Ibrahim	093cd9cf04	function	2025-11-18 10:54:47 -08:00
Alejandro Peña	b035c604b0	Update faq.md section on supported models (#6832 ) Update faq.md to recommend usage of GPT-5.1 Codex, the latest Codex model from OpenAI.	2025-11-18 09:38:45 -08:00
zhao-oai	e9e644a119	fixing localshell tool calls (#6823 ) - Local-shell tool responses were always tagged as `ExecCommandSource::UserShell` because handler would call `run_exec_like` with `is_user_shell_cmd` set to true. - Treat `ToolPayload::LocalShell` the same as other model generated shell tool calls by deleting `is_user_shell_cmd` from `run_exec_like` (since actual user shell commands follow a separate code path)	2025-11-18 17:28:26 +00:00
jif-oai	f5d9939cda	feat: enable parallel tool calls (#6796 )	2025-11-18 17:10:14 +00:00
jif-oai	838531d3e4	feat: remote compaction (#6795 ) Co-authored-by: pakrym-oai <pakrym@openai.com>	2025-11-18 16:51:16 +00:00
jif-oai	0eb2e6f9ee	nit: app server (#6830 )	2025-11-18 16:34:13 +00:00
jif-oai	c20df79a38	nit: mark ghost commit as stable (#6833 )	2025-11-18 16:05:49 +00:00
jif-oai	fc55fd7a81	feat: git branch tooling (#6831 )	2025-11-18 15:26:09 +00:00
Ahmed Ibrahim	9923f76831	auto	2025-11-18 00:17:19 -08:00
Lael	f3d4e210d8	🐛 fix(rmcp-client): refresh OAuth tokens using expires_at (#6574 ) ## Summary - persist OAuth credential expiry timestamps and rehydrate `expires_in` - proactively refresh rmcp OAuth tokens when `expires_at` is near, then persist ## Testing - just fmt - just fix -p codex-rmcp-client - cargo test -p codex-rmcp-client Fixes #6572	2025-11-18 02:16:58 -05:00
Dylan Hurd	28ebe1c97a	fix(windows) shell_command on windows, minor parsing (#6811 ) ## Summary Enables shell_command for windows users, and starts adding some basic command parsing here, to at least remove powershell prefixes. We'll follow this up with command parsing but I wanted to land this change separately with some basic UX. NOTE: This implementation parses bash and powershell on both platforms. In theory this is possible, since you can use git bash on windows or powershell on linux. In practice, this may not be worth the complexity of supporting, so I don't feel strongly about the current approach vs. platform-specific branching. ## Testing - [x] Added a bunch of tests - [x] Ran on both windows and os x	2025-11-17 22:23:53 -08:00
Dylan Hurd	2b7378ac77	chore(core) Add shell_serialization coverage (#6810 ) ## Summary Similar to #6545, this PR updates the shell_serialization test suite to cover the various `shell` tool invocations we have. Note that this does not cover unified_exec, which has its own suite of tests. This should provide some test coverage for when we eventually consolidate serialization logic. ## Testing - [x] These are tests	2025-11-17 19:10:56 -08:00
Ahmed Ibrahim	ddcc60a085	Update defaults to gpt-5.1 (#6652 ) ## Summary - update documentation, example configs, and automation defaults to reference gpt-5.1 / gpt-5.1-codex - bump the CLI and core configuration defaults, model presets, and error messaging to the new models while keeping the model-family/tool coverage for legacy slugs - refresh tests, fixtures, and TUI snapshots so they expect the upgraded defaults ## Testing - `cargo test -p codex-core config::tests::test_precedence_fixture_with_gpt5_profile` ------ [Codex Task](https://chatgpt.com/codex/tasks/task_i_6916c5b3c2b08321ace04ee38604fc6b)	2025-11-17 17:40:11 -08:00
cassirer-openai	8465f1f2f4	Demote function call payload log to debug to avoid noisy error-level stderr (#6808 )	2025-11-18 01:16:11 +00:00
zhao-oai	7ab45487dd	execpolicy2 extension (#6627 ) - enabling execpolicy2 parser to parse multiple policy files to build a combined `Policy` (useful if codex detects many `.codexpolicy` files) - adding functionality to `Policy` to allow evaluation of multiple cmds at once (useful when we have chained commands)	2025-11-17 16:44:41 -08:00
Owen Lin	cecbd5b021	[app-server] feat: add v2 command execution approval flow (#6758 ) This PR adds the API V2 version of the command‑execution approval flow for the shell tool. This PR wires the new RPC (`item/commandExecution/requestApproval`, V2 only) and related events (`item/started`, `item/completed`, and `item/commandExecution/delta`, which are emitted in both V1 and V2) through the app-server protocol. The new approval RPC is only sent when the user initiates a turn with the new `turn/start` API so we don't break backwards compatibility with VSCE. The approach I took was to make as few changes to the Codex core as possible, leveraging existing `EventMsg` core events, and translating those in app-server. I did have to add additional fields to `EventMsg::ExecCommandEndEvent` to capture the command's input so that app-server can statelessly transform these events to a `ThreadItem::CommandExecution` item for the `item/completed` event. Once we stabilize the API and it's complete enough for our partners, we can work on migrating the core to be aware of command execution items as a first-class concept. Note: We'll need followup work to make sure these APIs work for the unified exec tool, but will wait til that's stable and landed before doing a pass on app-server. Example payloads below: ``` { "method": "item/started", "params": { "item": { "aggregatedOutput": null, "command": "/bin/zsh -lc 'touch /tmp/should-trigger-approval'", "cwd": "/Users/owen/repos/codex/codex-rs", "durationMs": null, "exitCode": null, "id": "call_lNWWsbXl1e47qNaYjFRs0dyU", "parsedCmd": [ { "cmd": "touch /tmp/should-trigger-approval", "type": "unknown" } ], "status": "inProgress", "type": "commandExecution" } } } ``` ``` { "id": 0, "method": "item/commandExecution/requestApproval", "params": { "itemId": "call_lNWWsbXl1e47qNaYjFRs0dyU", "parsedCmd": [ { "cmd": "touch /tmp/should-trigger-approval", "type": "unknown" } ], "reason": "Need to create file in /tmp which is outside workspace sandbox", "risk": null, "threadId": "019a93e8-0a52-7fe3-9808-b6bc40c0989a", "turnId": "1" } } ``` ``` { "id": 0, "result": { "acceptSettings": { "forSession": false }, "decision": "accept" } } ``` ``` { "params": { "item": { "aggregatedOutput": null, "command": "/bin/zsh -lc 'touch /tmp/should-trigger-approval'", "cwd": "/Users/owen/repos/codex/codex-rs", "durationMs": 224, "exitCode": 0, "id": "call_lNWWsbXl1e47qNaYjFRs0dyU", "parsedCmd": [ { "cmd": "touch /tmp/should-trigger-approval", "type": "unknown" } ], "status": "completed", "type": "commandExecution" } } } ```	2025-11-18 00:23:54 +00:00
zhao-oai	4000e26303	background rate limits fetch (#6789 ) fetching rate limits every minute asynchronously	2025-11-17 16:06:26 -08:00
iceweasel-oai	e032d338f2	move cap_sid file into ~/.codex so the sandbox cannot overwrite it (#6798 ) The `cap_sid` file contains the IDs of the two custom SIDs that the Windows sandbox creates/manages to implement read-only and workspace-write sandbox policies. It previously lived in `<cwd>/.codex` which means that the sandbox could write to it, which could degrade the efficacy of the sandbox. This change moves it to `~/.codex/` (or wherever `CODEX_HOME` points to) so that it is outside the workspace.	2025-11-17 15:49:41 -08:00
Eric Traut	8bebe86a47	Fix TUI issues with Alt-Gr on Windows (#6799 ) This PR fixes keyboard handling for the Right Alt (aka "Alt-Gr") key on Windows. This key appears on keyboards in Central and Eastern Europe. Codex has effectively never worked for Windows users in these regions because the code didn't properly handle this key, which is used for typing common symbols like `\` and `@`. A few days ago, I merged a [community-authored PR](https://github.com/openai/codex/pull/6720) that supplied a partial fix for this issue. Upon closer inspect, that PR was 1) too broad (not scoped to Windows only) and 2) incomplete (didn't fix all relevant code paths, so paste was still broken). This improvement is based on another [community-provided PR](https://github.com/openai/codex/pull/3241) by @marektomas-cz. He submitted it back in September and later closed it because it didn't receive any attention. This fix addresses the following bugs: #5922, #3046, #3092, #3519, #5684, #5843.	2025-11-17 15:18:16 -08:00
Jeremy Rose	ab2e7499f8	core: add a feature to disable the shell tool (#6481 ) `--disable shell_tool` disables the built-in shell tool. This is useful for MCP-only operation. --------- Co-authored-by: Michael Bolin <mbolin@openai.com>	2025-11-17 22:56:19 +00:00
Dylan Hurd	daf77b8452	chore(core) Update shell instructions (#6679 ) ## Summary Consolidates `shell` and `shell_command` tool instructions. ## Testing - [x] Updated tests, tested locally	2025-11-17 13:05:15 -08:00
Owen Lin	03a6e853c0	fix: annotate all app server v2 types with camelCase (#6791 )	2025-11-17 12:02:52 -08:00
rugvedS07	837bc98a1d	LM Studio OSS Support (#2312 ) ## Overview Adds LM Studio OSS support. Closes #1883 ### Changes This PR enhances the behavior of `--oss` flag to support LM Studio as a provider. Additionally, it introduces a new flag`--local-provider` which can take in `lmstudio` or `ollama` as values if the user wants to explicitly choose which one to use. If no provider is specified `codex --oss` will auto-select the provider based on whichever is running. #### Additional enhancements The default can be set using `oss-provider` in config like: ``` oss_provider = "lmstudio" ``` For non-interactive users, they will need to either provide the provider as an arg or have it in their `config.toml` ### Notes For best performance, [set the default context length](https://lmstudio.ai/docs/app/advanced/per-model) for gpt-oss to the maximum your machine can support --------- Co-authored-by: Matt Clayton <matt@lmstudio.ai> Co-authored-by: Eric Traut <etraut@openai.com>	2025-11-17 11:49:09 -08:00
Celia Chen	842a1b7fe7	[app-server] add events to readme (#6690 ) add table of contents, lifecycle and events to readme.	2025-11-17 19:28:05 +00:00
Jeremy Rose	03ffe4d595	core/tui: non-blocking MCP startup (#6334 ) This makes MCP startup not block TUI startup. Messages sent while MCPs are booting will be queued. https://github.com/user-attachments/assets/96e1d234-5d8f-4932-a935-a675d35c05e0 Fixes #6317 --------- Co-authored-by: pakrym-oai <pakrym@openai.com>	2025-11-17 11:26:11 -08:00
Owen Lin	ae2a084fae	chore: delete chatwidget::tests::binary_size_transcript_snapshot tui test (#6759 ) We're running into quite a bit of drag maintaining this test, since every time we add fields to an EventMsg that happened to be dumped into the `binary-size-log.jsonl` fixture, this test starts to fail. The fix is usually to either manually update the `binary-size-log.jsonl` fixture file, or update the `upgrade_event_payload_for_tests` function to map the data in that file into something workable. Eason says it's fine to delete this test, so let's just delete it	2025-11-17 11:11:41 -08:00
zhao-oai	a941ae7632	feat: execpolicy v2 (#6467 ) ## Summary - Introduces the `codex-execpolicy2` crate. - This PR covers only the prefix-rule subset of the planned execpolicy v2 language; a richer language will follow. ## Policy - Policy language centers on `prefix_rule(pattern=[...], decision?, match?, not_match?)`, where `pattern` is an ordered list of tokens; any element may be a list to denote alternatives. `decision` defaults to `allow`; valid values are `allow`, `prompt`, and `forbidden`. `match` / `not_match` hold example commands that are tokenized and validated at load time (think of these as unit tests). ## Policy shapes - Prefix rules use Starlark syntax: ```starlark prefix_rule( pattern = ["cmd", ["alt1", "alt2"]], # ordered tokens; list entries denote alternatives decision = "prompt", # allow \| prompt \| forbidden; defaults to allow match = [["cmd", "alt1"]], # examples that must match this rule (enforced at compile time) not_match = [["cmd", "oops"]], # examples that must not match this rule (enforced at compile time) ) ``` ## Response shapes - Match: ```json { "match": { "decision": "allow\|prompt\|forbidden", "matchedRules": [ { "prefixRuleMatch": { "matchedPrefix": ["<token>", "..."], "decision": "allow\|prompt\|forbidden" } } ] } } ``` - No match: ```json "noMatch" ``` - `matchedRules` lists every rule whose prefix matched the command; `matchedPrefix` is the exact prefix that matched. - The effective `decision` is the strictest severity across all matches (`forbidden` > `prompt` > `allow`). --------- Co-authored-by: Michael Bolin <mbolin@openai.com>	2025-11-17 10:15:45 -08:00
jif-oai	2c665fb1dd	nit: personal git ignore (#6787 )	2025-11-17 17:45:52 +00:00
jif-oai	98a90a3bb2	tmp: drop sccache for windows 2 (#6775 )	2025-11-17 16:39:15 +00:00
jif-oai	7c8d333980	feat: placeholder for image that can't be decoded to prevent 400 (#6773 )	2025-11-17 16:10:53 +00:00
Dylan Hurd	497fb4a19c	fix(core) serialize shell_command (#6744 ) ## Summary Ensures we're serializing calls to `shell_command` ## Testing - [x] Added unit test	2025-11-16 23:16:51 -08:00

1 2 3 4 5 ...

2032 Commits