codex

mirror of https://github.com/openai/codex.git synced 2026-04-24 22:54:54 +00:00

Author	SHA1	Message	Date
gt-oai	e85d019daa	Fetch Requirements from cloud (#10167 ) Load requirements from Codex Backend. It only does this for enterprise customers signed in with ChatGPT. Todo in follow-up PRs: * Add to app-server and exec too * Switch from fail-open to fail-closed on failure	2026-01-30 12:03:29 +00:00
pap-openai	1ef5455eb6	Conversation naming (#8991 ) Session renaming: - `/rename my_session` - `/rename` without arg and passing an argument in `customViewPrompt` - AppExitInfo shows resume hint using the session name if set instead of uuid, defaults to uuid if not set - Names are stored in `CODEX_HOME/sessions.jsonl` Session resuming: - codex resume <name> lookup for `CODEX_HOME/sessions.jsonl` first entry matching the name and resumes the session --------- Co-authored-by: jif-oai <jif@openai.com>	2026-01-30 10:40:09 +00:00
jif-oai	25ad414680	chore: unify metric (#10220 )	2026-01-30 11:13:43 +01:00
jif-oai	129787493f	feat: backfill timing metric (#10218 ) 1. Add a metric to measure the backfill time 2. Add a unit to the timing histogram	2026-01-30 10:19:41 +01:00
Shijie Rao	a0ccef9d5c	Chore: plan mode do not include free form question and always include isOther (#10210 ) We should never ask a freeform question when planning and we should always include isOther as an escape hatch.	2026-01-30 01:19:24 -08:00
jif-oai	f8056e62d4	nit: actually run tests (#10217 )	2026-01-30 10:02:46 +01:00
Matthew Zeng	34f89b12d0	MCP tool call approval (simplified version) (#10200 ) Add elicitation approval request for MCP tool call requests.	2026-01-29 23:40:32 -08:00
Dylan Hurd	e3ab0bd973	chore(personality) new schema with fallbacks (#10147 ) ## Summary Let's dial in this api contract in a bit more with more robust fallback behavior when model_instructions_template is false. Switches to a more explicit template / variables structure, with more fallbacks. ## Testing - [x] Adding unit tests - [x] Tested locally	2026-01-30 00:10:12 -07:00
alexsong-oai	d550fbf41a	load from yaml (#10194 )	2026-01-29 21:44:12 -05:00
willwang-openai	a9cf449a80	add error messages for the go plan type (#10181 ) Adds support for the Go plan type Updates rate limit error messages to point to the usage page	2026-01-30 01:17:25 +00:00
Celia Chen	7151387474	[feat] persist dynamic tools in session rollout file (#10130 ) Add dynamic tools to rollout file for persistence & read from rollout on resume. Ran a real example and spotted the following in the rollout file: ``` {"timestamp":"2026-01-29T01:27:57.468Z","type":"session_meta","payload":{"id":"019c075d-3f0b-77e3-894e-c1c159b04b1e","timestamp":"2026-01-29T01:27:57.451Z","...."dynamic_tools":[{"name":"demo_tool","description":"Demo dynamic tool","inputSchema":{"additionalProperties":false,"properties":{"city":{"type":"string"}},"required":["city"],"type":"object"}}],"git":{"commit_hash":"ebc573f15c01b8af158e060cfedd401f043e9dfa","branch":"dev/cc/dynamic-tools","repository_url":"https://github.com/openai/codex.git"}}} ```	2026-01-30 01:10:00 +00:00
Owen Lin	81a17bb2c1	feat(app-server): support external auth mode (#10012 ) This enables a new use case where `codex app-server` is embedded into a parent application that will directly own the user's ChatGPT auth lifecycle, which means it owns the user’s auth tokens and refreshes it when necessary. The parent application would just want a way to pass in the auth tokens for codex to use directly. The idea is that we are introducing a new "auth mode" currently only exposed via app server: `chatgptAuthTokens` which consist of the `id_token` (stores account metadata) and `access_token` (the bearer token used directly for backend API calls). These auth tokens are only stored in-memory. This new mode is in addition to the existing `apiKey` and `chatgpt` auth modes. This PR reuses the shape of our existing app-server account APIs as much as possible: - Update `account/login/start` with a new `chatgptAuthTokens` variant, which will allow the client to pass in the tokens and have codex app-server use them directly. Upon success, the server emits `account/login/completed` and `account/updated` notifications. - A new server->client request called `account/chatgptAuthTokens/refresh` which the server can use whenever the access token previously passed in has expired and it needs a new one from the parent application. I leveraged the core 401 retry loop which typically triggers auth token refreshes automatically, but made it pluggable: - chatgpt mode refreshes internally, as usual. - chatgptAuthTokens mode calls the client via `account/chatgptAuthTokens/refresh`, the client responds with updated tokens, codex updates its in-memory auth, then retries. This RPC has a 10s timeout and handles JSON-RPC errors from the client. Also some additional things: - chatgpt logins are blocked while external auth is active (have to log out first. typically clients will pick one OR the other, not support both) - `account/logout` clears external auth in memory - Ensures that if `forced_chatgpt_workspace_id` is set via the user's config, we respect it in both: - `account/login/start` with `chatgptAuthTokens` (returns a JSON-RPC error back to the client) - `account/chatgptAuthTokens/refresh` (fails the turn, and on next request app-server will send another `account/chatgptAuthTokens/refresh` request to the client).	2026-01-29 23:46:04 +00:00
Colin Young	b79bf69af6	[Codex][CLI] Show model-capacity guidance on 429 (#10118 ) ###### Problem Users get generic 429s with no guidance when a model is at capacity. ###### Solution Detect model-cap headers, surface a clear “try a different model” message, and keep behavior non‑intrusive (no auto‑switch). ###### Scope CLI/TUI only; protocol + error mapping updated to carry model‑cap info. ###### Tests - just fmt - cargo test -p codex-tui - cargo test -p codex-core --lib shell_snapshot::tests::try_new_creates_and_deletes_snapshot_file -- --nocapture (ran in isolated env) - validate local build with backend <img width="719" height="845" alt="image" src="https://github.com/user-attachments/assets/1470b33d-0974-4b1f-b8e6-d11f892f4b54" />	2026-01-29 14:59:07 -08:00
pakrym-oai	fbb3a30953	Remove WebSocket wire format (#10179 ) I'd like WireApi to go away (when chat is removed) and WebSockets is still responses API just over a different transport.	2026-01-29 13:50:53 -08:00
xl-openai	bdd8a7d58b	Better handling skill depdenencies on ENV VAR. (#9017 ) An experimental flow for env var skill dependencies. Skills can now declare required env vars in SKILL.md; if missing, the CLI prompts the user to get the value, and Core will store it in memory (eventually to a local persistent store) <img width="790" height="169" alt="image" src="https://github.com/user-attachments/assets/cd928918-9403-43cb-a7e7-b8d59bcccd9a" />	2026-01-29 14:13:30 -05:00
pakrym-oai	3b1cddf001	Fall back to http when websockets fail (#10139 ) I expect not all proxies work with websockets, fall back to http if websockets fail.	2026-01-29 10:36:21 -08:00
jif-oai	798c4b3260	feat: reduce span exposition (#10171 ) This only avoids the creation of duplicates spans	2026-01-29 18:15:22 +00:00
jif-oai	e6c4f548ab	chore: unify log queries (#10152 ) Unify log queries to only have SQLX code in the runtime and use it for both the log client and for tests	2026-01-29 16:28:15 +00:00
jif-oai	89c5f3c4d4	feat: adding thread ID to logs + filter in the client (#10150 )	2026-01-29 16:53:30 +01:00
jif-oai	714dc8d8bd	feat: async backfill (#10089 )	2026-01-29 09:57:50 +00:00
jif-oai	780482da84	feat: add log db (#10086 ) Add a log DB. The goal is just to store our logs in a `.sqlite` DB to make it easier to crawl them and drop the oldest ones.	2026-01-29 10:23:03 +01:00
iceweasel-oai	8cc338aecf	emit a metric when we can't spawn powershell (#10125 ) This will help diagnose and measure the impact of a user-reported bug with the elevated sandbox and powershell	2026-01-28 21:51:51 -08:00
Dylan Hurd	335713f7e9	chore(core) personality under development (#10133 ) ## Summary Have one or two more changes coming in for this.	2026-01-28 22:00:48 -07:00
Matthew Zeng	b9cd089d1f	[connectors] Support connectors part 2 - slash command and tui (#9728 ) - [x] Support `/apps` slash command to browse the apps in tui. - [x] Support inserting apps to prompt using `$`. - [x] Lots of simplification/renaming from connectors to apps.	2026-01-28 19:51:58 -08:00
Dylan Hurd	9757e1418d	chore(config) Update personality instructions (#10114 ) ## Summary Add personality instructions so we can let users try it out, in tandem with making it an experimental feature ## Testing - [x] Tested locally	2026-01-29 01:14:44 +00:00
Dylan Hurd	ce3d764ae1	chore(config) personality as a feature (#10116 ) ## Summary Sets up an explicit Feature flag for `/personality`, so users can now opt in to it via `/experimental`. #10114 also updates the config ## Testing - [x] Tested locally	2026-01-28 17:58:28 -07:00
Ahmed Ibrahim	26590d7927	Ensure auto-compaction starts after turn started (#10129 ) Start auto-compaction only after TurnStarted is emitted.\nAdd an integration test for deterministic ordering.	2026-01-28 16:51:20 -08:00
zbarsky-openai	8497163363	[bazel] Improve runfiles handling (#10098 ) we can't use runfiles directory on Windows due to path lengths, so swap to manifest strategy. Parsing the manifest is a bit complex and the format is changing in Bazel upstream, so pull in the official Rust library (via a small hack to make it importable...) and cleanup all the associated logic to work cleanly in both bazel and cargo without extra confusion	2026-01-29 00:15:44 +00:00
sayan-oai	ff9fa56368	default enable compression, update test helpers (#10102 ) set `enable_request_compression` flag to default-enabled. update integration test helpers to decompress `zstd` if flag set.	2026-01-28 12:25:40 -08:00
Eric Traut	147e7118e0	Added `tui.notifications_method` config option (#10043 ) This PR adds a new `tui.notifications_method` config option that accepts values of "auto", "osc9" and "bel". It defaults to "auto", which attempts to auto-detect whether the terminal supports OSC 9 escape sequences and falls back to BEL if not. The PR also removes the inconsistent handling of notifications on Windows when WSL was used.	2026-01-28 12:00:32 -08:00
iceweasel-oai	66de985e4e	allow elevated sandbox to be enabled without base experimental flag (#10028 ) elevated flag = elevated sandbox experimental flag = non-elevated sandbox both = elevated	2026-01-28 11:38:29 -08:00
Ahmed Ibrahim	b7edeee8ca	compaction (#10034 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2026-01-28 11:36:11 -08:00
sayan-oai	851617ff5a	chore: deprecate old web search feature flags (#10097 ) deprecate all old web search flags and aliases, including: - `[features].web_search_request` and `[features].web_search_cached` - `[tools].web_search` - `[features].web_search` slightly rework `legacy_usages` to enable pointing to non-features from deprecated features; we need to point to `web_search` (not under `[features]`) from things like `[features].web_search_cached` and `[features].web_search_request`. Added integration tests to confirm deprecation notice is shown on explicit enablement and disablement of deprecated flags.	2026-01-28 10:55:57 -08:00
Jeremy Rose	b8156706e6	file-search: improve file query perf (#9939 ) switch nucleo-matcher for nucleo and use a "file search session" w/ live updating query instead of a single hermetic run per query.	2026-01-28 10:54:43 -08:00
jif-oai	231406bd04	feat: sort metadata by date (#10083 )	2026-01-28 16:19:08 +01:00
jif-oai	3878c3dc7c	feat: sqlite 1 (#10004 ) Add a `.sqlite` database to be used to store rollout metatdata (and later logs) This PR is phase 1: * Add the database and the required infrastructure * Add a backfill of the database * Persist the newly created rollout both in files and in the DB * When we need to get metadata or a rollout, consider the `JSONL` as the source of truth but compare the results with the DB and show any errors	2026-01-28 15:29:14 +01:00
gt-oai	71b8d937ed	Add exec policy TOML representation (#10026 ) We'd like to represent these in `requirements.toml`. This just adds the representation and the tests, doesn't wire it up anywhere yet.	2026-01-28 12:00:10 +00:00
Dylan Hurd	996e09ca24	feat(core) RequestRule (#9489 ) ## Summary Instead of trying to derive the prefix_rule for a command mechanically, let's let the model decide for us. ## Testing - [x] tested locally	2026-01-28 08:43:17 +00:00
iceweasel-oai	9f79365691	error code/msg details for failed elevated setup (#9941 )	2026-01-27 23:06:10 -08:00
Dylan Hurd	fef3e36f67	fix(core) info cleanup (#9986 ) ## Summary Simplify this logic a bit.	2026-01-27 21:15:15 -07:00
Matthew Zeng	3bb8e69dd3	[skills] Auto install MCP dependencies when running skils with dependency specs. (#9982 ) Auto install MCP dependencies when running skils with dependency specs.	2026-01-27 19:02:45 -08:00
sayan-oai	1609f6aa81	fix: allow unknown fields on Notice in schema (#10041 ) the `notice` field didn't allow unknown fields in the schema, leading to issues where they shouldn't be. Now we allow unknown fields. <img width="2260" height="720" alt="image" src="https://github.com/user-attachments/assets/1de43b60-0d50-4a96-9c9c-34419270d722" />	2026-01-27 18:24:24 -08:00
sayan-oai	a90ab789c2	fix: enable per-turn updates to web search mode (#10040 ) web_search can now be updated per-turn, for things like changes to sandbox policy. `SandboxPolicy::DangerFullAccess` now sets web_search to `live`, and the default is still `cached`. Added integration tests.	2026-01-27 18:09:29 -08:00
sayan-oai	28051d18c6	enable live web search for DangerFullAccess sandbox policy (#10008 ) Auto-enable live `web_search` tool when sandbox policy is `DangerFullAccess`. Explicitly setting `web_search` (canonical setting), or enabling `web_search_cached` or `web_search_request` still takes precedence over this sandbox-policy-driven enablement.	2026-01-27 20:09:05 +00:00
alexsong-oai	2f8a44baea	Remove load from SKILL.toml fallback (#10007 )	2026-01-27 12:06:40 -08:00
iceweasel-oai	c40ad65bd8	remove sandbox globals. (#9797 ) Threads sandbox updates through OverrideTurnContext for active turn Passes computed sandbox type into safety/exec	2026-01-27 11:04:23 -08:00
Owen Lin	fc0fd85349	fix(app-server, core): defer initial context write to rollout file until first turn (#9950 ) ### Overview Currently calling `thread/resume` will always bump the thread's `updated_at` timestamp. This PR makes it the `updated_at` timestamp changes only if a turn is triggered. ### Additonal context What we typically do on resuming a thread is always writing “initial context” to the rollout file immediately. This initial context includes: - Developer instructions derived from sandbox/approval policy + cwd - Optional developer instructions (if provided) - Optional collaboration-mode instructions - Optional user instructions (if provided) - Environment context (cwd, shell, etc.) This PR defers writing the “initial context” to the rollout file until the first `turn/start`, so we don't inadvertently bump the thread's `updated_at` timestamp until a turn is actually triggered. This works even though both `thread/resume` and `turn/start` accept overrides (such as `model`, `cwd`, etc.) because the initial context is seeded from the effective `TurnContext` in memory, computed at `turn/start` time, after both sets of overrides have been applied. NOTE: This is a very short-lived solution until we introduce sqlite. Then we can remove this.	2026-01-27 10:41:54 -08:00
jif-oai	067922a734	description in role type (#9993 )	2026-01-27 17:20:07 +00:00
jif-oai	3b726d9550	chore: clean orchestrator prompt (#9994 )	2026-01-27 16:32:05 +00:00
jif-oai	74ffbbe7c1	nit: better unused prompt (#9991 )	2026-01-27 13:03:12 +00:00

1 2 3 4 5 ...

1398 Commits