codex

mirror of https://github.com/openai/codex.git synced 2026-05-20 03:05:02 +00:00

Author	SHA1	Message	Date
jif-oai	4fd7dfe223	memories-mcp: reject symlink traversal in local backend (#21010 ) ## Why The local memories MCP backend only rejected symlinks after resolving the final path. That left room for scoped requests like `skills/secret.md` to walk through a symlinked ancestor directory and escape the configured memories root. This change also makes missing scoped paths fail explicitly instead of looking like an empty `list` / `search` result or a `NotFile` read error. ## What Changed - walk each scoped path component in `LocalMemoriesBackend::resolve_scoped_path` and reject symlinked ancestors before accessing the target - reject scoped paths that traverse through a non-directory intermediate component - add a `NotFound` backend error for missing `read`, `list`, and `search` paths and map it through the MCP server error conversion - add coverage for missing paths and symlinked ancestor directories in `codex-rs/memories/mcp/src/local_tests.rs` ## Testing - added unit coverage in `codex-rs/memories/mcp/src/local_tests.rs` for missing paths and symlinked ancestor directories across `read`, `list`, and `search`	2026-05-04 18:40:28 +02:00
jif-oai	f20f8a719e	memories/mcp: generate tool schemas with schemars (#21012 ) ## Why The memories MCP server currently keeps handwritten JSON Schema beside the Rust types that actually serialize and deserialize the tool payloads: [`schema.rs`](`2f5c06a29c/codex-rs/memories/mcp/src/schema.rs (L4-L133)`), [`server.rs`](`2f5c06a29c/codex-rs/memories/mcp/src/server.rs (L44-L75)`), and [`backend.rs`](`2f5c06a29c/codex-rs/memories/mcp/src/backend.rs (L41-L117)`). That duplicates the tool contract and makes schema drift easier as the API evolves. ## What changed - derive `JsonSchema` for the memories tool arguments, responses, and nested response types - replace the handwritten schema builders with shared `schemars` generation - preserve the existing wire shape while generating schemas, including nullable output `Option` fields and non-nullable optional input fields - wire the `list`, `read`, and `search` tools to the generated schemas ## Verification - CI pending	2026-05-04 18:40:17 +02:00
jif-oai	2f5c06a29c	nit: legacy (#21006 )	2026-05-04 16:04:29 +02:00
jif-oai	8ba294ea13	feat: support multi-query memories search (#21004 ) ## Why The memories MCP `search` tool only accepts a single substring today, which makes it hard for clients to express combined queries or explain why a line matched. This change adds the richer search shape needed for the next client iteration while keeping the legacy single-`query` call working. ## What changed - accept either the legacy `query` field or a new `queries` array, plus `match_mode: any\|all` - teach the local memories backend to evaluate multi-query line matches and return `matched_queries` on each hit - update the MCP input/output schema and add coverage for parser behavior, ordering, pagination, case sensitivity, and match modes ## Testing - added unit coverage in `memories/mcp/src/local_tests.rs` and `memories/mcp/src/server.rs`	2026-05-04 15:55:06 +02:00
jif-oai	5512b23c95	nit: renaming (#20998 )	2026-05-04 15:43:58 +02:00
jif-oai	0269a46ab1	feat: add context lines to memories MCP search (#20997 ) ## Why The paginated memories MCP `search` tool still returned only the matching line text, which made it harder for clients to present useful search results or decide whether they needed to follow up with a separate `read` call. Adding a small amount of surrounding context makes individual hits much more usable while keeping the search response deterministic and line-addressable. ## What changed - add an optional `context_lines` search argument and thread it through the MCP server into the local memories backend - change search matches to return the matched `line_number` plus a `start_line_number` and multi-line `content` block for the requested context window - update the search tool schema and description to document the new request/response shape - extend the local backend tests to cover zero-context matches, contextual results, pagination, and invalid cursors that point past the end of the result set ## Testing - Added targeted unit coverage in `memories/mcp/src/local_tests.rs` - GitHub Actions are running for the branch --------- Co-authored-by: Codex <noreply@openai.com>	2026-05-04 15:32:57 +02:00
jif-oai	554223ab80	feat: paginate memories MCP search results (#20996 ) ## Why The memories MCP `search` tool previously stopped once it hit `max_results`, so callers could tell there were more matches via `truncated` but had no way to fetch the rest of the result set. That made large searches awkward for clients that need to keep paging through a stable, deterministic view of the matches. ## What changed - add an optional `cursor` field to `SearchMemoriesRequest` / tool input and return `next_cursor` in `SearchMemoriesResponse` - update the MCP schemas and tool wiring so clients can request subsequent pages explicitly - change the local memories backend to collect and sort the full scoped match list, then slice the requested page and reject invalid cursors - add unit coverage for paginated search results and invalid cursor handling in `memories/mcp/src/local_tests.rs` ## Testing - Added targeted unit coverage in `memories/mcp/src/local_tests.rs` - GitHub Actions are running for the branch	2026-05-04 15:23:10 +02:00
jif-oai	29352569b3	feat: make memories MCP list shallow (#20994 ) ## Why The memories MCP `list` tool should behave like a directory listing, not a recursive tree walk. Recursive results make pagination harder to reason about, return unexpectedly deep paths for scoped requests, and no longer match the intended tool contract. ## What Changed - Changed the local memories backend so `list` returns only the immediate children of the requested path. - Preserved file-scoped requests by returning the file itself, and missing paths by returning an empty result. - Updated cursor handling to paginate over the shallow sibling set and reject cursors past the available results. - Updated the MCP tool description to say it lists immediate files and directories under a path. - Reworked the local backend tests to cover shallow top-level listing, shallow scoped listing, sibling ordering, and pagination. ## Testing - `cargo test -p codex-memories-mcp`	2026-05-04 15:08:34 +02:00
jif-oai	5730615e75	feat: paginate MCP memories list (#20993 ) ## Why Large memories trees do not fit well into a single MCP `list` response. This change makes the memories MCP server page `list` results so callers can continue walking the tree without overfetching or relying on ambiguous truncation. ## What changed - add an optional `cursor` input to the memories MCP `list` API and return `next_cursor` alongside `truncated` in the response - paginate recursive local-memory traversal while preserving lexicographic path order across directories - reject malformed and out-of-range cursors as invalid MCP requests - update the server/schema wiring and add coverage for pagination, ordering, and cursor validation in `memories/mcp/src/local_tests.rs` ## Testing - `cargo test -p codex-memories-mcp`	2026-05-04 14:59:56 +02:00
jif-oai	6b6581ac59	feat: add max_lines to memories MCP read (#20991 ) ## Why The memories MCP `read` tool already supports `line_offset`, but it cannot return a bounded line range. That makes it awkward to page through large memory files or request a small slice without relying on token truncation. ## What changed - add an optional `max_lines` parameter to the memories MCP `read` tool schema and request parsing - cap local backend reads to the requested number of lines before token truncation - treat `max_lines = 0` as an invalid request and surface it as `invalid_params` - add backend tests for bounded reads and invalid line request validation ## Testing - added coverage in `memories/mcp/src/local_tests.rs` for `max_lines` reads and invalid `max_lines` / `line_offset` requests	2026-05-04 14:45:38 +02:00
jif-oai	019755d570	feat: add line offsets to memory read MCP (#20986 ) ## Why Memory clients sometimes need to continue reading a file from a known line instead of starting over from the top. Adding a line offset to the `read` MCP keeps that resume logic simple and avoids re-reading already-consumed content. ## What changed - Added an optional `line_offset` argument to the memory `read` tool, defaulting to `1`. - Read content starting at the requested 1-indexed line before token truncation, and return `start_line_number` in the response. - Treat invalid offsets as invalid params errors and cover the new behavior in `codex-rs/memories/mcp/src/local_tests.rs`. ## Testing - Added unit tests for reading from a non-default starting line. - Added unit tests for rejecting `0` and past-end line offsets.	2026-05-04 14:26:37 +02:00
jif-oai	d013155f40	feat: memories mcp v1 (#20622 ) Add an experimental MCP on memories This must never be used and is only here for testing purpose --------- Co-authored-by: Codex <noreply@openai.com>	2026-05-04 13:51:03 +02:00
jif-oai	ff27d01676	feat: seed ad-hoc memory extension instructions (#20606 ) ## Summary Ad-hoc memory notes are written under `memories/extensions/ad_hoc/`, but the consolidation agent only knows how to interpret an extension when the extension folder has an `instructions.md`. Seed those instructions from the memories write pipeline so an enabled memories startup creates the expected ad-hoc extension layout automatically. This also moves extension-specific write behavior behind a dedicated `memories/write/src/extensions/` module. `ad_hoc` owns the seeded instructions template, while the existing resource-retention cleanup lives in its own `prune` module so future memory extensions can add their own write-side setup without growing a flat helper file. ## Changes - Seed `memories/extensions/ad_hoc/instructions.md` during eligible memory startup without overwriting an existing file. - Store the ad-hoc instructions template under `memories/write/templates/extensions/ad_hoc/`, keeping ownership in `codex-memories-write`. - Split memory extension support into `extensions::ad_hoc` and `extensions::prune`. - Keep the existing old-resource pruning behavior unchanged. ## Verification - `cargo test -p codex-memories-write` - `bazel build //codex-rs/memories/write:write` --------- Co-authored-by: chatgpt-codex-connector[bot] <199175422+chatgpt-codex-connector[bot]@users.noreply.github.com>	2026-05-01 14:43:58 +02:00
jif-oai	70fc55b8f3	chore: improve remember prompt (#20610 )	2026-05-01 14:38:07 +02:00
jif-oai	97aae46800	feat: ad-hoc instructions (#20602 )	2026-05-01 13:42:54 +02:00
jif-oai	ad404c8400	chore: allow memories edition (#20600 )	2026-05-01 13:27:37 +02:00
Tom	fe05acad23	Make thread store process-scoped (#19474 ) - Build one app-server process ThreadStore from startup config and share it with ThreadManager and CodexMessageProcessor. - Remove per-thread/fork store reconstruction so effective thread config cannot switch the persistence backend. - Add params to ThreadStore create/resume for specifying thread metadata, since otherwise the metadata from store creation would be used (incorrectly).	2026-04-30 21:24:59 -07:00
viyatb-oai	07c8b8c77c	fix: handle deferred network proxy denials (#19184 ) ## Why This bug is exposed by Guardian/auto-review approvals. With the managed network proxy enabled, a blocked network request can be reported back through the network approval service as an approval denial after the command has already started. Before this change, the shell and unified exec runtimes registered those network approval calls, but did not have a way to observe an async proxy denial as a cancellation/failure signal for the running process. The result was confusing: Guardian/auto-review could correctly deny network access, but the command path could keep running or unregister the approval without surfacing the denial as the command failure. ## What Changed - `NetworkApprovalService` now attaches a cancellation token to active and deferred network approvals. - Proxy-denial outcomes are recorded only for active registrations, cancel the owning token, and are consumed when the approval is finalized. - The shell runtime combines the normal command timeout with the network-denial cancellation token. - Unified exec stores the deferred network approval object, terminates tracked processes when the proxy denial arrives, and returns the denial as a process failure while polling or completing the process. - Tool orchestration passes the active network approval cancellation token into the sandbox attempt and preserves deferred approval errors instead of silently unregistering them. - App-server `command/exec` now handles the combined timeout-or-cancellation expiration variant used by the runtime. ## Verification - `cargo test -p codex-core network_approval --lib` - `cargo clippy -p codex-app-server --all-targets -- -D warnings` - `cargo clippy -p codex-core --all-targets -- -D warnings` --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-29 19:13:57 +00:00
pakrym-oai	8356806fc9	Add ThreadManager sample crate (#20141 ) Summary: - Add codex-thread-manager-sample, a one-shot binary that starts a ThreadManager thread, submits a prompt, and prints the final assistant output. - Pass ThreadStore into ThreadManager::new and expose thread_store_from_config for existing callsites. - Build the sample Config directly with only --model and prompt inputs. Verification: - just fmt - cargo check -p codex-thread-manager-sample -p codex-app-server -p codex-mcp-server - git diff --check Tests: Not run per request.	2026-04-29 11:21:06 -07:00
jif-oai	5b7d6f5c4f	feat: house-keeping memories 3 (#20005 ) Move stuff in memories, no behavioural change expected	2026-04-28 18:13:35 +02:00
jif-oai	21e19912e0	feat: house-keeping memories 2 (#20000 ) Just move metrics in a dedicated file	2026-04-28 17:26:44 +02:00
jif-oai	5a79dfab7c	feat: house-keeping memories 1 (#19998 ) Just move metrics in a dedicated file	2026-04-28 17:11:49 +02:00
jif-oai	1b74360365	feat: skip memory startup when Codex rate limits are low (#19990 ) ## Why Memory startup runs in the background after an eligible turn, but it can consume Codex backend quota at exactly the wrong time: when the user is already near a rate-limit boundary. This PR adds a guard so the memory pipeline backs off when the Codex rate-limit snapshot says the remaining budget is too low. ## What Changed - Added `memories.min_rate_limit_remaining_percent` with a default of `25`, clamped to `0..=100`, and regenerated `core/config.schema.json`. - Added `codex-rs/memories/write/src/guard.rs`, which fetches Codex backend rate limits before memory startup and skips phase 1 / phase 2 when the Codex limit is reached or either tracked window is above the configured usage ceiling. - Keeps startup best-effort: non-Codex auth or rate-limit fetch/client failures preserve the existing memory startup behavior. - Records a `codex.memory.startup` counter with `status=skipped_rate_limit` when startup is skipped. - Added config parsing/clamping coverage and guard unit tests. ## Verification - Added `codex-rs/memories/write/src/guard_tests.rs` for threshold, primary/secondary window, and reached-limit behavior. - Added config tests for TOML parsing and clamping.	2026-04-28 17:07:16 +02:00
jif-oai	a9e5c34083	feat: trigger memories from user turns with cooldown (#19970 ) ## Why Memory startup was tied to thread lifecycle events such as create, load, and fork. That can run memory work before a thread receives real user input, and it makes startup cost scale with thread management instead of actual turns. Moving the trigger to `thread/sendInput` keeps memory startup aligned with the first real user turn and lets it use the current thread config at turn time. The idea is to prevent ghost cost due to pre-warm triggered by the app Turn-based startup can also make global phase-2 consolidation easier to request repeatedly, so this adds a success cooldown and tightens the default startup scan window. ## What Changed - Start `codex_memories_write::start_memories_startup_task` after a non-empty `thread/sendInput` turn is submitted, instead of from thread create/load/fork paths: `d4a6885b78/codex-rs/app-server/src/codex_message_processor.rs (L6477-L6487)` - Expose `CodexThread::config()` so app-server can pass the live config into memory startup at turn time. - Add a six-hour successful-run cooldown for global phase-2 consolidation via `SkippedCooldown`: `d4a6885b78/codex-rs/state/src/runtime/memories.rs (L963-L966)` - Reduce memory startup defaults to at most 2 rollouts over 10 days: `d4a6885b78/codex-rs/config/src/types.rs (L31-L34)` ## Verification Updated the memory runtime coverage around phase-2 reclaim behavior, including `phase2_global_lock_respects_success_cooldown`. --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-28 16:23:13 +02:00
jif-oai	fa127be25f	Stabilize memory Phase 2 input ordering (#19967 ) ## Why Phase 2 still needs to choose the most relevant stage-1 memory outputs by usage and recency, but exposing that ranking as the rendered `raw_memories.md` order creates unnecessary large diff. Usage-count or timestamp changes can reshuffle otherwise unchanged memories, making the workspace diff noisy and giving the consolidation prompt a misleading recency signal from file position. This fix will reduce token consumption ## What Changed - Keep the existing top-N Phase 2 selection ranking by `usage_count`, `last_usage`, `source_updated_at`, and `thread_id`. - Return the selected rows in stable ascending `thread_id` order before syncing Phase 2 filesystem inputs. - Update the memory README, raw memories header, and consolidation prompt so they describe the stable order and tell the prompt to use metadata and workspace diffs instead of file order as the recency signal. - Adjust the memory runtime tests to use deterministic thread IDs and assert the stable return order separately from the ranked selection semantics. ## Test Coverage - Existing memory runtime tests in `codex-rs/state/src/runtime/memories.rs` now cover the stable returned ordering for Phase 2 inputs. --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-28 13:32:05 +02:00
jif-oai	431ebeaef7	feat: split memories part 2 (#19860 ) Keep extracting memories out of core and moving the write trigger in the app-server This is temporary and it should move at the client level as a follow-up This makes core fully independant from `codex-memories-write` --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-28 13:03:28 +02:00
jif-oai	bb83eec825	chore: split memories part 1 (#19818 ) Extract memories into 2 different crates	2026-04-27 16:01:05 +02:00

27 Commits