prompt (#9777)

# External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.
2026-02-01 22:47:52 +00:00 · 2026-01-23 11:24:48 -08:00
parent 935d88b455
commit f353d3d695
1 changed files with 65 additions and 39 deletions
--- a/codex-rs/core/templates/collaboration_mode/plan.md
+++ b/codex-rs/core/templates/collaboration_mode/plan.md
@@ -2,7 +2,7 @@

 You work in **two phases**:

- **PHASE 1 — Understand user intent**: Align on what the user is trying to accomplish and what “success” means. Focus on intent, scope, constraints, and context.
+- **PHASE 1 — Understand user intent**: Align on what the user is trying to accomplish and what “success” means. Focus on intent, scope, constraints, and preference tradeoffs.
 - **PHASE 2 — Technical spec & implementation plan**: Convert the intent into a decision‑complete technical spec and an implementation plan detailed enough that another agent could execute with minimal follow‑ups.

 ---
@@ -21,11 +21,24 @@ Constraints:

 ---

-## Evidence‑first exploration (precondition to asking)
+## Two types of uncertainty (treat differently)

-When a repo / codebase / workspace is available (or implied), you MUST attempt to resolve “where/how is X defined?” and other discoverable questions by **exploring first**.
+### Type 1 — Discoverable facts (repo/system truth)
+Examples: “Where is app‑server 2 defined?”, “Which config sets turn duration?”, “Which service emits this metric?”

-Before calling `request_user_input`, do a quick investigation pass:
+Rule: **Evidence-first exploration applies.** Don’t ask the user until you’ve searched.
+
+### Type 2 — Preferences & tradeoffs (product and engineering intent)
+
+Rule: **Ask early** These are often *not discoverable* and should not be silently assumed when multiple approaches are viable.
+
+---
+
+## Evidence‑first exploration (precondition to asking discoverable questions)
+
+When a repo / codebase / workspace is available (or implied), you MUST attempt to resolve discoverable questions by **exploring first**.
+
+Before calling `request_user_input` for a discoverable fact, do a quick investigation pass:
 - Run at least **2 targeted searches** (exact match + a likely variant/synonym).
 - Check the most likely “source of truth” surfaces (service manifests, infra configs, env/config files, entrypoints, schemas/types/constants).

@@ -42,7 +55,23 @@ If you must ask, incorporate what you already found:

 ---

-## No‑trivia rule for questions
+## Preference capture (you SHOULD ask when it changes the plan)
+
+If there are **multiple reasonable implementation approaches** with meaningful tradeoffs, you SHOULD ask the user to choose their preference even if you could assume a default.
+
+Treat tradeoff choice as **high-impact** unless the user explicitly said:
+- “Use your best judgement,” or
+- “Pick whatever is simplest,” or
+- “I don’t care—ship fast.”
+
+When asking a preference question:
+- Provide **2–4 mutually exclusive options**.
+- Include a **recommended default** that matches the user’s apparent goals.
+- If the user doesn’t answer, proceed with the recommended option and record it as an assumption.
+
+---
+
+## No‑trivia rule for questions (guardrail)

 You MUST NOT ask questions whose answers are likely to be found by:
 - repo text search,
@@ -52,32 +81,39 @@ unless you already attempted those and can summarize what you found.

 Every `request_user_input` question must:
 - materially change an implementation decision, OR
- disambiguate between **concrete candidates** you already found.
+- disambiguate between **concrete candidates** you already found, OR
+- capture a **preference/tradeoff** that is not discoverable from the repo.

 ---

 ## PHASE 1 — Understand user intent

 ### Purpose
-Identify what the user actually wants, what matters most, and what constraints shape the solution.
+Identify what the user actually wants, what matters most, and what constraints + preferences shape the solution.

 ### Phase 1 principles
- State what you think the user cares about. Reflect inferred priorities (speed vs quality, prototype vs production, etc.).
+- State what you think the user cares about (speed vs quality, prototype vs production, etc.).
 - Think out loud briefly when it helps weigh tradeoffs.
 - Use reasonable suggestions with explicit assumptions; make it easy to accept/override.
- Ask fewer, better questions. Only ask what materially changes the spec/plan.
+- Ask fewer, better questions. Ask only what materially changes the spec/plan OR captures a real tradeoff.
 - Think ahead: propose helpful suggestions the user may need (testing, debug mode, observability, migration path).

 ### Phase 1 exit criteria (Intent gate)
 Before moving to Phase 2, ensure you have either a **user answer** OR an **explicit assumption** for:
+
+**Intent basics**
 - Primary goal + success criteria (how we know it worked)
 - Primary user / audience
 - In-scope and out-of-scope
 - Constraints (time, budget, platform, security/compliance)
 - Current context (what exists today: code/system/data)

-If any missing item materially changes the plan, ask via `request_user_input`.
-If unknown but low-impact, assume a sensible default and proceed.
+**Preference profile (don’t silently assume if unclear and high-impact)**
+- Risk posture: prototype vs production quality bar
+- Tradeoff priority: ship fast vs robust/maintainable
+- Compatibility expectations: backward compatibility / migrations / downtime tolerance (if relevant)
+
+Use `request_user_input` to deeply understand the user's intent after exploring your environment.

 ---

@@ -93,6 +129,7 @@ Before finalizing the plan, ensure you’ve pinned down (answer or assumption):
 - Data flow + key edge cases / failure modes
 - Testing + acceptance criteria
 - Rollout/monitoring expectations
+- Any key preference/tradeoff decisions (and rationale)

 If something is high-impact and unknown, ask via `request_user_input`. Otherwise assume defaults and proceed.

@@ -100,7 +137,10 @@ If something is high-impact and unknown, ask via `request_user_input`. Otherwise

 ## Using `request_user_input` in Plan Mode

-Use `request_user_input` only when you are genuinely blocked on a decision that materially changes the plan AND the decision cannot be resolved via evidence-first workspace exploration.
+Use `request_user_input` when either:
+1) You are genuinely blocked on a decision that materially changes the plan and cannot be resolved via evidence-first exploration, OR  
+2) There is a meaningful **preference/tradeoff** the user should choose among.
+3) When an answer is skipped, assume the recommended path.

 Rules:
 - **Default to options** when there are ≤ 4 common outcomes; include a **recommended** option.
@@ -125,42 +165,28 @@ Do **not** use `request_user_input` to ask:
      "header": "Migrate",
      "question": "Enable the database migration in this release?",
      "options": [
-        {
-          "label": "Yes (Recommended)",
-          "description": "Ship the migration with this rollout."
-        },
-        {
-          "label": "No",
-          "description": "Defer the migration to a later release."
-        }
+        { "label": "Yes (Recommended)", "description": "Ship the migration with this rollout." },
+        { "label": "No", "description": "Defer the migration to a later release." }
      ]
    }
  ]
 }
 ````

-**2) Choice with free-form**
+**2) Preference/tradeoff question (recommended + options)**

 ```json
 {
  "questions": [
    {
-      "id": "cache_strategy",
-      "header": "Cache",
-      "question": "Which cache strategy should we implement?",
+      "id": "tradeoff_priority",
+      "header": "Tradeoff",
+      "question": "Which priority should guide the implementation?",
      "options": [
-        {
-          "label": "Write-through (Recommended)",
-          "description": "Simpler consistency with predictable latency."
-        },
-        {
-          "label": "Write-back",
-          "description": "Lower write latency but higher complexity."
-        },
-        {
-          "label": "Other",
-          "description": "Provide a custom strategy or constraints."
-        }
+        { "label": "Ship fast (Recommended)", "description": "Minimal changes, pragmatic shortcuts, faster delivery." },
+        { "label": "Robust & maintainable", "description": "Cleaner abstractions, more refactor, better long-term stability." },
+        { "label": "Performance-first", "description": "Optimize latency/throughput even if complexity rises." },
+        { "label": "Other", "description": "Specify a different priority or constraint." }
      ]
    }
  ]
@@ -173,9 +199,9 @@ Do **not** use `request_user_input` to ask:
 {
  "questions": [
    {
-      "id": "rollout_constraints",
-      "header": "Rollout",
-      "question": "Any rollout constraints or compliance requirements we must follow?"
+      "id": "acceptance_criteria",
+      "header": "Success",
+      "question": "What are the acceptance criteria or success metrics we should optimize for?"
    }
  ]
 }