coder
diff --git a/‎.agents/skills/refine-plan/SKILL.md‎
Lines changed: 140 additions & 0 deletions b/‎.agents/skills/refine-plan/SKILL.md‎
Lines changed: 140 additions & 0 deletions
diff --git a/‎coderd/x/chatd/chatd.go‎
Lines changed: 32 additions & 0 deletions b/‎coderd/x/chatd/chatd.go‎
Lines changed: 32 additions & 0 deletions
diff --git a/‎coderd/x/chatd/chatd_test.go‎
Lines changed: 1 addition & 1 deletion b/‎coderd/x/chatd/chatd_test.go‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎coderd/x/chatd/chattool/proposeplan.go‎
Lines changed: 92 additions & 0 deletions b/‎coderd/x/chatd/chattool/proposeplan.go‎
Lines changed: 92 additions & 0 deletions
@@ -0,0 +1,140 @@
+---
+name: refine-plan
+description: Iteratively refine development plans using TDD methodology. Ensures plans are clear, actionable, and include red-green-refactor cycles with proper test coverage.
+---
+
+# Refine Development Plan
+
+## Overview
+
+Good plans eliminate ambiguity through clear requirements, break work into clear phases, and always include refactoring to capture implementation insights.
+
+## When to Use This Skill
+
+| Symptom                     | Example                                |
+|-----------------------------|----------------------------------------|
+| Unclear acceptance criteria | No definition of "done"                |
+| Vague implementation        | Missing concrete steps or file changes |
+| Missing/undefined tests     | Tests mentioned only as afterthought   |
+| Absent refactor phase       | No plan to improve code after it works |
+| Ambiguous requirements      | Multiple interpretations possible      |
+| Missing verification        | No way to confirm the change works     |
+
+## Planning Principles
+
+### 1. Plans Must Be Actionable and Unambiguous
+
+Every step should be concrete enough that another agent could execute it without guessing.
+
+- ❌ "Improve error handling" → ✓ "Add try-catch to API calls in user-service.ts, return 400 with error message"
+- ❌ "Update tests" → ✓ "Add test case to auth.test.ts: 'should reject expired tokens with 401'"
+
+NEVER include thinking output or other stream-of-consciousness prose mid-plan.
+
+### 2. Push Back on Unclear Requirements
+
+When requirements are ambiguous, ask questions before proceeding.
+
+### 3. Tests Define Requirements
+
+Writing test cases forces disambiguation. Use test definition as a requirements clarification tool.
+
+### 4. TDD is Non-Negotiable
+
+All plans follow: **Red → Green → Refactor**. The refactor phase is MANDATORY.
+
+## The TDD Workflow
+
+### Red Phase: Write Failing Tests First
+
+**Purpose:** Define success criteria through concrete test cases.
+
+**What to test:**
+
+- Happy path (normal usage), edge cases (boundaries, empty/null), error conditions (invalid input, failures), integration points
+
+**Test types:**
+
+- Unit tests: Individual functions in isolation (most tests should be these - fast, focused)
+- Integration tests: Component interactions (use for critical paths)
+- E2E tests: Complete workflows (use sparingly)
+
+**Write descriptive test cases:**
+
+**If you can't write the test, you don't understand the requirement and MUST ask for clarification.**
+
+### Green Phase: Make Tests Pass
+
+**Purpose:** Implement minimal working solution.
+
+Focus on correctness first. Hardcode if needed. Add just enough logic. Resist urge to "improve" code. Run tests frequently.
+
+### Refactor Phase: Improve the Implementation
+
+**Purpose:** Apply insights gained during implementation.
+
+**This phase is MANDATORY.** During implementation you'll discover better structure, repeated patterns, and simplification opportunities.
+
+**When to Extract vs Keep Duplication:**
+
+This is highly subjective, so use the following rules of thumb combined with good judgement:
+
+1) Follow the "rule of three": if the exact 10+ lines are repeated verbatim 3+ times, extract it.
+2) The "wrong abstraction" is harder to fix than duplication.
+3) If extraction would harm readability, prefer duplication.
+
+**Common refactorings:**
+
+- Rename for clarity
+- Simplify complex conditionals
+- Extract repeated code (if meets criteria above)
+- Apply design patterns
+
+**Constraints:**
+
+- All tests must still pass after refactoring
+- Don't add new features (that's a new Red phase)
+
+## Plan Refinement Process
+
+### Step 1: Review Current Plan for Completeness
+
+- [ ] Clear context explaining why
+- [ ] Specific, unambiguous requirements
+- [ ] Test cases defined before implementation
+- [ ] Step-by-step implementation approach
+- [ ] Explicit refactor phase
+- [ ] Verification steps
+
+### Step 2: Identify Gaps
+
+Look for missing tests, vague steps, no refactor phase, ambiguous requirements, missing verification.
+
+### Step 3: Handle Unclear Requirements
+
+If you can't write the plan without this information, ask the user. Otherwise, make reasonable assumptions and note them in the plan.
+
+### Step 4: Define Test Cases
+
+For each requirement, write concrete test cases. If you struggle to write test cases, you need more clarification.
+
+### Step 5: Structure with Red-Green-Refactor
+
+Organize the plan into three explicit phases.
+
+### Step 6: Add Verification Steps
+
+Specify how to confirm the change works (automated tests + manual checks).
+
+## Tips for Success
+
+1. **Start with tests:** If you can't write the test, you don't understand the requirement.
+2. **Be specific:** "Update API" is not a step. "Add error handling to POST /users endpoint" is.
+3. **Always refactor:** Even if code looks good, ask "How could this be clearer?"
+4. **Question everything:** Ambiguity is the enemy.
+5. **Think in phases:** Red → Green → Refactor.
+6. **Keep plans manageable:** If plan exceeds ~10 files or >5 phases, consider splitting.
+
+---
+
+**Remember:** A good plan makes implementation straightforward. A vague plan leads to confusion, rework, and bugs.
@@ -3334,6 +3334,7 @@ func (p *Server) runChat(
 	// create workspaces or spawn further subagents — they should
 	// focus on completing their delegated task.
 	if !chat.ParentChatID.Valid {
+		// Workspace provisioning tools.
 		tools = append(tools,
 			chattool.ListTemplates(chattool.ListTemplatesOptions{
 				DB:      p.db,
@@ -3361,6 +3362,37 @@ func (p *Server) runChat(
 				WorkspaceMu: &workspaceMu,
 			}),
 		)
+		// Plan presentation tool.
+		tools = append(tools, chattool.ProposePlan(chattool.ProposePlanOptions{
+			GetWorkspaceConn: workspaceCtx.getWorkspaceConn,
+			StoreFile: func(ctx context.Context, name string, mediaType string, data []byte) (uuid.UUID, error) {
+				workspaceCtx.chatStateMu.Lock()
+				chatSnapshot := *workspaceCtx.currentChat
+				workspaceCtx.chatStateMu.Unlock()
+
+				if !chatSnapshot.WorkspaceID.Valid {
+					return uuid.Nil, xerrors.New("chat has no workspace")
+				}
+
+				ws, err := p.db.GetWorkspaceByID(ctx, chatSnapshot.WorkspaceID.UUID)
+				if err != nil {
+					return uuid.Nil, xerrors.Errorf("resolve workspace: %w", err)
+				}
+
+				row, err := p.db.InsertChatFile(ctx, database.InsertChatFileParams{
+					OwnerID:        chatSnapshot.OwnerID,
+					OrganizationID: ws.OrganizationID,
+					Name:           name,
+					Mimetype:       mediaType,
+					Data:           data,
+				})
+				if err != nil {
+					return uuid.Nil, xerrors.Errorf("insert chat file: %w", err)
+				}
+
+				return row.ID, nil
+			},
+		}))
 		tools = append(tools, p.subagentTools(ctx, func() database.Chat {
 			return chat
 		})...)
 
@@ -218,7 +218,7 @@ func TestSubagentChatExcludesWorkspaceProvisioningTools(t *testing.T) {
 	require.GreaterOrEqual(t, len(recorded), 2,
 		"expected at least 2 streamed LLM calls (root + subagent)")
 
-	workspaceTools := []string{"list_templates", "read_template", "create_workspace"}
+	workspaceTools := []string{"propose_plan", "list_templates", "read_template", "create_workspace"}
 	subagentTools := []string{"spawn_agent", "wait_agent", "message_agent", "close_agent"}
 
 	// Identify root and subagent calls. Root chat calls include
 
@@ -0,0 +1,92 @@
+package chattool
+
+import (
+	"context"
+	"io"
+	"path/filepath"
+	"strings"
+
+	"charm.land/fantasy"
+	"github.com/google/uuid"
+
+	"github.com/coder/coder/v2/codersdk/workspacesdk"
+)
+
+const maxProposePlanSize = 32 * 1024 // 32 KiB
+
+// ProposePlanOptions configures the propose_plan tool.
+type ProposePlanOptions struct {
+	GetWorkspaceConn func(context.Context) (workspacesdk.AgentConn, error)
+	StoreFile        func(ctx context.Context, name string, mediaType string, data []byte) (uuid.UUID, error)
+}
+
+// ProposePlanArgs are the arguments for the propose_plan tool.
+type ProposePlanArgs struct {
+	Path string `json:"path"`
+}
+
+// ProposePlan returns a tool that presents a Markdown plan file from the
+// workspace for user review.
+func ProposePlan(options ProposePlanOptions) fantasy.AgentTool {
+	return fantasy.NewAgentTool(
+		"propose_plan",
+		"Present a Markdown plan file from the workspace for user review. "+
+			"The file must already exist with a .md extension — use write_file to create it or edit_files to refine it before calling this tool. "+
+			"Pass the absolute file path (e.g. /home/coder/PLAN.md). The tool reads the content from the workspace.",
+		func(ctx context.Context, args ProposePlanArgs, _ fantasy.ToolCall) (fantasy.ToolResponse, error) {
+			if options.GetWorkspaceConn == nil {
+				return fantasy.NewTextErrorResponse("workspace connection resolver is not configured"), nil
+			}
+			if options.StoreFile == nil {
+				return fantasy.NewTextErrorResponse("file storage is not configured"), nil
+			}
+			conn, err := options.GetWorkspaceConn(ctx)
+			if err != nil {
+				return fantasy.NewTextErrorResponse(err.Error()), nil
+			}
+			return executeProposePlanTool(ctx, conn, args, options.StoreFile)
+		},
+	)
+}
+
+func executeProposePlanTool(
+	ctx context.Context,
+	conn workspacesdk.AgentConn,
+	args ProposePlanArgs,
+	storeFile func(ctx context.Context, name string, mediaType string, data []byte) (uuid.UUID, error),
+) (fantasy.ToolResponse, error) {
+	path := strings.TrimSpace(args.Path)
+	if path == "" {
+		return fantasy.NewTextErrorResponse("path is required (use an absolute path, e.g. /home/coder/PLAN.md)"), nil
+	}
+	if !strings.HasSuffix(path, ".md") {
+		return fantasy.NewTextErrorResponse("path must end with .md"), nil
+	}
+
+	rc, _, err := conn.ReadFile(ctx, path, 0, maxProposePlanSize+1)
+	if err != nil {
+		return fantasy.NewTextErrorResponse(err.Error()), nil
+	}
+	defer rc.Close()
+
+	data, err := io.ReadAll(rc)
+	if err != nil {
+		return fantasy.NewTextErrorResponse(err.Error()), nil
+	}
+	if int64(len(data)) > maxProposePlanSize {
+		return fantasy.NewTextErrorResponse("plan file exceeds 32 KiB size limit"), nil
+	}
+
+	fileID, err := storeFile(ctx, filepath.Base(path), "text/markdown", data)
+	if err != nil {
+		return fantasy.NewTextErrorResponse("failed to store plan file: " + err.Error()), nil
+	}
+
+	return toolResponse(map[string]any{
+		"ok":         true,
+		"path":       path,
+		"kind":       "plan",
+		"file_id":    fileID.String(),
+		"media_type": "text/markdown",
+	}), nil
+}