CurdX Flow
Spec-driven development for Claude Code, with autonomous task execution.
What Is CurdX Flow?
@curdx/flow is a single npm package that delivers two products built around Claude Code:
- A spec-driven development plugin. Provides the
/curdx-flow:*slash commands. Turns an unstructured feature request into a research report, requirements document, technical design, and a task list — then executes the task list autonomously in fresh contexts, one task at a time, with verification gates between every task. - A curated plugin and MCP marketplace. A single interactive installer that picks, installs, updates, and uninstalls a complementary tool set for Claude Code (cross-session memory, browser automation, live documentation lookup, and more). It maintains a managed manifest in
~/.claude/CLAUDE.mdso Claude knows what is available.
The plugin and the installer ship in the same npm package and share a single version number. Run npx @curdx/flow once and you have a fully wired Claude Code environment.
You: /curdx-flow:start "Add OAuth login with token refresh"
flow: ✓ Interview: 3 clarifying questions answered (60s)
flow: ✓ Parallel research team dispatched (3 specialist agents)
→ research.md (148 lines, 9 references, 4 risks identified)
You: review · approve → /curdx-flow:requirements
flow: ✓ product-manager agent
→ requirements.md (US-1..US-9, FR-1..FR-23, NFR-1..NFR-12)
You: review · approve → /curdx-flow:design
flow: ✓ architect-reviewer
→ design.md (9 decisions, 7 risks, component diagram)
You: review · approve → /curdx-flow:tasks
flow: ✓ task-planner
→ tasks.md (12 tasks across 4 phases, with VERIFY gates)
You: review · approve → /curdx-flow:implement
flow: ⟳ task 1.1 → verify → commit ✓
⟳ task 1.2 → verify → commit ✓
…
✓ ALL_TASKS_COMPLETE (12/12 tasks, 47 commits, all green)Workflow At A Glance
The workflow has five phases. Each phase is owned by a specialist subagent, produces exactly one Markdown artifact, and pauses for human approval before the next phase begins. The final phase, implement, runs an autonomous loop — execute task, verify, commit, advance — until every task in tasks.md is checked off.
| Phase | Command | Subagent | Output |
|---|---|---|---|
| Research | /curdx-flow:research | research-analyst (parallel team) | research.md |
| Requirements | /curdx-flow:requirements | product-manager | requirements.md |
| Design | /curdx-flow:design | architect-reviewer | design.md |
| Tasks | /curdx-flow:tasks | task-planner | tasks.md |
| Implement | /curdx-flow:implement | spec-executor (autonomous loop) | code, tests, commits |
All artifacts live under specs/<spec-name>/ in your repository. They are plain Markdown, version-controlled, and survive across sessions.
When It Works Best
CurdX Flow pays off most when:
- You ship code others will read. Specs become a paper trail your reviewers and future self can audit.
- Your codebase has conventions and constraints. The research and design phases force these to surface before code is written, not after review.
- You context-switch. Every task starts with a clean context window, so you do not have to manually re-explain the project after a long pause or a rate-limit reset.
- You delegate, then come back. The autonomous loop runs unattended; you review the diff when it finishes.
It is less useful for one-off scripts or five-line tweaks. For those, plain Claude Code is faster.
Key Features
Five-phase spec workflow turns an unstructured request into research, requirements, design, and tasks before any code is written. Every phase is reviewable Markdown.
Autonomous execution loop runs every approved task end-to-end with verification gates. Walk away. Come back. Read the diff.
Specialist subagents assign a single owner to each phase. Every subagent runs in a fresh context window so reasoning stays sharp.
Quality gates with [VERIFY] tasks run typecheck, tests, and smoke checks between work units. A weak step does not silently pass.
Skill auto-discovery scans your installed Claude Code skills, semantically matches them to the goal, and pre-loads the relevant ones into the active context.
Curated marketplace installs claude-mem, chrome-devtools-mcp, context7, and other proven tools through a single interactive picker. The same package, the same version, one command.
Plugin observability ships an analyze CLI that parses session jsonl and produces a 7-section markdown report covering hook failures, command frequency, agent dispatch heat, spec funnel, and schema drift.
A Typical Operator Workflow
- Run
npx @curdx/flowonce to install the bundled plugin and pick optional marketplace items. - Inside a project, type
/curdx-flow:startand describe the feature in plain English. - Approve the research output, then advance through requirements, design, and tasks.
- Hit
/curdx-flow:implementand walk away while the loop ticks through every task. - Come back, read the diff, open the PR.
Power User Tips
- Commit the four canonical artifacts (
research.md,requirements.md,design.md,tasks.md). Reviewers will thank you. - Use
--quickonly on low-risk specs you can fully review at the end. The pause-for-approval gates exist for a reason. - Pass
--tasks-size coarsefor quick prototypes,finefor production work where each task should map to one verifiable step. - Run
/curdx-flow:statusbefore resuming. It tells you where every spec is in the workflow. - Use
/curdx-flow:triagefor features that obviously need multiple specs. It produces an epic with declared dependencies. - If a task fails verification repeatedly, fix the underlying issue rather than retrying. The loop halts on purpose.
Quick Links
- Getting Started for install, first spec, and the five-minute walkthrough
- How It Works for the spec model, subagents, and autonomous loop architecture
- Configuration for flags, environment variables, state files, and the managed CLAUDE.md block
- Commands for every slash command and CLI invocation
- Subagents for the nine specialist agents that own each phase
- Troubleshooting for common failure modes and recovery steps