8-phase AI development pipeline with full orchestration on Claude Code, Copilot CLI, Cursor, Antigravity, and VS Code Copilot Chat. Issue → analysis → plan → TDD → review → commit → PR in one command.
End-to-end AI pipeline that drives a Jira issue or GitHub URL all the way to a merged PR in a single command (v9.6.0). 8 phases: Init · Analysis · Planning (with Plan Approval Gate) · TDD dev · CLI-aware parallel review (Opus + Sonnet on Claude Code, GPT + Opus + Sonnet on Copilot CLI, plus Opus triage) · Test · Commit + PR · Phase 7 Report (PR body, Jira comment, Wiki, Confluence). v8.5.5: /multi-agent:review now accepts PR URL / #N / repo#N input shapes and posts the parallel reviewer verdict back as one canonical PR comment per run. Branch-mode invocations stay chat-only. Multi-repo orchestration (per-repo worktrees, platform-aware identity routing, push-must-succeed, fetch-fail recovery). Bilingual setup applies only to assistant explanations (outputLanguage en/tr); interactive prompts, commits, and PR bodies stay English. Cross-CLI parity: Claude Code (colon form) and Copilot CLI (dash form) commands are byte-identical; Cursor, Antigravity, and VS Code Copilot Chat run the full pipeline via per-platform orchestration adapters (subagents/workflow + /multi-agent command + dev-toolkit MCP), with pickers/gates degrading per the picker contract. v9.3.0: review consensus surfacing (same-base-model agreement on judgment calls flagged 'unverified' instead of trusted), a persistent per-repo learnings ledger (durable facts + rejected review preferences replayed into every run so agents stop re-discovering and reviewers stop re-flagging), a default-FAIL evidence gate (a 'passed' claim needs a substantiating log), an entropy + provider-prefix secret scanner, a conceptual-vs-edit intent guard, and a per-phase cost ledger that prices prompt-cache reads and names the top cost driver. v9.6.0: two disk-hygiene commands (/multi-agent:garbage-collect sweeps /tmp scratch from past runs; /multi-agent:prune-logs prunes per-task logs while always preserving the audit trail + metrics), an intent guard hardened against adversarial EN+TR input, and an ajv schema-conformance test that validates the eval fixtures against the real JSON Schemas. CI re-armed on push/PR. Full version history in repo CHANGELOG.
Claude/multi-agent "#316"Copilotmulti-agent "#316"Claude/multi-agent "PROJ-1234"Copilotmulti-agent "PROJ-1234"Claude/multi-agent:dev "#316"Copilotmulti-agent-dev "#316"Claude/multi-agent:autopilot "#316"Copilotmulti-agent-autopilot "#316"Claude/multi-agent:dev-autopilot "#316"Copilotmulti-agent-dev-autopilot "#316"Claude/multi-agent:local "#316"Copilotmulti-agent-local "#316"Claude/multi-agent:dev-local "#316"Copilotmulti-agent-dev-local "#316"Claude/multi-agent:dev-local-autopilot "#316"Copilotmulti-agent-dev-local-autopilot "#316"Claude/multi-agent:local-autopilot "#316"Copilotmulti-agent-local-autopilot "#316"setupjiraissuestatuslog #Nresume #Nkill #Nreviewrefactorchannelsteststack [platform]language [en|tr]scansearchsyncupdateclear-logspurgehelptesttest "dark mode"test "accessibility"test "dynamic type"test "store-ready"test "biometric"test "performance"stack iosstack androidstack backendstack mobilestack allComplete automated workflow from branch to handoff.
Reviewer set depends on the host CLI: GPT is only natively reachable from Copilot, so Claude runs a 2-model set and Copilot runs a 3-model set. Opus triage runs on both.
Phase 2 refuses to start Phase 3 on underspecified tickets without a human read-through.
Two parallel skills wrap external tooling and report store-submission findings in a normalized severity-grouped shape.
Live token accounting in the tracker.
Every Phase 7 run appends a 4-column cost block to agent-log.md.
Phase 4 Step 1.75 advisory — heuristic, deterministic, sub-second, no LLM.
Phase 5 Step 0 advisory — static detector for newly added public symbols missing a paired test.
Per-repo triage corpus that powers Phase 1 enrichment, Phase 4 prior-art lookup, and /multi-agent:search --semantic.
Phase 6 Step 0 builds the host project that integrates multiple repos. Learn once, auto-apply forever.
Browse and launch tasks from your terminal.
Incremental learning that persists across sessions.
Full component generation from Figma URL.
Tiered pattern scanner for skill content. Blocks supply-chain surprises before install.
Full-text search across every pipeline run you have ever done.
Purpose-built agents for each concern.
6 automated scripts installed to ~/.claude/scripts/.
Stack auto-detected at session start by stack-swap.sh based on project markers (.xcodeproj, build.gradle, package.json).
Automated visual testing and compliance audits.
One source of truth, five platforms, zero drift.
| Claude Code (colon-form) | Copilot CLI (dash-form) | Category | Description |
|---|---|---|---|
/multi-agent "task" | multi-agent "task" | Full pipeline | Full 8-phase pipeline with worktree + interactive confirmations. Analysis → Planning (Plan Approval Gate) → TDD Dev → Parallel Review (Opus triage) → Test → Commit/PR → Report. The default entry point for all tasks. |
/multi-agent:autopilot "task" | multi-agent-autopilot "task" | Full pipeline | Full 8-phase pipeline with worktree, **no confirmations**. Autopilot contract: zero user interaction. Phase 2 gate, Phase 5 test prompt, and Phase 6 commit/PR prompts are all skipped. Safety classifier still runs; high-risk plans (security path, schema migration, delete-without-test) trigger a one-time pause. |
/multi-agent:local "task" | multi-agent-local "task" | Full pipeline | Full 8-phase pipeline on the current branch — **no worktree**, interactive confirmations preserved. Same pipeline as `/multi-agent` but stays in your current editor/IDE session (no split workspace). Single-repo only. |
/multi-agent:local-autopilot "task" | multi-agent-local-autopilot "task" | Full pipeline | Full 8-phase pipeline + no worktree + no confirmations. Combines `autopilot` + `local`: fire-and-forget execution on the current branch. Safety classifier still honored. |
/multi-agent:dev "task" | multi-agent-dev "task" | Fast mode | Skips Analysis (Phase 1) + Planning (Phase 2) + Review (Phase 4) gates — jumps to Dev. Uses Opus for development instead of Sonnet (Opus catches more issues without the review safety net). For small, well-scoped tasks where the diff is obvious. |
/multi-agent:dev-autopilot "task" | multi-agent-dev-autopilot "task" | Fast mode | Fast path + autopilot. Init → Dev(Opus) → Commit → Report with no confirmations. Second-fastest mode; ~2–3x faster than full pipeline, no review/triage safety net. |
/multi-agent:dev-local "task" | multi-agent-dev-local "task" | Fast mode | Fast + local. Same Init → Dev(Opus) → Commit → Report flow, on the current branch with confirmations preserved. |
/multi-agent:dev-local-autopilot "task" | multi-agent-dev-local-autopilot "task" | Fast mode | Fastest possible mode. Fast + local + autopilot — Init → Dev(Opus) → Commit → Report with no worktree and no confirmations. Ideal for trivial fixes you'd otherwise do manually. |
/multi-agent:jira | multi-agent-jira | Interactive | Browse your open Jira issues in a terminal picker. Select one → confirm branch → choose mode (normal / autopilot / fast) → pipeline launches. Authenticated via Keychain-stored Jira token. |
/multi-agent:issue | multi-agent-issue | Interactive | Browse unassigned GitHub issues. Select one → auto-assigns to you + moves to In Progress → confirm branch → choose mode → pipeline launches. Handles GitHub-only repos that don't use Jira. |
/multi-agent:status | multi-agent-status | Ops | Lists every active task across all projects — ID, phase, branch, status, last updated. Finds tasks that stopped mid-pipeline. |
/multi-agent:log [#id] | multi-agent-log [#id] | Ops | Shows a task's `agent-log.md` — full pipeline trace including analysis output, plan approval history, review findings, commit shas, Phase 7 channels dispatch results. No id → shows latest. |
/multi-agent:resume [#id] | multi-agent-resume [#id] | Ops | Resume a paused / failed task from its last completed phase. State is file-based (`agent-state.json`) so resumes work across sessions and machines. |
/multi-agent:kill [#id] | multi-agent-kill [#id] | Ops | Stop a task — deletes its worktree + branch after double confirmation. Log is archived (not deleted) for audit. Destructive; always confirms. |
/multi-agent:purge | multi-agent-purge | Ops | ⚠️ Nukes all worktrees, branches, logs, and state files. Double-confirmation required. For starting fully fresh — rarely needed in normal usage. |
/multi-agent:review | multi-agent-review | Ops | Runs Phase 4 review **standalone** on the current branch's diff — parallel Claude Code 2-model (Opus + Sonnet) or Copilot CLI 3-model (GPT-5.4 + Opus + Sonnet), then Opus triage. Opt-in deep audit before PR. |
/multi-agent:channels | multi-agent-channels | Ops | Multi-channel reporter for PR body / Jira comment / Confluence page / component Wiki. Multi-select channel × multi-select content (normal analysis / technical details / test scenarios / auto-diff summary / manual note / cost summary). Humanizer pass per channel, reviewer-preserving Bitbucket PR PUT. Phase 7 delegates to it. |
/multi-agent:scan | multi-agent-scan | Ops | Tiered pattern scanner for skill content (critical / high / medium / low). Catches shell-pipe exec, base64-decode exec, eval-of-network, unicode bidi override (trojan source), hardcoded credentials, pastebin raw fetches. Auto-runs warn-only on every install; `--strict` for CI. |
/multi-agent:search "query" | multi-agent-search "query" | Ops | Full-text search across `~/.claude/logs/multi-agent/**/agent-log.md`. Smart ranking (match count × recency weight), filters by `--project`, `--since`, `--phase`. Output modes: colored text, `--json`, `--tsv`. Local-only, no network. |
/multi-agent:setup | multi-agent-setup | Meta | First-run wizard — scans Keychain for tokens (GitHub, Jira, Confluence, Bitbucket, Telegram, npm, Vercel, Firebase), prompts for Git identity (personal / work), maps per-project platform identity routing. All creds stay in Keychain; prefs file holds only pointers. |
/multi-agent:sync | multi-agent-sync | Meta | Single command to sync the full multi-agent ecosystem: Claude Code skill dir, Copilot CLI skill dir, pipeline source repo, website data, remote-control endpoints. 26 commands kept byte-identical between CLIs. |
/multi-agent:update | multi-agent-update | Meta | Updates the pipeline to the latest version — `git pull` + `install.js` + `migrate-prefs.mjs` schema migration. Idempotent. |
/multi-agent:help | multi-agent-help | Meta | Bilingual usage guide. Renders in EN or TR based on `prefs.global.promptLanguage`. Shows all 26 commands, modes matrix, input types, preference reference. |
/multi-agent:stack | multi-agent-stack | Meta | Manually override the detected stack (ios / android / backend / frontend / mobile / fullstack / all). Auto-detection runs at session start via `stack-swap.sh`; override when working polyglot. |
/multi-agent:refactor | multi-agent-refactor | Meta | Scores your codebase against industry best practices + competitor projects, proposes an improvement plan, asks for approval, then runs the pipeline to apply the changes. For periodic tech-debt sweeps. |
/multi-agent:test | multi-agent-test | Meta | UI Bug Hunter — auto-detects platform (iOS simctl / Android adb) and runs screenshot + tap + analyze on the booted device. Also handles `store-ready` preset that dispatches the Apple/Google compliance skill. |
/multi-agent:manual-test | multi-agent-manual-test | Meta | Standalone Phase 5 — checks out the active task's branch locally so you can run manual tests in Xcode/Android Studio. Separate from `:test` (which is auto UI bug hunting). |