Multi-Agent Pipeline

8-phase AI development pipeline with full orchestration on Claude Code, Copilot CLI, Cursor, Antigravity, and VS Code Copilot Chat. Issue → analysis → plan → TDD → review → commit → PR in one command.

End-to-end AI pipeline that drives a Jira issue or GitHub URL all the way to a merged PR in a single command (v9.6.0). 8 phases: Init · Analysis · Planning (with Plan Approval Gate) · TDD dev · CLI-aware parallel review (Opus + Sonnet on Claude Code, GPT + Opus + Sonnet on Copilot CLI, plus Opus triage) · Test · Commit + PR · Phase 7 Report (PR body, Jira comment, Wiki, Confluence). v8.5.5: /multi-agent:review now accepts PR URL / #N / repo#N input shapes and posts the parallel reviewer verdict back as one canonical PR comment per run. Branch-mode invocations stay chat-only. Multi-repo orchestration (per-repo worktrees, platform-aware identity routing, push-must-succeed, fetch-fail recovery). Bilingual setup applies only to assistant explanations (outputLanguage en/tr); interactive prompts, commits, and PR bodies stay English. Cross-CLI parity: Claude Code (colon form) and Copilot CLI (dash form) commands are byte-identical; Cursor, Antigravity, and VS Code Copilot Chat run the full pipeline via per-platform orchestration adapters (subagents/workflow + /multi-agent command + dev-toolkit MCP), with pickers/gates degrading per the picker contract. v9.3.0: review consensus surfacing (same-base-model agreement on judgment calls flagged 'unverified' instead of trusted), a persistent per-repo learnings ledger (durable facts + rejected review preferences replayed into every run so agents stop re-discovering and reviewers stop re-flagging), a default-FAIL evidence gate (a 'passed' claim needs a substantiating log), an entropy + provider-prefix secret scanner, a conceptual-vs-edit intent guard, and a per-phase cost ledger that prices prompt-cache reads and names the top cost driver. v9.6.0: two disk-hygiene commands (/multi-agent:garbage-collect sweeps /tmp scratch from past runs; /multi-agent:prune-logs prunes per-task logs while always preserving the audit trail + metrics), an intent guard hardened against adversarial EN+TR input, and an ajv schema-conformance test that validates the eval fixtures against the real JSON Schemas. CI re-armed on push/PR. Full version history in repo CHANGELOG.

Private RepositoryThis is a private repository. You need collaborator access to view the source code on GitHub.
8
Phases
204
Skills
31
Commands
95
Smoke suites
Stable
Status

Tech Stack

Node.jsClaude CodeCopilot CLICursorAntigravityVS Code Copilot Chat206 Unified SkillsGitHub PackagesJSON Schemas8 Sub-Agents

How It Works

Setup
🔑
Keychain
11 services · servicePatMap per identity
👤
Git Identity
identities[] · servicePatMap ownership
🔍
Repo Discovery
Local + Bitbucket + GitHub 90-day
🗺
Identity Routing
URL glob → identity auto-pick
📋
Stack Detect
iOS · Android · Backend · Frontend
🔒
Pre-commit
Secret scanning
Inputs (5 types)
GitHub Issue#316
GitHub URLgithub.com/.../issues/N
Jira IDPROJ-1234
Jira URLjira.../browse/...
Free Text"Fix dark mode bug"
Launch Modes (8 dispatch variants)
Claude/multi-agent "#316"Copilotmulti-agent "#316"
Normal0→1→2→3→4→5→6→7
Claude/multi-agent "PROJ-1234"Copilotmulti-agent "PROJ-1234"
Jira0→1→2→3→4→5→6→7
Claude/multi-agent:dev "#316"Copilotmulti-agent-dev "#316"
dev0→3→6→7
Claude/multi-agent:autopilot "#316"Copilotmulti-agent-autopilot "#316"
autopilot0→1→2→3→4→5→6→7
Claude/multi-agent:dev-autopilot "#316"Copilotmulti-agent-dev-autopilot "#316"
dev-autopilot0→3→6→7
Claude/multi-agent:local "#316"Copilotmulti-agent-local "#316"
local0→1→2→3→4→5→6→7
Claude/multi-agent:dev-local "#316"Copilotmulti-agent-dev-local "#316"
dev + local0→3→6→7
Claude/multi-agent:dev-local-autopilot "#316"Copilotmulti-agent-dev-local-autopilot "#316"
dev-autopilot + local0→3→6→7
Claude/multi-agent:local-autopilot "#316"Copilotmulti-agent-local-autopilot "#316"
autopilot + local0→1→2→3→4→5→6→7
Helper Commands
setup
Keychain token + Git identity onboarding
jira
Browse your open Jira issues → pick → launch
issue
Browse unassigned GitHub issues → pick → launch
status
List tasks across all projects
log #N
Show task log
resume #N
Resume a stopped task
kill #N
Delete worktree, logs preserved
review
Review only the current diff
refactor
Autonomous refactoring with TDD safety net
channels
Multi-channel reporter — PR · Jira · Confluence · Wiki (multi-select, humanizer, v5.7+ replaces enrich)
test
UI Bug Hunter (simulator/emulator auto-detect)
stack [platform]
Swap platform skills on-the-fly
language [en|tr]
Toggle outputLanguage (assistant explanations). Picker prompts stay English.
scan
Skill security (4-tier · warn-only · CI strict)
search
Log full-text search (smart ranking · ripgrep)
sync
6-target sync (incl. plan-driven figma source pull)
update
Upgrade: git pull + install + migrate
clear-logs
Clean global log directory
purge
Full reset (double confirmation)
help
Usage guide and examples
UI Testing (Standalone)
🧪
test
Full simulator test - screenshot all screens
🌙
test "dark mode"
Dark mode bug test
test "accessibility"
Accessibility audit - labels, tap targets, contrast
🔤
test "dynamic type"
Large text size test
📦
test "store-ready"
Full audit: visual + accessibility + archive compliance
🔐
test "biometric"
Face ID / Touch ID flow test
test "performance"
Launch time + scroll performance test
Stack Swap
Swap platform skills on-the-fly
stack iosstack androidstack backendstack mobilestack all
Skills (60+)
Pluggable knowledge modules injected into agents
🍎iOS · SwiftUI · UIKit · CoreData
🤖Android · Compose · Hilt · Room
FastAPI · NestJS · Docker · CI/CD
🌐React · Next.js · Tailwind · Vue
Clean Code · TDD · Architecture · Security
🔧Git Actions · Figma · ASO · Audit Scripts
Pipeline Phases
0
Init
Multi-select picker
Identity routing
Fetch-fail 4-opt
N worktrees (serial)
Branch collision auto -v2
Step 0 language (en/tr)
1
Analysis
Parallel Explore
Opus model
Codebase scan
2
Planning
Task breakdown
Architecture
User approval
Todo repo tagging
3
Dev
TDD: Red→Green→Refactor
Per-repo worktree switch
Build verify
Max 3 retry
component → figma-to-component skill delegate
4
Review
Deterministic gates BEFORE AI review
Combined diff (repo headers)
crossRepo findings
Claude: Opus + Sonnet · Copilot: + GPT-5.4 → triage
5
Test
UI Bug Hunter
Dark/Light test
A11y audit
Simulator boot
6
Commit
Per-repo commit + push
Push-must-succeed (5 retries)
Cross-linked sibling PRs
Never auto-closes issues (4 approvals)
7
Report
Step 1: Channels dispatch — PR · Jira · Confluence · Wiki (multi-select, humanizer)
Autopilot ALWAYS pauses — 30-min timeout → session ends (resumable)
Step 2: agent-log · Quality & Metrics · telemetry
Step 3: Knowledge base + memory
Phase 0 - Interactive Init (8 Steps)
PARSE
Parse input, fetch from Jira/GitHub
PROJECT
Multi-select picker · recentGroups
BRANCH
TTL filter · collision auto -v2/-v3 · fetch-fail 4-opt
DESIGN?
Figma URL / screenshot (optional)
BRANCH NAME
Shared name · kebab rules · 50-char
IDENTITY
platformIdentityRouting · per-repo lookup
INSTRUCTION
Auto-detect .instructions/ directory
WORKSPACE
Multi-repo: N worktrees · serial · rollback
Sub-Agents (6)
🔍
Code Reviewer
Security, architecture, quality
🗺
Explorer
Codebase scan, patterns, impact areas
🍎
iOS Architect
Design decisions, patterns, modules
🤖
Android Architect
Design decisions, patterns, modules
Backend Architect
API design, data modeling, architecture
🛡
Security Auditor
Vulnerabilities and compliance
Component Pipeline (Figma → SwiftUI)
Figma URL detect
Token extraction
Config + View
Modifiers
Code Connect
3-layer tests
14-item review
Key Features
Phase 7 Report (Jira → Wiki+Figma → Confluence → Log → Knowledge)Skill Security Scanner (tiered · warn-only · CI strict · /multi-agent:scan)Log Search (smart ranking · filters · ripgrep · /multi-agent:search)Generic Figma-to-Component (iOS + Android)4-Adapter Wiki (submodule / in-repo / github-wiki / separate-repo)Issue → Jira → Wiki Triad (autopilot-aware)Corporate Hosts in Setup (Jira / Confluence / Bitbucket inline prompt)Live Progress-Line Contract (quiet/normal/verbose)Plan-Driven Upstream Sync (--plan → review → apply)Overlay Protection (local-overlay: true frontmatter)Cross-CLI Parity (26 cmds + 37 figma skills identical)Build queue serialized with lock fileAuto-detects iOS/Python/Node/Docker/GoDeterministic gates BEFORE AI reviewMulti-Repo OrchestrationPlatform Identity RoutingPush-Must-Succeed (5 retries)Fetch-Fail 4-Option FlowNever auto-closes issues (4 approvals)PAT Audit Logging (SHA-256)49 Smoke Suites (~870 assertions)Lazy file loading for token efficiencyPre-commit secret detection hookAutopilot Phase 2 Safety Classifier (v7.0.G · default-on · score≥50 → pause)Dynamic Trigger-Based Skill Loading (v7.0.I · 193-entry index · install.js --index-only)Per-Persona Model Routing (v6.1.D · preferredModel frontmatter · explorer → sonnet)OpenTelemetry Span Emission (v6.1.C · MULTI_AGENT_OTEL_SPANS=1 · JSONL)Phase 4 Disagreement Round (v6.1.B · opt-in · mixed-verdict rebuttal pass)Cost Summary in PR Body (v6.1.J · per-phase tokens + est. USD)Per-Repo File-System Memory Layer (v6.2.E · .multi-agent/memory/ · opt-in)Golden-Task Eval Harness (v6.2.F · 2 seed fixtures · wired to npm test)Skill Manifest Signing + Verify (v6.2.H · 193 SHA-256 · install-time warn-only)
Output
Reviewed PR
Per-repo PRs · cross-linked siblings
📊
Report
Quality & Metrics · token rollup
🧠
Knowledge
Cross-session memory
📦
Components
Config + View + Tests
📁
Logs
~/.claude/logs/multi-agent/{project}/{task-id}/
Live Replay — watch commands run end-to-end
Mode
Input
claude-code · /multi-agent PROJ-1234
Tracker
8 phases · 2-model review · Report delivery
0Init
1Analysis
2Planning
3Dev
4Review
5Test
6Commit
7Report
running… · line 0/42
Simulated replay — actual pipeline emits the same progress-line shape (`→ verb object target`) at immediate flush.

Features

8-Phase Pipeline

Complete automated workflow from branch to handoff.

  • Phase 0 - Init: branch, identity, worktree setup
  • Phase 1 - Analysis: codebase exploration with Opus
  • Phase 2 - Planning: task decomposition + approval gate (clarification max 2 rounds + free-text edit loop), normal mode only
  • Phase 3 - Dev: TDD red-green-refactor (component tasks delegate to figma-to-component)
  • Phase 4 - Review: CLI-aware parallel review + Opus triage (Claude: 2-model · Copilot: 3-model)
  • Phase 5 - Test: optional UI testing
  • Phase 6 - Commit: conventional commits → PR → issue update
  • Phase 7 - Report: Jira comment → Wiki + Figma screenshots → Confluence → log + knowledge capture

CLI-Aware Parallel Review + Opus Triage

Reviewer set depends on the host CLI: GPT is only natively reachable from Copilot, so Claude runs a 2-model set and Copilot runs a 3-model set. Opus triage runs on both.

  • Deterministic gates first: build + lint + test + secret scan
  • Claude Code (2 parallel): Opus (security + architecture) + Sonnet (quality + correctness)
  • Copilot CLI (3 parallel): GPT + Opus + Sonnet
  • Opus triage filters false positives and out-of-scope items
  • Only actionable findings loop back to Phase 3

Plan Approval Gate

Phase 2 refuses to start Phase 3 on underspecified tickets without a human read-through.

  • Clarification mode (max 2 rounds): ambiguous tickets, missing acceptance criteria, UI without Figma, vague language, scope drift
  • Approval loop: plan rendered with summary / approach / risk / scope / files / todos. Respond with onayla / iptal / free-text edit
  • Skipped for fast and autopilot modes to preserve their zero-interaction contracts

Store-Ready Compliance (iOS + Android)

Two parallel skills wrap external tooling and report store-submission findings in a normalized severity-grouped shape.

  • apple-archive-compliance: wraps the ios_app_store_audit MCP tool inside @mmerterden/dev-toolkit-mcp ≥ 2.4. 17-rule catalog cross-referenced to Apple ITMS error codes + App Store Review Guidelines (privacy manifest, required-reason API, Info.plist, code signing, entitlements, embedded SDK, IPv6, debug leak, ...)
  • google-play-compliance: orchestrates bundletool + aapt2 + apksigner. 21-rule catalog across Technical · Security · Privacy · Hygiene categories
  • /multi-agent:test "store-ready" auto-detects platform, runs visual + accessibility sweep, dispatches the matching skill, merges into one severity report
  • Wired into 4 consumers: store-ready test, Phase 4 Security Auditor, /multi-agent:review, /multi-agent:channels

Per-Phase Token Tracking

Live token accounting in the tracker.

  • phase-tracker.sh `tokens <phase_id> <in> <out>` accumulates LLM usage against the active phase
  • Renders ` · Nk tok` next to elapsed time + Total footer
  • Per-phase token budgets defined in token-budget.json with warn/max thresholds, enforced in CI
  • Phase 7 sub-step tracking: Jira / Wiki / Confluence / Log / Knowledge advanced through pending → in_progress → completed / failed / skipped

Per-Task Cost Breakdown

Every Phase 7 run appends a 4-column cost block to agent-log.md.

  • Renderer: pipeline/scripts/render-agent-log-cost.sh — tokens in/out + estimated USD per phase
  • Sourced from phase-tracker.sh tokens accumulators × cost-table.json (Opus / Sonnet / Haiku / GPT-5.4)
  • Independent of channels-side reportContent.costSummary — agent-log always carries the breakdown
  • Single call site: LOG_METRIC_FORWARD_TO_TRACKER=1 mirrors metrics.jsonl into the tracker — JSONL + cost block stay in sync
  • 13 smoke assertions (smoke-agent-log-cost.sh)

Diff Risk Scoring

Phase 4 Step 1.75 advisory — heuristic, deterministic, sub-second, no LLM.

  • Walks the diff, scores each file by 7 signals: security_path ×3, migration ×4, public_api ×2, no_test_change ×2.5, complexity_delta ×1.5, ui_critical ×1.5, loc_changed ×1
  • Top-N risk-ranked files inject into each reviewer's prompt as ${PRIORITY_FILES}
  • Reviewers read those files first but still review the entire diff — never gates the pipeline
  • 14 smoke assertions across iOS + Android fixtures (smoke-diff-risk.sh)
  • Toggle: prefs.global.diffRiskAdvisory (default ON)

Test Gap Detection

Phase 5 Step 0 advisory — static detector for newly added public symbols missing a paired test.

  • Stack-specific rules ship for iOS / Android / Python / Node.js (test-gap-rules/*.json)
  • iOS Views + Android @Composable + interfaces + public protocols → important severity; other public API additions → suggestion
  • Optional gating via prefs.testGap.blockingThreshold — when set, becomes a Phase 4 rework finding
  • --severity-promote forces all gaps to important (audit mode)
  • 22 smoke assertions across 4 stacks (smoke-test-gap.sh)

Triage Memory + Semantic Search

Per-repo triage corpus that powers Phase 1 enrichment, Phase 4 prior-art lookup, and /multi-agent:search --semantic.

  • ~/.claude/memory/multi-agent/<repo-slug>/triage-corpus.jsonl — append-only, idempotent ingest, per-repo isolation (never cross-leaks between projects)
  • Phase 7 ingests every accepted/deferred/rejected finding; Phase 4 triage attaches up to 3 prior-art hits per raw finding with explicit bias hedge
  • /multi-agent:search "<text>" --semantic routes the query to the corpus instead of agent-log grep
  • Token-overlap recall, zero deps, Node-18-compatible. Schema is forward-compatible with a future vector BLOB column.
  • 11 smoke assertions (smoke-triage-memory.sh, uses HOME override to never touch the user's real memory dir)
  • Toggle: prefs.global.priorArtEnrichment.enabled (default ON)

Multi-Repo Integration Build

Phase 6 Step 0 builds the host project that integrates multiple repos. Learn once, auto-apply forever.

  • Fires only when state.projects.length >= 2. Single-repo skips silently
  • Use case: shared codegen lib + consuming UI lib + host app. Mismatches surface only when the full chain builds together
  • First encounter prompts: register host (path/platform/scheme/submodules/resolve-cmd/build-cmd) / mark noHost / skip this run
  • Auto-run loop: submodule update → resolve → build → error-delta evaluation vs baseline
  • Autopilot refuses to prompt on unknown combos (won't silently build wrong thing)

Interactive Launchers

Browse and launch tasks from your terminal.

  • /multi-agent:jira: lists your open Jira issues, pick one to start
  • /multi-agent:issue: lists unassigned GitHub issues, auto-assigns on pick
  • Shared interactive flow: choose branch → mode → autopilot → launch
  • No more copy-pasting URLs

Cross-Session Learning & Memory

Incremental learning that persists across sessions.

  • Captures patterns, decisions, and architecture notes per project
  • Phase 7 extracts user corrections, constraints, and references into auto-memory
  • Phase 1 injects memories so future runs respect prior feedback
  • Max 3 new memories per run, merge-over-duplicate strategy
  • Context auto-compacts at 65% usage (vs default ~80%)

Figma → SwiftUI Pipeline

Full component generation from Figma URL.

  • Produces Configuration, View, Modifiers, Code Connect, and docs
  • Design token mapping and accessibility IDs
  • 3-layer testing: ViewInspector + Snapshot + Unit
  • 14-item review checklist
  • Wiki page + issue body update as nested SubPhases of Phase 3
  • Activated automatically when task contains a Figma URL

Skill Security Scanner

Tiered pattern scanner for skill content. Blocks supply-chain surprises before install.

  • /multi-agent:scan: four severity tiers (critical · high · medium · low)
  • Auto-runs warn-only on every install; --strict for CI blocks PRs on findings
  • Catches shell-pipe exec, base64-decode exec, eval-of-network, unicode bidi override (trojan source), JS eval(), Python exec(), hardcoded credentials, pastebin raw fetches
  • Three integration points: CI · install.js · slash command

Log Search

Full-text search across every pipeline run you have ever done.

  • /multi-agent:search "query" across ~/.claude/logs/multi-agent/**/agent-log.md
  • Smart ranking: match count × recency weight
  • Filters: --project, --since 7d|YYYY-MM-DD, --phase N
  • Output modes: colored text, --json, --tsv
  • Local-only, no network

6 Specialized Sub-Agents

Purpose-built agents for each concern.

  • code-reviewer: CLI-aware parallel review + Opus triage (Claude: 2-model · Copilot: 3-model)
  • explorer: Phase 1 codebase scanner
  • ios-architect: iOS architecture review
  • android-architect: Android architecture review
  • backend-architect: API design review
  • security-auditor: OWASP-based security audit

Pre-Commit Hooks & Scripts

6 automated scripts installed to ~/.claude/scripts/.

  • pre-commit-check.sh: secret detection (API keys, AWS AKIA, private keys, .env, Firebase JSON)
  • stack-swap.sh: auto-detect project stack on session start
  • keychain-save.sh: save tokens to macOS Keychain (credentials never visible in ps)
  • github-ssh-setup.sh: SSH key generation + config
  • ui-tree-dumper.swift: iOS accessibility tree dumper
  • sync-parity-check.sh: Claude / Copilot / repo drift detection

Stack Swap & Auto-Detection

Stack auto-detected at session start by stack-swap.sh based on project markers (.xcodeproj, build.gradle, package.json).

  • iOS/Swift, Android/Kotlin, Backend, Frontend, or mobile combined
  • Manual override: /multi-agent:stack [ios|android|backend|frontend|mobile|all]
  • Loads matching guide (swiftui-guide, android-guide, backend-guide, frontend-guide)
  • Platform-specific compliance rules map 1:1 to audit checks

UI Bug Hunter & Audits

Automated visual testing and compliance audits.

  • Boots simulators, captures screenshots across light/dark and large text
  • Accessibility audits and store compliance checks
  • Uses native CLI tools (xcrun simctl, adb, codesign, aapt2)
  • On-demand: dark mode, accessibility, store-ready, biometric, performance

Cross-CLI Parity

One source of truth, five platforms, zero drift.

  • 31 slash commands kept byte-identical between Claude Code (colon form) and Copilot CLI (dash form)
  • /multi-agent:sync re-aligns every target in one shot
  • Full orchestration also ports to Cursor, Antigravity, and VS Code Copilot Chat via per-platform adapters (subagents/workflow + /multi-agent command + dev-toolkit MCP)
  • 204 unified skills shipped to all targets

Available Commands

Claude Code (colon-form)Copilot CLI (dash-form)CategoryDescription
/multi-agent "task"multi-agent "task"Full pipeline

Full 8-phase pipeline with worktree + interactive confirmations. Analysis → Planning (Plan Approval Gate) → TDD Dev → Parallel Review (Opus triage) → Test → Commit/PR → Report. The default entry point for all tasks.

/multi-agent:autopilot "task"multi-agent-autopilot "task"Full pipeline

Full 8-phase pipeline with worktree, **no confirmations**. Autopilot contract: zero user interaction. Phase 2 gate, Phase 5 test prompt, and Phase 6 commit/PR prompts are all skipped. Safety classifier still runs; high-risk plans (security path, schema migration, delete-without-test) trigger a one-time pause.

/multi-agent:local "task"multi-agent-local "task"Full pipeline

Full 8-phase pipeline on the current branch — **no worktree**, interactive confirmations preserved. Same pipeline as `/multi-agent` but stays in your current editor/IDE session (no split workspace). Single-repo only.

/multi-agent:local-autopilot "task"multi-agent-local-autopilot "task"Full pipeline

Full 8-phase pipeline + no worktree + no confirmations. Combines `autopilot` + `local`: fire-and-forget execution on the current branch. Safety classifier still honored.

/multi-agent:dev "task"multi-agent-dev "task"Fast mode

Skips Analysis (Phase 1) + Planning (Phase 2) + Review (Phase 4) gates — jumps to Dev. Uses Opus for development instead of Sonnet (Opus catches more issues without the review safety net). For small, well-scoped tasks where the diff is obvious.

/multi-agent:dev-autopilot "task"multi-agent-dev-autopilot "task"Fast mode

Fast path + autopilot. Init → Dev(Opus) → Commit → Report with no confirmations. Second-fastest mode; ~2–3x faster than full pipeline, no review/triage safety net.

/multi-agent:dev-local "task"multi-agent-dev-local "task"Fast mode

Fast + local. Same Init → Dev(Opus) → Commit → Report flow, on the current branch with confirmations preserved.

/multi-agent:dev-local-autopilot "task"multi-agent-dev-local-autopilot "task"Fast mode

Fastest possible mode. Fast + local + autopilot — Init → Dev(Opus) → Commit → Report with no worktree and no confirmations. Ideal for trivial fixes you'd otherwise do manually.

/multi-agent:jiramulti-agent-jiraInteractive

Browse your open Jira issues in a terminal picker. Select one → confirm branch → choose mode (normal / autopilot / fast) → pipeline launches. Authenticated via Keychain-stored Jira token.

/multi-agent:issuemulti-agent-issueInteractive

Browse unassigned GitHub issues. Select one → auto-assigns to you + moves to In Progress → confirm branch → choose mode → pipeline launches. Handles GitHub-only repos that don't use Jira.

/multi-agent:statusmulti-agent-statusOps

Lists every active task across all projects — ID, phase, branch, status, last updated. Finds tasks that stopped mid-pipeline.

/multi-agent:log [#id]multi-agent-log [#id]Ops

Shows a task's `agent-log.md` — full pipeline trace including analysis output, plan approval history, review findings, commit shas, Phase 7 channels dispatch results. No id → shows latest.

/multi-agent:resume [#id]multi-agent-resume [#id]Ops

Resume a paused / failed task from its last completed phase. State is file-based (`agent-state.json`) so resumes work across sessions and machines.

/multi-agent:kill [#id]multi-agent-kill [#id]Ops

Stop a task — deletes its worktree + branch after double confirmation. Log is archived (not deleted) for audit. Destructive; always confirms.

/multi-agent:purgemulti-agent-purgeOps

⚠️ Nukes all worktrees, branches, logs, and state files. Double-confirmation required. For starting fully fresh — rarely needed in normal usage.

/multi-agent:reviewmulti-agent-reviewOps

Runs Phase 4 review **standalone** on the current branch's diff — parallel Claude Code 2-model (Opus + Sonnet) or Copilot CLI 3-model (GPT-5.4 + Opus + Sonnet), then Opus triage. Opt-in deep audit before PR.

/multi-agent:channelsmulti-agent-channelsOps

Multi-channel reporter for PR body / Jira comment / Confluence page / component Wiki. Multi-select channel × multi-select content (normal analysis / technical details / test scenarios / auto-diff summary / manual note / cost summary). Humanizer pass per channel, reviewer-preserving Bitbucket PR PUT. Phase 7 delegates to it.

/multi-agent:scanmulti-agent-scanOps

Tiered pattern scanner for skill content (critical / high / medium / low). Catches shell-pipe exec, base64-decode exec, eval-of-network, unicode bidi override (trojan source), hardcoded credentials, pastebin raw fetches. Auto-runs warn-only on every install; `--strict` for CI.

/multi-agent:search "query"multi-agent-search "query"Ops

Full-text search across `~/.claude/logs/multi-agent/**/agent-log.md`. Smart ranking (match count × recency weight), filters by `--project`, `--since`, `--phase`. Output modes: colored text, `--json`, `--tsv`. Local-only, no network.

/multi-agent:setupmulti-agent-setupMeta

First-run wizard — scans Keychain for tokens (GitHub, Jira, Confluence, Bitbucket, Telegram, npm, Vercel, Firebase), prompts for Git identity (personal / work), maps per-project platform identity routing. All creds stay in Keychain; prefs file holds only pointers.

/multi-agent:syncmulti-agent-syncMeta

Single command to sync the full multi-agent ecosystem: Claude Code skill dir, Copilot CLI skill dir, pipeline source repo, website data, remote-control endpoints. 26 commands kept byte-identical between CLIs.

/multi-agent:updatemulti-agent-updateMeta

Updates the pipeline to the latest version — `git pull` + `install.js` + `migrate-prefs.mjs` schema migration. Idempotent.

/multi-agent:helpmulti-agent-helpMeta

Bilingual usage guide. Renders in EN or TR based on `prefs.global.promptLanguage`. Shows all 26 commands, modes matrix, input types, preference reference.

/multi-agent:stackmulti-agent-stackMeta

Manually override the detected stack (ios / android / backend / frontend / mobile / fullstack / all). Auto-detection runs at session start via `stack-swap.sh`; override when working polyglot.

/multi-agent:refactormulti-agent-refactorMeta

Scores your codebase against industry best practices + competitor projects, proposes an improvement plan, asks for approval, then runs the pipeline to apply the changes. For periodic tech-debt sweeps.

/multi-agent:testmulti-agent-testMeta

UI Bug Hunter — auto-detects platform (iOS simctl / Android adb) and runs screenshot + tap + analyze on the booted device. Also handles `store-ready` preset that dispatches the Apple/Google compliance skill.

/multi-agent:manual-testmulti-agent-manual-testMeta

Standalone Phase 5 — checks out the active task's branch locally so you can run manual tests in Xcode/Android Studio. Separate from `:test` (which is auto UI bug hunting).