Multi-Agent Pipeline

9-phase AI pipeline · Plan Approval Gate (v5.3.0) · CLI-aware parallel review (v5.2.2) · Phase 7 Report (Jira · Wiki · Confluence · report) · generic figma-to-component (iOS + Android) · 34 smoke suites · local-only CI · plan-driven source sync

End-to-end AI development pipeline (v5.3.3).

  • 9-phase flow: analysis → planning (Plan Approval Gate — clarification + free-text edit loop for normal mode) → TDD dev → parallel review → test → commit/PR → Report
  • Phase 2 Plan Approval Gate (v5.3.0): on ambiguous tickets (missing acceptance criteria, no Figma link, no endpoint contract, vague language) the pipeline asks structured questions (max 2 rounds), regenerates the plan with answers, then loops on free-text edit requests ("let's touch auth but keep LoginView out of scope") until the user approves or aborts; skipped for --dev, autopilot, --dev autopilot to preserve fast/zero-interaction contracts
  • Phase 4 Review is CLI-aware (v5.2.2): Claude Code runs Opus + Sonnet (2-model), Copilot CLI runs GPT-5.4 + Opus + Sonnet (3-model); Opus triage on both filters false-positives
  • Phase 7 Report consolidates external delivery + internal capture: humanizer'd Jira comment → component wiki + Figma screenshots (4-adapter) → optional Confluence page → agent-log + Quality & Metrics + telemetry → knowledge capture + memory updates
  • v5.1.0 skill security scanner (tiered pattern catalog, FORBIDDEN-context whitelist) + log search (/multi-agent:search full-text across ~/.claude/logs/multi-agent/)
  • v5: generic figma-to-component pipeline for iOS (SwiftUI) and Android (Jetpack Compose) — 5 platform-specific skills under figma-ios/figma-android + 27 shared skills under figma-common, idempotent placeholder transform, overlay protection
  • issue → jira → wiki triad, live progress-line contract, plan-driven source sync
  • multi-repo mode with per-repo worktrees, platform identity routing, push-must-succeed (5 retries)
  • bilingual setup (en/tr) — TR now covers interactive prompts, TaskCreate subjects, AND phase banners via PHASE_LANG env + auto sentinel
  • cross-CLI full parity across 20 commands + 32 figma skills
  • 34 smoke suites, personal-data leak gate (13 patterns) always green
  • Opt-in telemetry, pre-commit secret scan, 6 sub-agents, cross-session memory
Private RepositoryThis is a private repository. You need collaborator access to view the source code on GitHub.
9
Phases
177
Skills
20
Commands
5.3.3
Version
Stable
Status

Tech Stack

Node.jsClaude CodeCopilot CLI177 Unified SkillsGitHub PackagesJSON Schemas6 Sub-Agents

How It Works

Setup
🔑
Keychain
11 services · servicePatMap per identity
👤
Git Identity
identities[] · servicePatMap ownership
🔍
Repo Discovery
Local + Bitbucket + GitHub 90-day
🗺
Identity Routing
URL glob → identity auto-pick
📋
Stack Detect
iOS · Android · Backend · Frontend
🔒
Pre-commit
Secret scanning
Inputs (5 types)
GitHub Issue#316
GitHub URLgithub.com/.../issues/N
Jira IDPROJ-1234
Jira URLjira.../browse/...
Free Text"Fix dark mode bug"
Commands
multi-agent "#316"
Normal0→1→2→3→4→5→6→7
multi-agent "PROJ-1234"
Jira0→1→2→3→4→5→6→7
multi-agent "#316" --dev
--dev0→3→6→7
multi-agent "PROJ-1234" --dev
Jira + --dev0→3→6→7
multi-agent "#316" autopilot
autopilot0→1→2→3→4→5→6→7
multi-agent "#316" --dev autopilot
--dev autopilot0→3→6→7
multi-agent "#316" --local
--local
multi-agent "#316" --local --dev
--local + --dev
Helper Commands
setup
Keychain token + Git identity onboarding
scan
Skill security: 4-tier pattern scanner (warn-only default)
search
Log search: smart ranking + filters, ripgrep primary
status
List tasks across all projects
log #N
Show task log
resume #N
Resume a stopped task
kill #N
Delete worktree, logs preserved
review-only
Review only the current diff
enrich
Enrich Jira + PR description from diff
stack [platform]
Swap platform skills on-the-fly
language [en|tr]
Show or set interactive-prompt language
refactor
Autonomous refactoring with TDD safety net
jira
Browse your open Jira issues → pick → launch
issue
Browse unassigned GitHub issues → pick → launch
clear-logs
Clean global log directory
purge
Full reset (double confirmation)
sync
6-target sync incl. plan-driven figma source pull
dev
Fast: Init→Dev(Opus)→Commit→Report
dev-autopilot
Fastest: dev + zero confirmations
autopilot
Full pipeline, skip confirmations
UI Testing (Standalone)
🧪
test
Full simulator test - screenshot all screens
🌙
test "dark mode"
Dark mode bug test
test "accessibility"
Accessibility audit - labels, tap targets, contrast
🔤
test "dynamic type"
Large text size test
📦
test "store-ready"
Full audit: visual + accessibility + archive compliance
🔐
test "biometric"
Face ID / Touch ID flow test
test "performance"
Launch time + scroll performance test
Stack Swap
Swap platform skills on-the-fly
stack iosstack androidstack backendstack mobilestack all
Skills (60+)
Pluggable knowledge modules injected into agents
🍎iOS · SwiftUI · UIKit · CoreData
🤖Android · Compose · Hilt · Room
FastAPI · NestJS · Docker · CI/CD
🌐React · Next.js · Tailwind · Vue
Clean Code · TDD · Architecture · Security
🔧Git Actions · Figma · ASO · Audit Scripts
Pipeline Phases
0
INIT
Multi-select picker
Identity routing
Fetch-fail 4-opt
N worktrees (serial)
Branch collision auto -v2
Step 0 language (en/tr)
1
ANALYSIS
Parallel Explore
Opus model
Codebase scan
2
PLANNING
Task breakdown
Architecture
User approval
Todo repo tagging
3
DEV
TDD: Red→Green→Refactor
Per-repo worktree switch
Build verify
Max 3 retry
component → figma-to-component skill delegate
4
REVIEW
Deterministic gates BEFORE AI review
Combined diff (repo headers)
crossRepo findings
Opus · GPT-5.4 · Sonnet → triage
5
USER TEST
UI Bug Hunter
Dark/Light test
A11y audit
Simulator boot
6
COMMIT
Per-repo commit + push
Push-must-succeed (5 retries)
Cross-linked sibling PRs
Never auto-closes issues (4 approvals)
7
Report
Step 1: Jira comment (humanizer + test scenarios)
Step 2: Component wiki (4-adapter) + Figma screenshots
Step 3: Confluence page
Step 4: Report · Quality & Metrics · audit log
Step 5: Knowledge base + memory
Phase 0 - Interactive Init (8 Steps)
PARSE
Parse input, fetch from Jira/GitHub
PROJECT
Multi-select picker · recentGroups
BRANCH
TTL filter · collision auto -v2/-v3 · fetch-fail 4-opt
DESIGN?
Figma URL / screenshot (optional)
BRANCH NAME
Shared name · kebab rules · 50-char
IDENTITY
platformIdentityRouting · per-repo lookup
INSTRUCTION
Auto-detect .instructions/ directory
WORKSPACE
Multi-repo: N worktrees · serial · rollback
Sub-Agents (6)
🔍
Code Reviewer
Security, architecture, quality
🗺
Explorer
Codebase scan, patterns, impact areas
🍎
iOS Architect
Design decisions, patterns, modules
🤖
Android Architect
Design decisions, patterns, modules
Backend Architect
API design, data modeling, architecture
🛡
Security Auditor
Vulnerabilities and compliance
Component Pipeline (Figma → SwiftUI)
Figma URL detect
Token extraction
Config + View
Modifiers
Code Connect
3-layer tests
14-item review
Key Features
Phase 7 Report (Jira → Wiki+Figma → Confluence → Log → Knowledge)Skill Security Scanner (tiered · warn-only · CI strict · /multi-agent:scan)Log Search (smart ranking · filters · ripgrep · /multi-agent:search)Generic Figma-to-Component (iOS + Android)4-Adapter Wiki (submodule / in-repo / github-wiki / separate-repo)Issue → Jira → Wiki Triad (autopilot-aware)Corporate Hosts in Setup (Jira / Confluence / Bitbucket inline prompt)Live Progress-Line Contract (quiet/normal/verbose)Plan-Driven Upstream Sync (--plan → review → apply)Overlay Protection (local-overlay: true frontmatter)Cross-CLI Parity (20 cmds + 32 figma skills identical)Build queue serialized with lock fileAuto-detects iOS/Python/Node/Docker/GoDeterministic gates BEFORE AI reviewMulti-Repo OrchestrationPlatform Identity RoutingPush-Must-Succeed (5 retries)Fetch-Fail 4-Option FlowNever auto-closes issues (4 approvals)PAT Audit Logging (SHA-256)36 Smoke Suites (≥506 assertions)Lazy file loading for token efficiencyPre-commit secret detection hook
Output
Reviewed PR
Per-repo PRs · cross-linked siblings
📊
Report
Quality & Metrics · token rollup
🧠
Knowledge
Cross-session memory
📦
Components
Config + View + Tests
📁
Logs
~/.claude/logs/multi-agent/{project}/{task-id}/
Live Replay — watch commands run end-to-end
Mode
Input
claude-code · /multi-agent PROJ-1234
Tracker
8 phases · 3-model review · Report delivery
0INIT
1ANALYSIS
2PLANNING
3DEV
4REVIEW
5TEST
6COMMIT
7Report
running… · line 0/42
Simulated replay — actual pipeline emits the same progress-line shape (`→ verb object target`) at immediate flush.

Features

9-Phase Pipeline

Complete automated workflow from branch to handoff.

  • Phase 0 - Init: branch, identity, worktree setup
  • Phase 1 - Analysis: codebase exploration with Opus
  • Phase 2 - Planning: task decomposition + Plan Approval Gate (clarification max 2 rounds + free-text edit loop) — normal mode only
  • Phase 3 - Dev: TDD red-green-refactor cycles (component tasks delegate to figma-to-component skill)
  • Phase 4 - Review: CLI-aware parallel review + Opus triage (Claude: 2-model · Copilot: 3-model)
  • Phase 5 - Test: optional UI testing
  • Phase 6 - Commit: conventional commits → PR → issue update
  • Phase 7 - Report: Jira comment → Wiki + Figma screenshots → Confluence → Report + log → Knowledge & memory capture

CLI-Aware Parallel Review + Opus Triage (v5.2.2)

Reviewer set depends on the host CLI — GPT-5.4 is only natively reachable from Copilot CLI, so Claude Code runs a 2-model set and Copilot CLI runs a 3-model set. Opus triage runs on both.

  • Deterministic gates first: build + lint + test + secret scan
  • Claude Code (2 parallel): Opus (security + architecture) + Sonnet (quality + correctness)
  • Copilot CLI (3 parallel): GPT-5.4 (edge cases + cross-provider diversity) + Opus + Sonnet
  • Opus triage pass filters false-positives and out-of-scope items on both CLIs
  • Only actionable findings loop back to Phase 3

Plan Approval Gate (v5.3.0)

Phase 2 now refuses to start Phase 3 on underspecified tickets without a human read-through.

  • Clarification mode (max 2 rounds): if the Jira/issue is ambiguous — missing acceptance criteria, UI task without a Figma link, API task without an endpoint contract, vague language, parent-story scope drift — the pipeline asks structured questions, regenerates the plan with answers, and caps at 2 rounds to prevent grooming loops.
  • Approval loop: plan rendered with summary / approach / risk / scope / files / todos. User responds with 'onayla' / 'iptal' / free-text edit (e.g. "let's touch the auth service too but keep LoginView out of scope"). Edit requests loop through Opus + validator until approve or abort. No iteration cap — user controls exit.
  • Skipped for --dev, autopilot, and --dev autopilot to preserve their fast/zero-interaction contracts.

Interactive Launchers (v3.1.0)

Browse and launch tasks from your terminal.

  • /multi-agent:jira - lists your open Jira issues, pick one to start
  • /multi-agent:issue - lists unassigned GitHub issues, auto-assigns on pick
  • Shared interactive flow: choose branch → mode → autopilot → launch
  • No more copy-pasting URLs

Token Budget Enforcement (v3.0.0)

Per-phase token limits for cost control.

  • Defined in token-budget.json with warn/max thresholds
  • smoke-token-budget.sh enforces in CI
  • Total pipeline cost: 13,282 tokens (−28% from v2.6)
  • Phase docs lazy-loaded - only current phase is read

6 Specialized Sub-Agents

Purpose-built agents for each concern.

  • code-reviewer - CLI-aware parallel review + Opus triage (Claude: 2-model · Copilot: 3-model)
  • explorer - Phase 1 codebase scanner
  • ios-architect - iOS architecture review
  • android-architect - Android architecture
  • backend-architect - API design review
  • security-auditor - OWASP-based security audit

Pre-Commit Hooks & Scripts

6 automated scripts installed to ~/.claude/scripts/.

  • pre-commit-check.sh - secret detection (API keys, AWS AKIA, private keys, .env, Firebase JSON)
  • stack-swap.sh - auto-detect project stack on session start
  • keychain-save.sh - save tokens/JSON to macOS Keychain (credentials never visible in ps)
  • github-ssh-setup.sh - SSH key generation + config
  • ui-tree-dumper.swift - iOS accessibility tree dumper
  • sync-parity-check.sh - Claude↔Copilot↔repo drift detection

Flexible Execution Modes

Multiple modes for different workflows.

  • Normal - all 8 phases with full review and Report delivery
  • --dev - Init → Dev (Opus) → Commit → Report for fast iteration
  • --local - works directly on your branch without worktrees
  • autopilot - skips all confirmations for zero-interaction runs
  • Combine: --dev autopilot is the fastest path from issue to PR

62-Test CI Suite

Comprehensive CI-ready test coverage.

  • 13 test suites (62 tests) covering every script and phase doc
  • 10 adversarial eval fixtures for triage semantic regression
  • 4 JSON schema validations - all runnable via 'npm test'
  • smoke-pre-commit.sh tests secret detection with real temp git repos
  • smoke-token-budget.sh enforces per-phase token limits

Stack Swap & Auto-Detection

Auto-detects project stack and loads matching guides.

  • iOS/Swift, Android/Kotlin, Backend, Frontend, or mobile combined
  • Switch on the fly with /multi-agent stack [platform]
  • Platform-specific compliance rules map 1:1 to audit checks

UI Bug Hunter & Audits

Automated visual testing and compliance audits.

  • Boots simulators, captures screenshots across light/dark and large text
  • Accessibility audits and store compliance checks
  • Uses native CLI tools (xcrun simctl, adb, codesign, aapt2)
  • On-demand: dark mode, accessibility, store-ready, biometric, performance

Cross-Session Learning & Memory

Incremental learning that persists across sessions.

  • Captures patterns, decisions, and architecture notes per project
  • Phase 7 extracts user corrections, constraints, and references into auto-memory
  • Phase 1 injects memories so future runs respect prior feedback
  • Max 3 new memories per run, merge-over-duplicate strategy
  • Context auto-compacts at 65% usage (vs default ~80%)

Figma → SwiftUI Pipeline

Full component generation from Figma URL.

  • Produces Configuration, View, Modifiers, Code Connect, and docs
  • Design token mapping and accessibility IDs
  • 3-layer testing: ViewInspector + Snapshot + Unit
  • 14-item review checklist
  • Wiki page + issue-body update as nested SubPhases of Phase 3
  • Activated automatically when task contains a Figma URL

18 Unified Slash Commands (v3.3.0)

Full command parity between Claude Code and Copilot CLI.

  • 18 slash commands: issue, jira, status, log, resume, kill, review, purge, dev, test, autopilot, help, sync, setup, enrich, refactor, dev-autopilot + main
  • Claude Code: /multi-agent:command — Copilot CLI: multi-agent-command
  • /multi-agent:sync keeps both platforms identical
  • /multi-agent:refactor scores codebase, researches competitors, plans improvements
  • /multi-agent:setup onboards Keychain tokens + Git identity

Phase 7 Report — 4-adapter Wiki (v5.2.1)

All external delivery consolidated in Phase 7 Report: Jira comment, Component Wiki + Figma screenshots, Confluence (optional), then internal report + knowledge capture.

  • Four wiki layouts supported: submodule, in-repo, github-wiki, separate-repo
  • Auto-detects mode from figma-project-config or project structure
  • Fetches Figma screenshot via REST API (auto-scaled 2x, LTR+RTL, light+dark)
  • Wiki → Jira comment triad (humanizer'd) closes the loop with QA/PMs
  • local-overlay guard prevents /multi-agent:sync from flattening v5 customizations
  • Issue progress flags auto-updated: Wiki ⚪ → 🟢

Runtime Triage Validator (v2.3.0)

Deterministic validation of Phase 4 triage output.

  • validate-triage.mjs runs after every review
  • Exit 0: valid · 1: invalid (retry+fallback) · 2: over-rejection guard (>80%)
  • Exit 3: contradiction auto-corrected (bidirectional approved↔blocking)
  • triage-output.schema.json v3 with reviewer enum locked to {opus, sonnet}
  • 'fix' mandatory on accepted items

Cross-CLI Phase Tracker (v2.4.0)

Stateful card-stack renderer for both CLIs.

  • phase-tracker.sh renders identical ASCII cards (○ ● ✓ ✗ ↷) with sub-phases
  • State persists per-task with atomic writes for concurrent worktrees
  • Phase 0 Step −1 bootstraps the tracker as the first thing every run

Telemetry & Per-Model Cost Tracking (v2.5.0)

Per-model cost tracking and pipeline analytics.

  • log-metric.sh appends to metrics.jsonl (atomic, never fails pipeline)
  • aggregate-metrics.mjs: review cycles avg/p95, triage rates, cost rollup
  • Per-model breakdown: calls + duration_ms + tokens_in + tokens_out
  • Phase 7 embeds last-30-day summary

Adversarial Eval Suite (v2.5.0+v2.6.0)

10-fixture regression suite for triage prompt quality.

  • Cases: empty findings, real blocker, out-of-scope defer, false positive, mixed
  • Plus: severity mismatch, duplicate findings, stylistic misclassification, cascading, deferred with cross-reference
  • eval-triage.mjs: structural validation + coverage + no-invention assertions
  • Runs in CI on every push

145 Unified Skills Across Both CLIs (v3.5.0)

One skill set, two CLIs, zero drift.

  • Previously Claude got 84 iOS-only skills, Copilot got 60 Android/web + pipeline skills — zero overlap
  • v3.5.0 merges everything into pipeline/skills/shared/ (145 total)
  • Both Claude Code and Copilot CLI install the same source
  • Covers: pipeline orchestration (multi-agent-*), iOS/SwiftUI, Android/Compose, Kotlin, Swift, web (React/Next/Vue), backend (Python/Node/FastAPI), CI/CD, HIG, and more
  • Pipeline is now truly generic — no leaked Jira keys, project directories, or domain references