Skip to main content

Introduction

oh-my-agent is a multi-agent orchestration framework for AI-powered IDEs and CLI tools. Instead of relying on a single AI assistant for everything, oh-my-agent decomposes work across 21 specialized agents — each modeled after a real engineering team role with its own tech stack knowledge, execution protocols, error playbooks, and quality checklists.

The entire system lives in a portable .agents/ directory inside your project. Switch between Claude Code, Gemini CLI, Codex CLI, Antigravity IDE, Cursor, or any other supported tool — your agent configuration travels with your code.


The Multi-Agent Paradigm

Traditional AI coding assistants operate as generalists. They handle frontend, backend, database, security, and infrastructure with the same prompt context and the same level of expertise. This leads to:

  • Context dilution — loading knowledge for every domain wastes the context window
  • Inconsistent quality — a generalist can not match a specialist in any single domain
  • No coordination — complex features spanning multiple domains get handled sequentially

oh-my-agent solves this with specialization:

  1. Each agent knows one domain deeply. The frontend agent knows React/Next.js, shadcn/ui, TailwindCSS v4, FSD-lite architecture. The backend agent knows the Repository-Service-Router pattern, parameterized queries, JWT authentication. They do not overlap.

  2. Agents run in parallel. While the backend agent builds your API, the frontend agent is already creating the UI. The orchestrator coordinates via shared memory.

  3. Quality is built in. Every agent has a domain-specific checklist and error playbook. Charter preflight catches scope creep before code is written. QA review is a first-class step, not an afterthought.


All 21 Agents

Ideation, Architecture, and Planning

AgentRoleKey Capabilities
oma-brainstormDesign-first ideationExplores user intent, proposes 2-3 approaches with trade-off analysis, produces design documents before any code is written. 6-phase workflow: Context, Questions, Approaches, Design, Documentation, Transition to /plan.
oma-architectureSystem architecture specialistModule/service/ownership boundaries, tradeoff analysis, stakeholder synthesis. Methodologies: diagnostic routing, design-twice comparison, ATAM-style risk analysis, CBAM-style prioritization, ADR-style decision records. Cost-aware by default.
oma-pmProduct managerDecomposes requirements into prioritized tasks with dependencies. Defines API contracts. Outputs .agents/results/plan-{sessionId}.json and task-board.md. Supports ISO 21500 concepts, ISO 31000 risk framing, ISO 38500 governance.

Implementation

AgentRoleTech Stack & Resources
oma-frontendUI/UX specialistReact, Next.js, TypeScript, TailwindCSS v4, shadcn/ui, FSD-lite architecture. Libraries: luxon (dates), ahooks (hooks), es-toolkit (utils), Jotai (client state), TanStack Query (server state), @tanstack/react-form + Zod (forms), better-auth (auth), nuqs (URL state). Resources: execution-protocol.md, tech-stack.md, tailwind-rules.md, component-template.tsx, snippets.md, error-playbook.md, checklist.md, examples/.
oma-backendAPI & server specialistClean architecture (Router-Service-Repository-Models). Stack-agnostic — detects Python/Node.js/Rust/Go/Java/Elixir/Ruby/.NET from project manifests. JWT + bcrypt for auth. Resources: execution-protocol.md, orm-reference.md, examples.md, checklist.md, error-playbook.md. Supports /stack-set for generating language-specific stack/ references.
oma-mobileCross-platform mobileFlutter, Dart, Riverpod/Bloc for state management, Dio with interceptors for API calls, GoRouter for navigation. Clean architecture: domain-data-presentation. Material Design 3 (Android) + iOS HIG. 60fps target. Resources: execution-protocol.md, tech-stack.md, snippets.md, screen-template.dart, checklist.md, error-playbook.md.
oma-dbDatabase architectureSQL, NoSQL, and vector database modeling. Schema design (3NF default), normalization, indexing, transactions, capacity planning, backup strategy. Supports ISO 27001/27002/22301-aware design. Resources: execution-protocol.md, document-templates.md, anti-patterns.md, vector-db.md, iso-controls.md, checklist.md, error-playbook.md.

Design

AgentRoleKey Capabilities
oma-designDesign system specialistCreates DESIGN.md with tokens, typography, color systems, motion design (motion/react, GSAP, Three.js), responsive-first layouts, WCAG 2.2 compliance. 7-phase workflow: Setup, Extract, Enhance, Propose, Generate, Audit, Handoff. Enforces anti-patterns (no "AI slop"). Optional Stitch MCP integration. Resources: design-md-spec.md, design-tokens.md, anti-patterns.md, prompt-enhancement.md, stitch-integration.md, plus reference/ directory with typography, color, spatial, motion, responsive, component, accessibility, and shader guides.

Infrastructure, DevOps, and Observability

AgentRoleKey Capabilities
oma-tf-infraInfrastructure-as-codeMulti-cloud Terraform (AWS, GCP, Azure, Oracle Cloud). OIDC-first auth, least privilege IAM, policy-as-code (OPA/Sentinel), cost optimization. Supports ISO/IEC 42001 AI controls, ISO 22301 continuity, ISO/IEC/IEEE 42010 architecture documentation. Resources: multi-cloud-examples.md, cost-optimization.md, policy-testing-examples.md, iso-42001-infra.md, checklist.md.
oma-dev-workflowMonorepo task automationmise task runner, CI/CD pipelines, database migrations, release coordination, git hooks, pre-commit validation. Resources: validation-pipeline.md, database-patterns.md, api-workflows.md, i18n-patterns.md, release-coordination.md, troubleshooting.md.
oma-observabilityIntent-based observability routerMELT+P signal coverage (metrics/logs/traces/profiles/cost/audit/privacy), transport tuning (UDP/MTU, OTLP gRPC vs HTTP, Collector topology, sampling), W3C Trace Context propagation, SLO management and burn-rate alerts, incident forensics (6-dimension localization), meta-observability (self-health, clock sync, cardinality, retention). CNCF-first; Fluentd deprecated (use Fluent Bit or OTel Collector).

Quality and Debugging

AgentRoleKey Capabilities
oma-qaQuality assuranceSecurity audit (OWASP Top 10), performance analysis, accessibility (WCAG 2.1 AA), code quality review. Severity: CRITICAL/HIGH/MEDIUM/LOW with file:line and remediation code. Supports ISO/IEC 25010 quality characteristics and ISO/IEC 29119 test alignment. Resources: execution-protocol.md, iso-quality.md, checklist.md, self-check.md, error-playbook.md.
oma-debugBug diagnosis and fixingReproduce-first methodology. Root cause analysis, minimal fixes, mandatory regression tests, similar pattern scanning. Uses Serena MCP for symbol tracing. Resources: execution-protocol.md, common-patterns.md, debugging-checklist.md, bug-report-template.md, error-playbook.md.

Localization, Coordination, and Git

AgentRoleKey Capabilities
oma-translatorContext-aware translation4-stage translation method: Analyze Source, Extract Meaning, Reconstruct in Target Language, Verify. Preserves tone, register, and domain terminology. Anti-AI pattern detection. Supports batch translation (i18n files). Optional 7-stage refined mode for publication quality. Resources: translation-rubric.md, anti-ai-patterns.md.
oma-orchestratorAutomated multi-agent coordinatorSpawns CLI subagents in parallel, coordinates via MCP memory, monitors progress, runs verification loops. Configurable: MAX_PARALLEL (default 3), MAX_RETRIES (default 2), POLL_INTERVAL (default 30s). Includes agent-to-agent review loop and Clarification Debt monitoring. Resources: subagent-prompt-template.md, memory-schema.md.
oma-scmSoftware configuration management (SCM) + GitHandles branching strategies, merge/rebase/conflict workflows, worktrees, baselines, and release-state tracking. Also generates Conventional Commit messages with safe staging. Co-Author: First Fluke <our.first.fluke@gmail.com>.

Search, Retrospective, and Document Processing

AgentRoleKey Capabilities
oma-searchIntent-based search routerRoutes queries to Context7 (docs), native web search, gh/glab (code), Serena (local). Domain trust scoring on all non-local results. Fail-forward routing (docs→web→fetch). Flags: --docs, --code, --web, --strict, --wide, --gitlab.
oma-recapCross-tool work retrospectiveAnalyzes conversation histories from Claude, Codex, Gemini, Qwen, and Cursor. Resolves natural-language date/window input, groups by tool+session, extracts themes, renders daily/period summaries for standups, weekly retros, and work logs.
oma-hwpHWP/HWPX/HWPML → MarkdownKorean word-processor document conversion via bunx kordoc@latest. Preserves headings, tables (incl. nested), footnotes, hyperlinks, images. Strips Hancom Private Use Area characters via flatten-tables.ts post-processor.
oma-pdfPDF → MarkdownPDF document conversion via uvx opendataloader-pdf. Preserves headings, tables, lists, images; OCR hybrid mode for scanned PDFs; output normalized with uvx mdformat.

Progressive Disclosure Model

oh-my-agent uses a two-layer skill architecture to prevent context window exhaustion:

Layer 1 — SKILL.md (~800 bytes, always loaded): Contains the agent's identity, routing conditions, core rules, and "when to use / when NOT to use" guidance. This is all that is loaded when the agent is not actively working.

Layer 2 — resources/ (loaded on-demand): Contains execution protocols, tech stack references, code snippets, error playbooks, checklists, and examples. These are loaded only when the agent is invoked for a task, and even then, only the resources relevant to the specific task type are loaded (based on the difficulty assessment and task-resource mapping in context-loading.md).

This design saves approximately 75% of tokens compared to loading everything upfront. For flash-tier models (128K context), the total resource budget is approximately 3,100 tokens — just 2.4% of the context window.


.agents/ — The Single Source of Truth (SSOT)

Everything oh-my-agent needs lives in the .agents/ directory:

.agents/
├── config/ # oma-config.yaml
├── skills/ # 22 skill directories (21 agents + _shared)
│ ├── _shared/ # Core resources used by all agents
│ └── oma-{agent}/ # Per-agent SKILL.md + resources/
├── workflows/ # 16 workflow definitions
├── agents/ # 9 subagent definitions
├── results/plan-{sessionId}.json # Generated plan output
├── state/ # Active workflow state files
├── results/ # Agent result files
└── mcp.json # MCP server configuration

The .claude/ directory exists only as an IDE integration layer — it contains symlinks pointing back to .agents/, plus hooks for keyword detection and the HUD statusline. The .serena/memories/ directory holds runtime state during orchestration sessions.

This architecture means your agent configuration is:

  • Portable — switch IDEs without reconfiguring
  • Version-controlled — commit .agents/ alongside your code
  • Shareable — team members get the same agent setup

Supported IDEs and CLI Tools

oh-my-agent works with any AI-powered IDE or CLI that supports skill/prompt loading:

ToolIntegration MethodParallel Agents
Claude CodeNative skills + Agent toolTask tool for true parallelism
Gemini CLISkills auto-loaded from .agents/skills/oma agent:spawn
Codex CLISkills auto-loadedModel-mediated parallel requests
Antigravity IDESkills auto-loadedoma agent:spawn
CursorSkills via .cursor/ integrationManual spawning
OpenCodeSkills loadingManual spawning

Agent spawning adapts to each vendor automatically via the vendor detection protocol, which checks for vendor-specific markers (e.g., the Agent tool for Claude Code, apply_patch for Codex CLI).


Skill Routing System

When you send a prompt, oh-my-agent determines which agent handles it using the skill routing map (.agents/skills/_shared/core/skill-routing.md):

Domain KeywordsRouted To
API, endpoint, REST, GraphQL, database, migrationoma-backend
auth, JWT, login, register, passwordoma-backend
UI, component, page, form, screen (web)oma-frontend
style, Tailwind, responsive, CSSoma-frontend
mobile, iOS, Android, Flutter, React Native, appoma-mobile
bug, error, crash, broken, slowoma-debug
review, security, performance, accessibilityoma-qa
UI design, design system, landing page, DESIGN.mdoma-design
brainstorm, ideate, explore, ideaoma-brainstorm
plan, breakdown, task, sprintoma-pm
automatic, parallel, orchestrateoma-orchestrator

For complex requests that span multiple domains, routing follows established execution orders. For example, "Create a fullstack app" routes to: oma-pm (plan) then oma-backend + oma-frontend (parallel implementation) then oma-qa (review).


HUD Statusline

When running in Claude Code, oh-my-agent displays a persistent status indicator [OMA] in the status bar showing:

  • Model name (e.g., Opus, Sonnet)
  • Context usage with color coding (green < 70%, yellow 70-85%, red > 85%)
  • Active workflow state (if a persistent workflow is running)

The HUD is powered by .claude/hooks/hud.ts using Claude Code's statusLine hook feature.


Automatic Workflow Detection

You do not need to type /command to trigger workflows. oh-my-agent's UserPromptSubmit hook scans your natural language input against keyword triggers defined in .claude/hooks/triggers.json — supporting 11 languages (English, Korean, Japanese, Chinese, Spanish, French, German, Portuguese, Russian, Dutch, Polish).

  • Actionable input (e.g., "plan the auth feature") → automatically loads the workflow
  • Informational input (e.g., "what is orchestrate?") → filtered out, no workflow triggered
  • Explicit /command → hook skips detection to avoid duplication
  • Persistent workflows reinject context on every message until you say "workflow done"

Cross-Vendor Support

oh-my-agent is not limited to Claude Code. The hook system supports:

VendorIntegration
Claude CodeNative hooks (UserPromptSubmit, Notification, statusLine)
Gemini CLISkills auto-loaded from .agents/skills/, agent spawning via oma agent:spawn
Codex CLISkills auto-loaded, model-mediated parallel requests
Qwen CodeHook support for workflow detection

Vendor detection happens automatically — agents adapt their spawning method based on the detected runtime environment.


What is Next

  • Installation — Three install methods, presets, CLI setup, and verification
  • Agents — Deep dive into all 21 agents and charter preflight
  • Skills — The two-layer architecture explained
  • Workflows — All 16 workflows with triggers and phases
  • Usage Guide — Real examples from single tasks to full orchestration