GitHub - garrytan/gstack: Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA
Service

GitHub - garrytan/gstack: Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

garrytan
2026.04.04
·GitHub·by 권준호
#Agent#AI#Developer Tools#Open Source#Productivity

Key Points

  • 1gstack is an open-source "software factory" developed by Garry Tan that utilizes AI agents, primarily Claude Code, to simulate a comprehensive virtual engineering team and development workflow.
  • 2It provides 31 specialized AI skills, organized into a structured sprint process covering ideation, planning, building, reviewing, testing, and shipping software, significantly enhancing productivity.
  • 3By automating and streamlining the entire software development lifecycle, gstack enables individual builders to achieve an output comparable to a large team, and it is freely available under an MIT license.

gstack is an open-source software factory designed to augment individual developer productivity to that of a full engineering team, leveraging large language models (LLMs) like Anthropic's Claude Code, OpenAI's Codex, and Google's Gemini. The core premise is that a single builder, equipped with the right AI tooling, can achieve output comparable to a team of 20, as demonstrated by the creator, Garry Tan, shipping over 600,000 lines of production code in 60 days.

The system's methodology is structured around a "virtual engineering team" of 23 specialized AI agents and 8 power tools, orchestrated through a process mimicking a professional software sprint: Think → Plan → Build → Review → Test → Ship → Reflect. Each stage is managed by specific AI "specialists" implemented as slash commands, ensuring a sequential flow where the output of one skill feeds into the next, minimizing oversight and integrating quality assurance throughout the development lifecycle.

Core Methodology and Technical Details:

  1. Role-Based Task Execution: gstack abstracts complex development tasks into specific roles. For instance:
    • /office-hours (YC Office Hours): Initiates product ideation, challenging user framing, generating implementation alternatives, and producing a structured design document for downstream skills.
    • /plan-ceo-review (CEO/Founder): Conducts strategic reviews, refining product scope through modes like Expansion, Selective Expansion, Hold Scope, or Reduction.
    • /plan-eng-review (Eng Manager): Locks in architectural decisions, generating ASCII diagrams for data flow, state machines, and error paths, alongside defining test matrices and security considerations.
    • /plan-design-review (Senior Designer): Evaluates design dimensions (0-10 scale), identifies "AI slop," and iteratively refines plans through interactive prompts.
    • /review (Staff Engineer): Performs static analysis to catch subtle bugs that pass conventional CI, auto-fixes obvious issues, and identifies completeness gaps.
    • /qa (QA Lead): Operates a real Chromium browser via Playwright to interact with web applications. It navigates flows, identifies bugs, and performs atomic commits to fix them, concurrently generating regression tests. The browser can run in headless mode or as a headed window (/connect-chrome) for live observation. Authenticated sessions are managed via cookie import (/setup-browser-cookies).
    • /cso (Chief Security Officer): Conducts automated security audits based on OWASP Top 10 and STRIDE threat models, filtering false positives and generating concrete exploit scenarios.
    • /ship (Release Engineer): Automates the release process, including syncing with main, running tests (bootstrapping test frameworks if none exist), auditing code coverage, pushing changes, and creating pull requests.
    • /land-and-deploy (Release Engineer): Manages the full deployment pipeline from PR merge, waiting for CI/CD completion, to verifying production health.
    • /document-release (Technical Writer): Automatically updates project documentation (e.g., README, ARCHITECTURE.md) by cross-referencing code diffs, ensuring documentation consistency.
  1. Multi-Agent Orchestration and Collaboration:
    • Skill Chaining: Skills are designed to feed into each other, creating a seamless workflow. For example, the design doc from /office-hours is consumed by /plan-ceo-review, and the test plan from /plan-eng-review guides /qa.
    • Parallel Sprints: The "Conductor" component enables running 10-15 parallel Claude Code sessions, each in an isolated workspace, managing multiple development tasks concurrently (e.g., one session brainstorming, another reviewing a PR, a third fixing a bug). This parallelism is enabled by the structured sprint process, where each agent's scope and stopping conditions are well-defined.
    • Cross-Model Analysis (/codex): Integrates OpenAI's Codex CLI for an independent second opinion on code, offering review, adversarial challenge, and open consultation modes. When both Claude's /review and Codex's /codex are used, a cross-model analysis highlights overlapping and unique findings.
  1. Safety and Learning Mechanisms:
    • Safety Guardrails: Commands like /careful (warns before destructive commands such as rm -rf or DROP TABLE), /freeze (restricts edits to a specific directory), and /guard (combines both) prevent accidental system-level modifications.
    • Learning and Memory (/learn): Manages knowledge learned across sessions, including project-specific patterns, pitfalls, and preferences, allowing gstack to become more "intelligent" about a codebase over time.
  1. Technical Implementation Details:
    • Browser Automation: Uses Playwright for real browser interaction, enabling accurate UI testing and bug reproduction. The system supports both headless and headed browser modes, with an integrated Chrome extension Side Panel for live activity feeds and direct natural language instructions to a child Claude instance for browser tasks.
    • Installation and Configuration: Leverages git clone and a setup script. It integrates with various agents by conforming to the SKILL.md standard. Skills reside in .claude/skills/ or .agents/skills/ (for Codex-compatible hosts), allowing for both global and repo-local installations without affecting system PATH.
    • Telemetry (Opt-in): Collects anonymous usage data (skill name, duration, success/fail, version, OS) through Supabase, with strict privacy controls preventing collection of sensitive user or code information. Local analytics are always available from a JSONL file.

gstack represents a paradigm shift from simple copilots to a sophisticated, autonomous, and orchestratable virtual engineering team, designed to empower individual builders to achieve vastly scaled software development output.