
grill-me-codex

Two AI models harden your plan before you write a line of code. grill-me-codex is a family of Claude Code skills that closes the two gaps in AI-assisted coding: the gap between you and Claude (do we agree on what to build?) and the gap between Claude and the quality of what it produces (is the plan actually correct, and how would you even know?).

Act 1 grills you to lock the plan. Act 2 hands that plan to OpenAI Codex — a rival, cross-provider model — which adversarially tears it apart over several rounds until both models sign off. Only then does code get written.

Why a second model? Because the same model that plans the build and writes the build can't be trusted to grade its own work — it's an echo chamber. A different provider catches what Claude structurally can't see in itself.

Built on Matt Pocock's grill-me / grill-with-docs skills (MIT) — Act 1 is his work; the iterative cross-model Codex review (Act 2) is the addition.

🚀 Want to go deeper?

This skill is one piece of the full system. Get the Claude Code Masterclass and join builders going further with agentic AI inside Chase AI+.

The skills

| Skill | Act 1 | Act 2 | Use when |
| --- | --- | --- | --- |
| grill-me-codex | Claude interrogates you one question at a time until the decision tree is resolved | Codex adversarial review loop | Planning from scratch + want alignment AND a second-model check |
| grill-with-docs-codex | Same, but challenges your plan against your project's CONTEXT.md glossary + writes ADRs inline | Codex review loop | Same, in a project with established terminology / architecture decisions |
| codex-review | n/a (you already have a plan) | Codex review loop | You have a plan and just want the cross-model stress-test |

How Act 2 works

  1. Claude writes the locked plan to PLAN.md and starts a log at PLAN-REVIEW-LOG.md.
  2. Round 1: Codex reviews the plan in a read-only sandbox and returns VERDICT: APPROVED or VERDICT: REVISE.
  3. Rounds 2..N: Claude revises; the same Codex session is resumed so it remembers its prior critiques and only checks whether they're addressed.
  4. The loop is bounded by MAX_ROUNDS (default 5) and terminates on APPROVED or at the cap. Hitting the cap is flagged as a deadlock, which beats a fake "approved".
  5. You gate twice only: kickoff, and final sign-off before any code. Codex is read-only every round and never writes a file.

Two artifacts: PLAN.md (the clean final plan — the what) and PLAN-REVIEW-LOG.md (the full round-by-round argument — the why).
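
The round structure above can be sketched as a bounded loop. This is a minimal Python illustration, not the skills' actual implementation: `reviewer` and `reviser` are hypothetical callables standing in for the Codex review call and Claude's revision step, `ReviewResult` is an invented container, and treating a reply with no verdict line as "not approved" is an assumption.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List, Optional

MAX_ROUNDS = 5  # default cap, as listed under Tunables

# Matches a line such as "VERDICT: APPROVED" or "VERDICT: REVISE".
VERDICT_RE = re.compile(r"^VERDICT:\s*(APPROVED|REVISE)\s*$", re.MULTILINE)


def parse_verdict(review_text: str) -> Optional[str]:
    """Return the last verdict found in the review, or None if there is none."""
    matches = VERDICT_RE.findall(review_text)
    return matches[-1] if matches else None


@dataclass
class ReviewResult:
    status: str  # "APPROVED" or "DEADLOCK"
    rounds: int
    plan: str
    log: List[str] = field(default_factory=list)


def review_loop(
    plan: str,
    reviewer: Callable[[str, int], str],  # hypothetical: (plan, round) -> critique
    reviser: Callable[[str, str], str],   # hypothetical: (plan, critique) -> new plan
    max_rounds: int = MAX_ROUNDS,
) -> ReviewResult:
    log: List[str] = []
    for round_no in range(1, max_rounds + 1):
        critique = reviewer(plan, round_no)
        verdict = parse_verdict(critique)
        log.append(f"Round {round_no}: {verdict or 'NO VERDICT'}\n{critique}")
        if verdict == "APPROVED":
            return ReviewResult("APPROVED", round_no, plan, log)
        # Assumption: REVISE and a missing verdict both trigger another revision.
        if round_no < max_rounds:
            plan = reviser(plan, critique)
    # Cap reached without approval: report a deadlock instead of faking approval.
    return ReviewResult("DEADLOCK", max_rounds, plan, log)


# Usage with stand-in callables (no real model is called here).
def fake_reviewer(plan: str, round_no: int) -> str:
    if "rollback" in plan:
        return "Looks complete.\nVERDICT: APPROVED"
    return "No rollback strategy.\nVERDICT: REVISE"


def fake_reviser(plan: str, critique: str) -> str:
    return plan + "\n- Add rollback step"


result = review_loop("- Deploy service", fake_reviewer, fake_reviser)
```

In this sketch `result.plan` plays the role of PLAN.md and `result.log` the role of PLAN-REVIEW-LOG.md: one holds only the final plan, the other keeps every round's critique.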

Install

Copy the skill folders into your Claude Code skills directory:

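# macOS / Linux (create the skills directory first if it doesn't exist)
mkdir -p ~/.claude/skills
cp -r skills/* ~/.claude/skills/

# Windows (PowerShell)
New-Item -ItemType Directory -Force "$env:USERPROFILE\.claude\skills" | Out-Null
Copy-Item -Recurse skills\* "$env:USERPROFILE\.claude\skills\"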

Then invoke in Claude Code: /grill-me-codex, /grill-with-docs-codex, or /codex-review.

Prerequisites

  • Codex CLI ≥ 0.130: install or update with npm install -g @openai/codex@latest (older versions error on the default gpt-5.5 model).
  • Authenticated Codex — run codex login once (a ChatGPT account works; Free/Plus/Pro/Max all fine).
  • Don't pin a model — ChatGPT-account auth rejects gpt-5.x-codex model variants; the skills use your config default.
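
To check the version prerequisite before invoking a skill, something like the following could work. It is a sketch under assumptions: it assumes `codex --version` prints a string containing a dotted version number (for example `codex-cli 0.130.0`), and the helper names are invented for illustration.

```python
import re
import shutil
import subprocess
from typing import Optional, Tuple

MIN_VERSION = (0, 130)  # minimum from the prerequisites list


def parse_version(text: str) -> Optional[Tuple[int, ...]]:
    """Extract the first dotted version number (X.Y or X.Y.Z) from text."""
    match = re.search(r"(\d+)\.(\d+)(?:\.(\d+))?", text)
    if not match:
        return None
    return tuple(int(part) for part in match.groups() if part is not None)


def meets_minimum(text: str, minimum: Tuple[int, ...] = MIN_VERSION) -> bool:
    """True if the version found in text is at least `minimum`."""
    version = parse_version(text)
    return version is not None and version >= minimum


def installed_codex_version() -> Optional[str]:
    """Return the raw `codex --version` output, or None if codex is not on PATH."""
    if shutil.which("codex") is None:
        return None
    # Assumption: the CLI supports --version and prints the version to stdout.
    completed = subprocess.run(
        ["codex", "--version"], capture_output=True, text=True, check=False
    )
    return completed.stdout.strip() or None
```

For example, `meets_minimum("codex-cli 0.129.4")` is false and `meets_minimum("codex-cli 0.130.0")` is true, because Python compares the parsed tuples element by element.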

Tunables

| Var | Default | Meaning |
| --- | --- | --- |
| MAX_ROUNDS | 5 | Hard cap on review rounds |
| PLAN_FILE | PLAN.md | Where the plan lives |
| LOG_FILE | PLAN-REVIEW-LOG.md | The argument transcript |

To override a default, pass it when invoking the skill, e.g. rounds=3.
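
The skills are interpreted by Claude Code, so there is no parser to call, but the mapping from invocation arguments to tunables can be illustrated with a small sketch. Only `rounds` appears in this README; the `plan` and `log` keys, the `Tunables` class, and `apply_overrides` are hypothetical names used for illustration.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Tunables:
    max_rounds: int = 5
    plan_file: str = "PLAN.md"
    log_file: str = "PLAN-REVIEW-LOG.md"


# "rounds" comes from the README; "plan" and "log" are hypothetical keys.
ALIASES = {"rounds": "max_rounds", "plan": "plan_file", "log": "log_file"}


def apply_overrides(args: str, base: Tunables = Tunables()) -> Tunables:
    """Apply space-separated key=value overrides on top of the defaults."""
    changes = {}
    for token in args.split():
        key, sep, value = token.partition("=")
        if not sep or key not in ALIASES:
            raise ValueError(f"unrecognised override: {token!r}")
        field_name = ALIASES[key]
        # Only the round cap is numeric; the file names stay as strings.
        changes[field_name] = int(value) if field_name == "max_rounds" else value
    if changes.get("max_rounds", 1) < 1:
        raise ValueError("rounds must be at least 1")
    return replace(base, **changes)


settings = apply_overrides("rounds=3")
```

Unknown keys raise an error rather than being ignored, so a typo such as `round=3` cannot silently leave the default cap in place.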

Safety

Codex runs read-only every round: -s read-only on the first call, -c sandbox_mode="read-only" on every resume (the resume subcommand doesn't accept -s, and without forcing read-only it would inherit your config.toml sandbox default, which may be danger-full-access). The skills handle this for you. No code is ever written until you approve the final plan.
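
The difference between the first call and a resume can be sketched as two argument lists. The `-s read-only` and `-c sandbox_mode="read-only"` flags come from the paragraph above; the `codex exec` entry point, the position of the session ID and prompt, and the `codex_review_command` helper are assumptions for illustration, so check `codex --help` before relying on the exact order.

```python
from typing import List, Optional


def codex_review_command(prompt: str, session_id: Optional[str] = None) -> List[str]:
    """Build the argv for one review round (assumed layout, not verified)."""
    if session_id is None:
        # First round: the sandbox flag is accepted directly.
        return ["codex", "exec", "-s", "read-only", prompt]
    # Resume: -s is not accepted, so force the sandbox via a config override;
    # otherwise the config.toml default would apply.
    return [
        "codex", "exec", "resume",
        "-c", 'sandbox_mode="read-only"',
        session_id,  # hypothetical placeholder for the resumed session
        prompt,
    ]


first = codex_review_command("Review PLAN.md")
later = codex_review_command("Check the revised PLAN.md", session_id="SESSION_ID")
```

Building the command as a list (rather than a shell string) keeps the quotes inside `sandbox_mode="read-only"` intact when it is passed to a process launcher such as `subprocess.run`.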

Credits

License

MIT — see LICENSE.
