PlanGate

PlanGate Model Profiles — 仕様 + マトリクス

Status: v2（#197 / PBI-HI-003 で edit_interface_preference / retry_strategy / context_acquisition / provider_capabilities / telemetry_tags を追加。v1 は後方互換維持＝schema version は 1/2 両対応）本体定義: docs/ai/model-profiles.yaml Schema: schemas/model-profile.schema.json Interface preflight: docs/working/PBI-116/interface-preflight.md 関連: docs/ai/core-contract.md（Phase 1 / 実行契約の正本）

1. 目的

実行モデルごとの差分（reasoning effort / verbosity / context budget / tool policy / validation depth / adapter）を 薄い設定層に閉じ込める。PlanGate Core / Gate / Artifact schema はモデル非依存で維持 し、モデル変更時の影響を本ファイルと参照層に限定する。

2. 不変ポリシー（モデル別に変更しない）

以下は すべてのプロファイルで共通。Model Profile では変更しない:

Core Contract（core-contract.md Hard constraints / Iron Law 8 項目 / Stop rules / Output discipline）
Gate 条件（C-3 / C-4 / Parent C-3 / Parent Integration）
Artifact schema（plan / handoff / status / approvals 等）
AI 運用 4 原則（project-rules.md F セクション）

3. プロファイル一覧（4 件）

プロファイル	family	role	adapter
`gpt-5_5`	gpt-5	default_reasoning	outcome_first
`gpt-5_5_pro`	gpt-5-pro	advanced_reasoning	outcome_first_strict
`gpt-5_mini`	gpt-5-mini	fast_lightweight	explicit_short
`legacy_or_unknown`	legacy	unknown	legacy_or_unknown

4. reasoning_effort × mode × profile マトリクス

PlanGate 5 mode 別の推奨 reasoning effort:

PlanGate mode	gpt-5_5	gpt-5_5_pro	gpt-5_mini	legacy_or_unknown
ultra-light	low	low	low	low
light	low	medium	low	low
standard	medium	high	low	medium
high-risk	high	high	medium	high
critical	high	high (xhigh 可)	disallowed	high

補足

gpt-5_5_pro の critical で xhigh を選択する場合、allowed_efforts: [low, medium, high, xhigh] により構造的に許容
gpt-5_mini は disallowed_modes: [critical] で critical mode を実効的に禁止（高リスクタスクを軽量モデルで実行しない）

5. verbosity_by_phase × profile

phase 別の出力 verbosity（low / medium / high）:

Phase	gpt-5_5	gpt-5_5_pro	gpt-5_mini	legacy_or_unknown
classify	low	low	low	low
plan	medium	medium	low	medium
execute	low	low	low	medium
review	medium	high	low	medium
handoff	medium	medium	low	medium

6. context_budget_policy

3 段階。max_context_policy フィールドで指定:

値	用途	コスト	プロファイル例
`compact`	トークン節約優先、軽量タスク	低	gpt-5_mini
`standard`	標準、汎用	中	gpt-5_5 / legacy_or_unknown
`expanded`	長文・大規模コンテキスト	高	gpt-5_5_pro

7. tool_policy（PBI-116-06 接続）

interface-preflight.md で合意した値域:

値	意味	射影先
`narrow`	限定 tool セット	PBI-116-06 `docs/ai/tool-policy.md` で定義
`allowed_tools_by_phase`	phase 別 allowed tools リスト	同上
`expanded`	広範 tool セット	同上

8. validation_bias（PBI-116-06 接続）

値	意味	追加 Hook 条件
`lenient`	軽い検証	なし
`normal`	標準検証	なし
`strict`	厳格検証	PBI-116-06 `docs/ai/hook-enforcement.md` で定義（最低 3 件）

8-bis. retry_strategy（Tool Error Taxonomy 接続 / #203）

retryable な tool error（tool-error-taxonomy.md §4）の自動再試行方針。profile 別に値を持てる（未定義時は保守的既定）。

キー	意味	既定（未定義時）
`max_retries`	retryable の自動再試行上限。超過で soft_warning 降格	`1`
`backoff`	`provider_timeout` 等の待機方針（none/linear/exponential）	`none`

どの category が retry 対象かは tool-error-taxonomy.md §2/§3 が正本。本節は 回数・待機の値域のみを定義（責務分離）。
release_blocker 分類（schema_validation/missing_context/stale_contract）は retry しない（max_retries に関わらず即停止）。
profile 拡張・値変更は §10 の方針（別 PBI / eval 根拠）に従う。

8-ter. v2 拡張フィールド（#197 / PBI-HI-003）

モデル差分をプロンプト全文 fork ではなく Model Profile に閉じ込めるための v2 追加（すべて任意フィールド・既存 v1 profile は無変更で valid）。 retry_strategy の Tool Error Taxonomy 接続詳細は §8-bis（#269）を正とし、本節は v2 全 5 フィールドの値域・移行・privacy を扱う（重複定義しない）。

フィールド	内容	値域
`edit_interface_preference`	モデルが得意な編集形式	`primary`/`fallback` ∈ {patch, string_replace, full_file}（primary 必須）
`retry_strategy`	失敗時の回復方針・回数・待機	`on_edit_failure`{reread_then_retry_small_diff,retry_same,escalate} / `on_test_failure`{inspect_error_then_fix,retry_same,escalate} / `max_retries` 0-10 / `backoff`{none,linear,exponential}
`context_acquisition`	初期コンテキスト取得戦略	`strategy`{dynamic_first,eager,lazy}（必須）/ `initial_context_budget`{compact,standard,expanded}
`provider_capabilities`	provider 機能可否（判定根拠）	`supports_patch` / `supports_string_replace` / `supports_parallel_subagents`（bool）
`telemetry_tags`	公開安全な provider/世代ラベル	`provider` ∈ {openai, anthropic, claude_code, codex, unknown} / `generation` ∈ {gpt-5.5, gpt-5.5-pro, gpt-5-mini, legacy, unknown}（固定 enum・free-form 不可。鍵/アカウントID/社内モデル名を構造排除＝metrics-privacy §4/§5.4）

retry_strategy と Tool Error Taxonomy の責務分離

どの category が retry 対象か は tool-error-taxonomy.md （#203・main 在席）が正本。回数・待機の値域は本書 §8-bis（#269）。
本 retry_strategy は 回数（max_retries）・待機（backoff）・方針のみを profile 別に定義（二重定義しない）。release_blocker 分類（schema_validation / missing_context / stale_contract）は max_retries に関わらず retry しない（taxonomy §4）。
max_retries 未指定時は保守的既定 1。
on_*: escalate のとき max_retries は適用しない（即エスカレーション。 legacy_or_unknown は escalate + max_retries: 0 で安全側を明示）。

telemetry_tags の registry（privacy）

provider / generation は 固定 enum registry（schema enum）。 free-form を許さないことで metrics-privacy.md §4 の鍵 / アカウント ID / 社内モデル名混入を構造的に排除する。新 provider / 世代の追加は本 schema の enum 追加（versioning-stability-policy §2.1 = minor・additive）として行い、free-form 化はしない。

context_acquisition の位置づけ

strategy は将来の Dynamic Context Engine（#199）の 判定根拠であり、本 PBI では DCE 実装はしない（Non-goal）。

移行と後方互換

既存 4 profile を v2 へ移行済（docs/ai/model-profiles.yaml version: 2）。
legacy_or_unknown fallback は維持（安全側: escalate / lazy / patch 非対応）。
schema version は 1/2 両対応（v1 設定は無変更で valid＝ versioning-stability-policy.md §2.1 オプショナル追加 = minor）。
Core Contract / Gate / Artifact schema はモデル別に変更しない（v2 でも不変）。
v1→v2 の比較は #196 eval comparison（--harness-compare・main 在席）で v1 baseline と対比可能（profile を harness_metadata に記録）。

9. critical mode 禁止の運用

gpt-5_mini は disallowed_modes: [critical] を持つ。これにより:

critical mode タスクで gpt-5_mini プロファイルを選択した場合、実行前に block すべき
block の実装は本 PBI scope 外（runtime / Hook で実装、PBI-116-06 / 別 PBI で対応）
本 schema は 判定根拠 を提供する

10. プロファイル拡張・追加方針

新規モデルのプロファイル追加は 別 PBI で実施
本 PBI では 4 プロファイルに限定
既存プロファイルの値変更は eval (PBI-116-05) の結果に基づく

11. Claude Code エージェントの model tier（2026-06-10 / 正本）

本ファイル §1〜10 は Codex（GPT 系）実行モデルの設定層。Claude Code 側のエージェント別モデル選択は .claude/agents/*.md の frontmatter model: が機構で、以下の 2 tier を正本とする（haiku は積極採用しない方針）:

Tier	frontmatter	基準	対象
定型・構造化	`model: sonnet`	出力が機械検証（lint 再実行・テスト実行・schema 検証）に裏付けられる定型タスク、読み取り中心の調査・文書化	explorer-agent / linter-fixer / retrospective-analyst / setup-coordinator / documentation-writer / skill-designer
判断系	`model: inherit`（メイン会話に追従）	計画・設計・実装・レビュー・ゲート判定など誤りが下流に波及する深い推論。メインを Fable/Opus で運用すればそのまま最上位モデルが適用される	orchestrator / workflow-conductor / requirements-analyst / solution-architect / spec-writer / implementation-agent / implementer / qa-reviewer / acceptance-tester（V-1 false PASS 防止のため定型側にしない） / code-optimizer / project-planner

GPT 系（Codex CLI）への写像

Codex 側はモデル切替でなく .codex/agents/*.toml の model_reasoning_effort で同一 tier を表現する: 定型・構造化 = low / 判断系 = medium （OpenAI 公式ガイダンス: medium は GPT-5.5 のデフォルト推奨開始点、low は「効率的 reasoning」で機械検証に裏付けられた定型タスクに適合。minimal は multi-step / tool-heavy には不適のため不採用）。

エージェント toml の静的値としては high / xhigh を採用しない（2026-06-10 ユーザー判断: GPT-5.5 の high はトークン消費が大きい）。§4 の mode×effort 行列が high_risk/critical で high を推奨する箇所は run 単位の参照値であり、エージェント toml の静的値が優先される。 critical run で個別に引き上げたい場合は人間判断で当該 run のみ調整する。

運用上の不変条件

plugin 配布版は model: inherit に正規化する（sync-plugin-plangate.sh が agents コピー時に自動正規化）。tier 指定は本リポジトリ運用向けであり、利用者環境のモデル可用性に依存する設定を OSS 配布物に持ち込まない
.claude/agents/*.md は Hardening Override 対象のため、tier 変更の適用は apply スクリプト（scripts/apply-agent-model-tiers.sh）+ Human 実行
frontmatter の許容値は inherit | sonnet | opus | haiku | fable | フルモデルID。本リポジトリでは inherit / sonnet のみ使用（tests で機械検査）
本 tier（定型/判断系）は .claude/agents/README.md の「コア/支援」2 層構成とは別軸（例: linter-fixer はコアだが定型 tier、project-planner は支援だが判断系）
再評価条件: メイン運用モデルの世代交代時、または定型 tier 群で fix loop / 再試行の増加が観測された時（計測基盤は #520 系 follow-up 参照）

This site is open source. Improve this page.