[Plan 2026-04] AI quality/strength/diversity/production tracking

## Overview

Tracking issue for the plan at [docs/planning/AI_QUALITY_STRENGTH_DIVERSITY_PLAN_2026-04-16.md](../blob/main/docs/planning/AI_QUALITY_STRENGTH_DIVERSITY_PLAN_2026-04-16.md).

## Week 1 issues

- [x] #78 — A1 Per-seat WR tracking in quality gate (shipped 98736c566; deployed to gh200-13)
- [x] #79 — A2 Plateau detector (shipped 24bf557b9)
- [x] #80 — **C2 Expose personas in production UI** (shipped 5aa0fe3be + 186a74bd5 + 1d9393a9d; **flag-gated; ready to flip**)
- [x] #81 — D1 Model-version telemetry on /ai/move (shipped cbcd73baa; live in production)
- [x] #82 — D5 Silent-fallback observability (shipped 6ec5c8e82; live in production)

## Review follow-ups (closed)

- [x] #83 Circuit breaker half-open concurrency (shipped f7df4cd8b)
- [x] #84 Plateau detector in-memory history (shipped 81d32957c)
- [x] #85 CircuitBreaker state-machine consolidation (shipped f7df4cd8b together with #83)
- [x] #86 getLocalFallbackMove tier context (shipped 61621efd3)

## Other work shipped along the way

- A3 training-probe model_version regression test (b714c63e7)
- Security sweep 19 → 4 LOW npm findings (30c73b6a0)
- v5-heavy compatibility fixes: bootstrap (c9a43020c) + runtime infer (5764c2656)
- Test-stub unblock after game-granular resume (bcbf39563)
- Evidence-artifact automation (787f4a278 by Codex)

## Week 2–3 next

- B2 v5-heavy pilot — running on gh200-11 now (iter 1 selfplay post-fix)
- C1 Ensemble serving for D9–D10 tiers
- C3 Varied multiplayer seating — leverages the personaIds[] array C2 shipped
- D2 Hot reload for new checkpoints
- B3 Seat-stratified value loss (if gh200-12 iter 26 seat_wr confirms imbalance)

## Success metrics

- At least one config above 2000 Elo within 8 weeks
- square8_3p above 1600 Elo within 4 weeks
- Production serves ≥ 4 distinguishable personas — **ready via C2; pending flag flip**
- p95 inference latency within per-tier SLO
- Fallback rate < 1% under normal operation (baseline observable via D5 telemetry)

## Related

- Plan doc: [docs/planning/AI_QUALITY_STRENGTH_DIVERSITY_PLAN_2026-04-16.md](../blob/main/docs/planning/AI_QUALITY_STRENGTH_DIVERSITY_PLAN_2026-04-16.md)
- Security triage: [docs/security/npm_audit_triage_2026-04-17.md](../blob/main/docs/security/npm_audit_triage_2026-04-17.md)
- Independent review: [docs/reviews/week1_commits_review_2026-04-17.md](../blob/main/docs/reviews/week1_commits_review_2026-04-17.md)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Plan 2026-04] AI quality/strength/diversity/production tracking #77

Overview

Week 1 issues

Review follow-ups (closed)

Other work shipped along the way

Week 2–3 next

Success metrics

Related

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

[Plan 2026-04] AI quality/strength/diversity/production tracking #77

Description

Overview

Week 1 issues

Review follow-ups (closed)

Other work shipped along the way

Week 2–3 next

Success metrics

Related

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions