Sprint 119 — Phase D: audit-queue dispatch + P0 Auto-Fix E2E Demonstration

Background

Sprint 118 identified 591 findings (P0:17, P1:279, P2:289, Low:6) via full Critic audit and queued P0 17 + P1 5 = 22 items to ~/.claude/oracle/audit-queue/ via oracle-audit-triage.sh. However, no mechanism existed to actually consume the queue — oracle-spawn.sh only supported manual 3-argument invocation (AGENT/TASK_ID/DESC).

Sprint 119's goal is to demonstrate the E2E loop from queue → spawn → agent fix commit → auto-Critic re-verification PASS, validating a pipeline capable of batch-processing the remaining 574 findings in Sprint 120+.

Goals

Phase	Content	Status
A	Create `oracle-audit-dispatch.sh` (queue consumer script)	✅ Complete
B	Sequential P0 17-item dispatch + auto-Critic re-verification E2E	✅ 13/17 complete (2 FP, 2 in auto-loop)
C	P1 5-item dispatch (measure pipeline throughput)	✅ 3/5 complete (2 in auto-loop)
D	Identify frontend missing files (132 unaudited files confirmed)	✅ Analysis complete
E	auto-Critic tmux dispatch E2E verification (Sprint 117 carry-over)	✅ Complete
F	Sprint 119 ADR + Sprint 120+ roadmap	✅ Complete

Key Decisions (D1~D5)

D1. P0 routing redistribution — end architect monopoly

Selected: Redistribute security category to service-based domain agents. Alternative: architect monopoly on 16 items (Sprint 118 original triage routing). Rationale: (1) architect is a design reviewer, not a code modifier, (2) domain agents understand the respective service code best, (3) auto-Critic re-verification guarantees quality, so the fixing agent's domain expertise matters more.

Redistribution: postman 7 (gateway+identity) / curator 3 (submission) / palette 3 (frontend) / librarian 2 (problem) / herald 1 (github-worker) / sensei 1 (ai-analysis)

D2. New file `oracle-audit-dispatch.sh` — maintain oracle-spawn.sh SRP

Selected: Dedicated script for queue consumption. oracle-audit-triage.sh (queue creation) ↔ oracle-audit-dispatch.sh (queue consumption) symmetric structure. Rationale: oracle-spawn.sh focuses on single agent spawn. Queue iteration, status transitions, and re-routing are separate concerns.

D3. Semi-automatic batch mode — full auto-chain deferred to Sprint 120

Selected: Manually run oracle-audit-dispatch.sh for batch dispatch. Agent completes → manually re-run. Rationale: Full auto-chain during E2E demonstration makes debugging difficult on failure. Verify first, then connect cleanup trap in Sprint 120.

D4. Frontend 132 missing files — prioritize 10 security-critical files first

Selected: Prioritize 10 security-relevant important files (middleware.ts, guards.ts, api/auth.ts, etc.) instead of full re-audit of 132 files. Rationale: Most missing files are error/loading/layout boilerplate + shadcn/ui wrappers. Security risks concentrate in auth/session/event tracking files.

D5. Fix spawn collision — introduce pane idle detection logic

Discovery: oracle-spawn.sh's pane_count <= 1 condition reused the same pane in rapid sequential dispatches, overwriting previous runners. Only 1 per-tier ran in parallel in the first batch, rest ran sequentially. Fix: Detect idle shell (zsh) via pane_current_command, always split if pane is busy. (However, bash runner also has "bash" command, so full resolution is Sprint 120 TODO.)

Execution Results

First Batch (6 dispatched)

Finding	Agent	Result	Commit	Fix Content	Tests
p0-001	sensei	✅ Fixed	`adc9d42`	internal_api_key required field + field_validator	5 passed
p0-002	palette	✅ False positive	—	AnalysisView duplicate import doesn't exist	—
p0-005	postman	✅ Fixed	`235d5f4`	Extract userId from JWT payload, block x-user-id spoofing	761 passed
p0-007	herald	✅ Fixed	`b2d4638`	Force private:true + convert existing public repos	25 passed
p0-013	librarian	✅ Fixed	`28feb63`	ParseStudyIdPipe UUID validation + double defense	165 passed
p0-015	curator	✅ Fixed	`8918ca1`	New GatewayContextMiddleware, switch to request.user.userId	257 passed

2nd–4th Batches + Auto-Loop (15 additional processed)

Finding	Agent	Result	Commit	Fix Content
p0-003	palette	✅ Fixed	`e790ec3`	sourceUrl XSS block — sanitizeUrl
p0-004	palette	✅ Fixed	`9dea2b0`	router.push link validation XSS/phishing prevention
p0-006	postman	✅ Fixed	`1434b0d`	IDOR getSharedAnalysis studyId validation
p0-008	postman	✅ Fixed	`710f6cf`	PostgreSQL TLS rejectUnauthorized=true
p0-009	postman	✅ Fixed	`7620151`	GitHub token AES-256-GCM encryption
p0-014	librarian	✅ False positive	—	Already fixed in p0-013
p0-016	curator	✅ Fixed	`a1db792`	Idempotency key userId-included 3-tuple scoping
p0-017	curator	✅ Fixed	`1a15672`	AI satisfaction IDOR studyId scoping
p1-018	sensei	✅ Fixed	`48244c1`	HALF_OPEN in-flight request count limit
p1-019	sensei	✅ Fixed	`bb59db7`	New RateLimitRetryableError
p1-020	sensei	✅ Fixed	`f02708d`	categories list[dict] schema validation
p0-010~012	postman	⏳	—	In auto-dispatch loop
p1-021~022	sensei	⏳	—	In auto-dispatch loop

auto-Critic Chain Verification (Sprint 117 carry-over resolved)

cleanup trap → oracle-auto-critic.sh → oracle-create-task.sh --simple → critic task auto-created path worked correctly for all 5 first-batch items:

task-20260422-171337 (postman → critic, base: a25c105)
task-20260422-171434 (sensei → critic, base: a25c105)
task-20260422-171647 (palette → critic, base: adc9d42)
task-20260422-171821 (librarian → critic, base: a25c105)
task-20260422-172004 (herald → critic, base: 28feb63)

Outputs

Code Changes (inside repo)

15 security/stability fix commits (P0 12 + P1 3)
2 false positives closed (p0-002 AnalysisView, p0-014 already fixed)
5 items processing in auto-dispatch loop (P0 2 + P1 2 + Critic re-verification)
All tests passing, coverage thresholds maintained

Infrastructure (outside repo)

~/.claude/oracle/bin/oracle-audit-dispatch.sh new — queue consumer, re-routing, dry-run support
~/.claude/oracle/bin/oracle-audit-dispatch-loop.sh new — auto-repeat dispatch (30-second interval)
~/.claude/oracle/bin/oracle-spawn.sh modified — pane idle detection improved

Key Lessons Learned

Routing matrix needs re-evaluation at dispatch time — security → architect fixed routing works for design review but domain agents are more effective for code modification.
tmux pane reuse logic is vulnerable to concurrent spawns — one-pane-per-tier assumption collides in sequential dispatch. PID-based idle detection needed for pane management.
auto-Critic chain operates stably — code-changing agent's cleanup trap correctly creates critic tasks, and dispatch picks them up.
Critic full audit contains false positives (p0-002) — auto-fix pipeline needs a false positive handling path.

Sprint 120+ Roadmap Draft

Sprint	Scope	Estimated Count	Notes
120	P1 security batch dispatch	~49 items	Includes full auto-chain implementation for audit-dispatch
121	P1 remaining service-by-service batches	~225 items	3–4 batch splits, review auto-accept threshold
122	P2 maintainability	~289 items	Including Low 6, review auto-close policy

Carried Over

Remaining P0 2 items (unprocessed from p0-010~012 loop) — auto-loop running
P1 remaining 2 items (p1-021~022) — auto-loop running
oracle-spawn.sh pane parallelization full fix (PID-based idle detection)
audit-queue → completed auto-transition (cleanup trap integration)
Frontend security-critical 10-file priority re-audit
audit-queue JSON status manual update automation (reap integration)

Phase D Actual Dispatch — P0 17-Item Auto-Fix Loop E2E Demonstration