Duel Mode
Run a real Red vs Blue exercise across two machines. Each side keeps its own evidence ledger; an impartial Referee correlates the two and scores the result — detection latency, coverage, misses — on evidence alone.
/spectra-duel-adjudication
Three moves
Separate
Red and Blue run on separate machines. Each writes a role-local, append-only JSONL ledger of what it did or saw — no shared state, no peeking.
Exchange
A Red/Blue broker exchanges offline, signed JSON bundles between the separated machines — checksum + schema + event verified, deduplicated. No sockets, no remote agents, no host modification.
Adjudicate
The Referee correlates Red actions with Blue detections across the timeline and produces a scorecard. Credit is awarded only where the ledger proves it — agent prior knowledge of the Red plan never counts.
A score without evidence is opinion, not adjudication. Detection must be evidenced by Blue telemetry; misses distinguish absent telemetry, unanalyzed telemetry, failed detection and failed correlation. A useful scorecard improves the next exercise — it does not merely declare a winner.
The broker is file-based: no sockets, no listeners, no remote agents, no host modification. The Blue Live Adapter ingests defensive telemetry read-only. Distribution is the point — neither side trusts the other’s claims, only the correlated evidence.