🚨 NEXUS Enhanced

Multi-Agent Enterprise Incident Response RL Environment

CrowdStrike-Scale Incident Training via GRPO

πŸ“Š Training Metrics
Episodes Completed
0
Total training episodes
Aggregate across all local runs
Average Reward
0.00
Mean of all completed episodes
Scope: aggregate view
Best Reward
0.00
Highest episode reward achieved
Improvement
0%
vs baseline untrained performance
πŸ“ˆ Reward Curve (Learning Progress)
Training reward history in aggregate mode. Use Advanced Metrics for per-run filtering.
Open Advanced Metrics (live + Colab exports tab)
πŸ“‹ Episode History
Shows completed sessions currently held in server memory (most recent first).
Episode
Incident
Reward
Status
Loading episodes...
πŸ§ͺ Manual Incident Validation

Test the environment manually by creating incidents and executing steps.

Guided text is hardcoded in this page for repeatable demos. Use Start Test, then Guided: fill + execute until the episode completes (or continue manually anytime).

▢️ Scripted auto-demo (INC003)

Runs POST /demo/run/INC003 in one shot (separate session from manual test). Step rewards stay 0 until the episode completes (sparse reward)β€”use the final breakdown at the bottom for the real score. Output stays visible until you clear it.

Click "Start Test" to begin manual validation