LEGACY DATASET
These charts come from the kube-sre-gym-style heuristic + early notebook runs — the 11 hand-curated tasks in rl-agent/scenarios/{easy,medium,hard}/*.json, recorded into rl-agent/checkpoints/<run>/metrics.jsonl and colab/logs/reward_breakdown_history.jsonl. They do not include the 381-task PPO Kaggle run.

Adversarial scenario designer

Designer is procedural fallback. Source: training snapshot (5 scenarios). Showing last 5 scenarios.

Generated scenarios

Base taskFaultsDifficultyRed herrings
task3decimal_corruption, redis_pool_exhaustionhard["postgres-vacuum", "redis-primary"]
task6tls_expiry, dns_resolutionhard["service-mesh", "coredns"]
task7configmap_race, cache_poisoninghard["redis-cache", "gc-pause"]
task11liveness_regression, image_pull_backoffhard["scheduler", "kubelet"]
task2payments_oom, kafka_lagmedium["broker-2", "zookeeper"]

How it works

POST /adversarial/design
{
  "base_task_id": "task2",
  "difficulty": "hard",
  "mix_faults": true,
  "add_red_herrings": 2
}