This page documents significant outages with root cause analysis, resolution steps, and preventive measures.
Date: 2026-02 | Severity: High | Duration: ~30 minutes
K8s workloads using TrueNAS NFS PVCs became completely unresponsive due to NFS server thread exhaustion. TrueNAS's KVM process pinned at 100% CPU and the VM became unreachable via SSH and web UI.
TrueNAS SCALE defaults to 2 NFS server threads (servers: 2). This is sufficient for a home NAS with a few SMB clients, but dangerously low for Kubernetes:
The cluster had approximately 15+ pods with NFS PVCs at the time, each capable of concurrent I/O.
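A quick way to confirm thread exhaustion is to compare the configured thread count against the server's pool statistics. A minimal sketch, assuming root SSH access to the TrueNAS VM and the standard Linux knfsd proc interface (paths may differ slightly between SCALE versions):
# On the TrueNAS VM, while it is still reachable
ssh [email protected] "cat /proc/fs/nfsd/threads"      # configured thread count (was 2)
ssh [email protected] "cat /proc/fs/nfsd/pool_stats"   # packets arrived vs threads woken / timed out
# From any K8s worker, rising RPC retransmissions also point to a saturated server
nfsstat -rc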
| Affected | Not Affected |
|---|---|
| All pods with nfs-subdir-retain PVCs | Pods using Longhorn storage |
| All pods with nfs-subdir-delete PVCs | Pods with no persistent storage |
| Grafana (was on nfs-subdir-retain) | |
| VictoriaMetrics (was also NFS at the time) | |
| Harbor Registry, LifeOps DB | Vault, Authentik (Longhorn) |
Compounding factor: monitoring (VictoriaMetrics) was also on TrueNAS NFS at the time, so the outage took down monitoring itself, which made it harder to diagnose what was failing.
# From Proxmox host — graceful reboot first
qm reboot 109
# If graceful times out after ~60s, force reset
qm reset 109
# After TrueNAS comes back, immediately increase NFS threads
curl -u "andy:<password>" -X PUT https://192.168.88.230/api/v2.0/nfs \
-H "Content-Type: application/json" \
-d '{"servers": 8}'
# Verify in TrueNAS web UI: Services > NFS > Edit > Servers = 8
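The same API endpoint can be read back to confirm the change without the web UI. A sketch; assumes jq is available, and -k is included on the assumption the TrueNAS certificate is self-signed (drop it if the cert is trusted):
# Read back the NFS config via the API
curl -sk -u "andy:<password>" https://192.168.88.230/api/v2.0/nfs | jq '.servers'
# Expected: 8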
K8s pods recover automatically once NFS is available again — no manual pod restarts needed (kubelet retries NFS mounts).
If TrueNAS NFS becomes unresponsive and SSH to the VM is also down:
# From Mac or any machine with Proxmox access
ssh [email protected] "qm reboot 109"
# Wait 60s, check if TrueNAS web UI at https://192.168.88.230 responds
# If not: qm reset 109
Date: 2026-02 | Severity: High | Duration: Several days (silent degradation)
K8s worker nodes silently lost RAM over several days due to Proxmox's memory balloon driver. This caused intermittent pod OOMKills and incorrect scheduling decisions. The degradation was invisible in both K8s and Proxmox dashboards — it looked like application bugs, not infrastructure.
Why kubelet doesn't notice: Kubelet reads allocatable memory at startup and caches it. The balloon driver shrinks the VM's physical memory without notifying the guest OS in a way that kubelet responds to. Kubelet continues advertising 12GB of allocatable memory to the scheduler even though the VM only has 4GB.
Why it's invisible in Proxmox: The Proxmox UI shows the VM's configured memory (12GB), not the current ballooned size. The balloon value only appears if you run pvesh get /nodes/andy/qemu/<vmid>/status/current.
Default Proxmox VM configuration sets balloon: 4096 (4GB minimum). When the Proxmox host experienced any memory pressure, the balloon driver silently reduced all 3 worker VMs from 12GB to 4GB over several hours.
The control plane (VMID 107) was not affected because it has fewer pods and less memory pressure — its balloon had not yet triggered.
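The live ballooned size can be read straight from the QEMU status API. A sketch, run on the Proxmox host; assumes jq is installed, and the balloon / maxmem field names are taken from the QEMU status output and may vary by Proxmox version:
# Read the current balloon size (bytes) for each worker VM
for vmid in 103 104 108; do
  pvesh get /nodes/andy/qemu/${vmid}/status/current --output-format json \
    | jq "{vmid: ${vmid}, balloon: .balloon, maxmem: .maxmem}"
done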
# Check current balloon config on all VMs (run on Proxmox host)
pvesh get /nodes/andy/qemu --output-format=text | grep -E "name|mem |balloon"
# From K8s side — check what kubelet thinks is allocatable
kubectl describe node k8s-node1 | grep -A5 "Allocatable:"
# Compare to what the VM actually has
# Expected: ~11Gi allocatable on a 12GB VM (minus kernel/system)
# Actual during incident: ~3.5Gi (balloon shrank to 4GB)
# Disable ballooning on all K8s worker VMs (live — no VM restart required)
# Run on Proxmox host (192.168.88.100)
qm set 103 --balloon 0 # k8s-node2
qm set 104 --balloon 0 # k8s-node3
qm set 108 --balloon 0 # k8s-node1
# Restart kubelet on each worker to refresh allocatable resources
ssh [email protected] "qm guest exec 103 -- systemctl restart kubelet"
ssh [email protected] "qm guest exec 104 -- systemctl restart kubelet"
ssh [email protected] "qm guest exec 108 -- systemctl restart kubelet"
# Verify allocatable memory is now correct
kubectl describe node k8s-node1 | grep -A5 "Allocatable:"
# Should show ~11Gi
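To confirm the config change persisted, a quick check on the Proxmox host (sketch):
# Confirm ballooning is off in each worker VM config
for vmid in 103 104 108; do
  echo -n "VM ${vmid}: "; qm config ${vmid} | grep -i balloon
done
# Expected: balloon: 0 for each worker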
| VMID | Name | RAM | Balloon | Notes |
|---|---|---|---|---|
| 103 | k8s-node2 | 12GB | 0 (off) | Fixed |
| 104 | k8s-node3 | 12GB | 0 (off) | Fixed |
| 108 | k8s-node1 | 12GB | 0 (off) | Fixed |
| 107 | k8s-controlplane | 8GB | 2048 | Kept — fewer critical pods |
| 109 | TrueNAS-Scale | 16GB | 2048 | Kept — not K8s workload |
Detection: compare pvesh get on the Proxmox host with kubectl describe node to spot an allocatable vs configured memory mismatch.

Date: 2026-02 | Severity: Medium | Duration: Until OTEL collector restored
LifeOps backend entered a crash loop (CrashLoopBackOff) due to a cascade: OTEL collector went down → backend goroutines blocked waiting for OTEL connection → /api/health responses slowed → liveness probe timeout → kubelet restart → repeat.
The LifeOps backend initialises an OTEL gRPC exporter at startup. When the OTEL collector endpoint is unreachable, the gRPC client enters a reconnection backoff loop. During this backoff, any goroutine that tries to export a span calls into the gRPC layer and blocks on a channel send waiting for the connection to become available.
In Go, if the OTEL span export call is synchronous (not fire-and-forget), the HTTP request handler goroutine blocks until the OTEL call either succeeds or times out. If there is no explicit timeout, it blocks indefinitely — or until the OTEL client eventually gives up (which may take 30+ seconds).
The liveness probe calls /api/health with a 1-second timeout. If the handler goroutine is blocked on OTEL, the probe times out and kubelet restarts the pod.
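Because the probe window (1 second) is far shorter than the OTEL client's backoff (30+ seconds), a blocked handler reliably gets the pod killed. As a stopgap only, the probe can be given more headroom. A hedged sketch that assumes the liveness probe timeoutSeconds and failureThreshold fields already exist on the first container of the deployment:
# Stopgap: widen the liveness probe while the OTEL dependency is unhealthy
kubectl patch deployment lifeops-backend -n life-ops --type=json -p '[
  {"op": "replace", "path": "/spec/template/spec/containers/0/livenessProbe/timeoutSeconds", "value": 5},
  {"op": "replace", "path": "/spec/template/spec/containers/0/livenessProbe/failureThreshold", "value": 6}
]'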
# Pod in crash loop
kubectl get pods -n life-ops
# lifeops-backend-xxx 0/1 CrashLoopBackOff 12 1h
# Logs show liveness probe failures
kubectl logs -n life-ops <pod> --previous
# context deadline exceeded (liveness probe timeout)
# OTEL collector is the root cause
kubectl get pods -n monitoring | grep otel
# otel-collector-xxx 0/1 ImagePullBackOff 0 2h
# Option 1: Fix the OTEL collector (preferred)
kubectl describe pod -n monitoring <otel-pod> # find the root cause
# Fix the image pull, OOM, config issue, etc.
# Option 2: Temporarily disable OTEL to stop the crash loop
kubectl set env deployment/lifeops-backend -n life-ops \
OTEL_EXPORTER_OTLP_ENDPOINT=""
# Once OTEL collector is healthy again, re-enable
kubectl set env deployment/lifeops-backend -n life-ops \
OTEL_EXPORTER_OTLP_ENDPOINT="http://otel-collector.monitoring:4317"
Prevention: configure the OTEL exporter with WithTimeout (or use context cancellation) in the OTEL SDK config so a span export can never block a request handler indefinitely. When diagnosing a crash loop, check --previous pod logs AND look for unhealthy pods in other namespaces that the crashing pod depends on.

Date: 2026-03-06 → 2026-03-07 | Severity: High | Duration: ~12h detection blindspot + silent Telegram failure since deployment
All 4 CrowdSec agents and the AppSec pod entered CrashLoopBackOff with "machine not found" errors after the LAPI pod was replaced. This was compounded by two pre-existing silent bugs discovered during investigation: Telegram notifications were never being sent due to a wrong template field name, and HTTP detection was completely blind due to a Traefik traffic policy misconfiguration.
The LAPI's SQLite database was stored in an emptyDir volume. emptyDir persists across container restarts within the same pod, but is destroyed when the pod itself is replaced.
When the LAPI pod was replaced (due to node reschedule after Longhorn RWO multi-attach error):
The new LAPI pod came up with an empty SQLite database, so the agents' existing credentials returned "machine not found". The wait-for-lapi-and-register init containers do not re-run on container restarts — only on pod replacement. The agents therefore looped in CrashLoopBackOff and stayed stuck until their pods were explicitly deleted (triggering new pods with fresh init containers).
Original Longhorn RWO issue: Before the emptyDir phase, the LAPI used a Longhorn RWO PVC. When the pod rescheduled to a different node, Longhorn's Multi-Attach error caused the new pod to start without the PVC (emptyDir fallback), corrupting the SQLite WAL.
The HTTP notifier template in values.yaml used {{.Value}} and {{.Duration}}:
# Wrong — causes silent template error
IP: {{.Value}}\nDuration: {{.Duration}}
# Correct — models.Alert fields
IP: {{.Source.Value}}\nDuration: {{(index .Decisions 0).Duration}}
The models.Alert type does not have .Value or .Duration at the top level. On every alert, the LAPI logged:
level=error msg="format alerts for notification: template: :1:69:
executing "" at <.Value>: can't evaluate field Value in type *models.Alert"
No Telegram messages were ever sent. This had been broken since the initial deployment.
externalTrafficPolicy: Cluster (Silent Since Deployment)

With externalTrafficPolicy: Cluster, kube-proxy routes external traffic through any node and applies SNAT — rewriting the client source IP to the pod-network gateway (10.244.x.x). Traefik logs this internal IP as ClientHost.
The crowdsecurity/whitelist-good-actors parser whitelists 10.0.0.0/8. Every Traefik log line was silently whitelisted — 0 events ever reached any HTTP detection scenario.
Verified: 1130+ Traefik log lines processed, 1130 whitelisted, 0 poured to any bucket.
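The quickest way to confirm whether log lines are actually reaching detection scenarios, rather than just being parsed and whitelisted, is the acquisition metrics on any agent. A sketch; the label selector is an assumption, so substitute an actual agent pod name if it does not match, and older cscli versions print the same tables via plain cscli metrics:
# Pick any agent pod and dump acquisition metrics
AGENT=$(kubectl get pods -n crowdsec -l k8s-app=crowdsec -o name | head -1)
kubectl exec -n crowdsec ${AGENT} -- cscli metrics show acquisition
# Healthy: non-zero "lines poured to bucket" for the Traefik source
# During this incident: all lines parsed, all whitelisted, 0 poured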
| Component | Status During Incident |
|---|---|
| Traefik bouncer (IP ban check) | ✅ Working (stream cache from last sync) |
| AppSec WAF (per-request block) | ✅ Working (blocks still applied) |
| HTTP scenario detection | ❌ Dead since deployment (all IPs whitelisted) |
| SSH brute force detection | ✅ Working (not affected by LAPI restart) |
| Telegram notifications | ❌ Dead since deployment (template bug) |
| Community blocklist (CAPI) | ✅ Working (pulled periodically) |
Step 1 — Immediate recovery (agent re-registration):
kubectl rollout restart ds/crowdsec-agent -n crowdsec
kubectl rollout restart deploy/crowdsec-appsec -n crowdsec
# All 6 pods Running, 0 restarts within ~2 minutes
Step 2 — Permanent fix (NFS PVCs):
# values.yaml — switched from Longhorn RWO to NFS RWX
lapi:
persistentVolume:
data:
enabled: true
storageClassName: nfs-synology
accessModes: [ReadWriteMany]
size: 1Gi
config:
enabled: true
storageClassName: nfs-synology
accessModes: [ReadWriteMany]
size: 100Mi
Verification: LAPI pod manually deleted → new pod came up → cscli machines list showed all agents still registered. No rollout restart needed.
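A quick way to repeat that verification after any future LAPI pod replacement (sketch; the LAPI deployment name is an assumption, adjust to the actual release):
kubectl exec -n crowdsec deploy/crowdsec-lapi -- cscli machines list
# Every agent should still be listed as validated, with a recent heartbeat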
Step 3 — Fix Telegram template:
# Before (broken):
format: '... IP: {{.Value}}\nDuration: {{.Duration}} ...'
# After (correct):
format: '... IP: {{.Source.Value}}\nDuration: {{(index .Decisions 0).Duration}} ...'
Step 4 — Fix Traefik source IP preservation:
# traefik values.yaml
service:
spec:
externalTrafficPolicy: Local # was Cluster
After fix: Traefik logs real client IPs. LAN traffic (192.168.88.x) is still RFC1918-whitelisted (correct). Internet attackers are now detected.
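A quick post-fix verification (sketch; the Traefik service and namespace names are assumptions based on a standard Helm install):
# Confirm the service policy took effect
kubectl get svc traefik -n traefik -o jsonpath='{.spec.externalTrafficPolicy}{"\n"}'   # expect: Local
# Spot-check that access logs now show real client IPs instead of 10.244.x.x
kubectl logs -n traefik deploy/traefik --tail=50 | grep -v "10\.244\."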
The template error was logged at error level, but there is no metric or alert on it; consider monitoring the LAPI error log rate. Also run cscli metrics show acquisition to confirm lines are reaching detection buckets, not just being parsed and whitelisted.

Date: 2026-03-15 | Severity: Medium | Duration: ~1h
The backend and reminder-checker pods in life-ops were stuck in ImagePullBackOff, and the Harbor repositories for the application images (lifeops-backend, lifeops-frontend) had 0 artifacts. The Harbor retention policy for the applications project was configured with nDaysSinceLastPush: 7 (TTL-based, delete images older than 7 days). The last CI/CD run that built and pushed images was 2026-03-07 (8 days before). The daily midnight retention job on 2026-03-15 deleted all images since they were outside the 7-day window.
The Terraform module (modules/harbor/harbor.tf) had been updated to include a safety net Rule 2 (most_recently_pushed = 5 — always keep the 5 most recently pushed images regardless of age), but Terraform Cloud had not been re-applied since that change was written. Only Rule 1 existed in Harbor.
Additionally, the retention_days for applications in main.tf was already updated to 90 (correct), but again not applied.
Summary: two-layer protection existed in code but neither was in effect in Harbor.
Retention policies after the Terraform apply:
applications: Rule 1 nDaysSinceLastPush: 90, Rule 2 latestPushedN: 5 ✓
gha-apps: Rule 1 nDaysSinceLastPush: 7, Rule 2 latestPushedN: 5 ✓
tooling-images: Rule 1 nDaysSinceLastPush: 90, Rule 2 latestPushedN: 5 ✓
Images were rebuilt via workflow_dispatch on AnhTran1610/LifeOps CI with component=all, skip_ci=true (commit 8d86bba). The update-manifests job updated applications/lifeops/values.yaml in k8s-cluster-config, and the pods came back to 1/1 Running. The restored images are now covered by the latestPushedN safety net.
Prevention: changes to modules/harbor/harbor.tf are invisible to Harbor until Terraform Cloud actually runs. Verify retention policy rules in the Harbor UI after any Terraform module change. most_recently_pushed: 5 is the critical safety net — it ensures the currently-deployed image is never wiped regardless of push age.
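The live policy can also be read back from the Harbor API instead of the UI after any retention change. A sketch; the registry URL is a placeholder, and looking the retention ID up via project metadata is an assumption, so fall back to the UI if that field is absent:
HARBOR=https://<harbor-host>          # substitute the real registry URL
RID=$(curl -s -u "admin:<password>" "${HARBOR}/api/v2.0/projects/applications" | jq -r '.metadata.retention_id')
curl -s -u "admin:<password>" "${HARBOR}/api/v2.0/retentions/${RID}" | jq '.rules[] | {template, params}'
# Expect both rules: the 90-day age rule and the keep-latest-5 rule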