Sample runbook
High CPU on web tier
This is an example of what a Resolvify runbook looks like. AI generates these from your documentation, past incidents, and tribal knowledge, then refines them using success/failure feedback from each run.
Runbook metadata
- Trigger: Alert: CPU > 80% for 5 min on web tier
- Mode: Approve-to-run (human signs off)
- Risk level: Medium (scaling action)
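As a rough illustration of the trigger condition above, the sketch below checks whether CPU has stayed above 80% for a full 5-minute window. The names RunbookMetadata and trigger_fires are hypothetical, and "every sample in the window exceeds the threshold" is an assumed reading of the alert rule; a real alerting system may use averages or percentiles instead.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class RunbookMetadata:
    """Hypothetical container for the metadata fields listed in the runbook."""
    trigger: str
    mode: str
    risk_level: str


HIGH_CPU_WEB_TIER = RunbookMetadata(
    trigger="Alert: CPU > 80% for 5 min on web tier",
    mode="Approve-to-run (human signs off)",
    risk_level="Medium (scaling action)",
)


def trigger_fires(
    samples: Sequence[Tuple[float, float]],
    threshold_pct: float = 80.0,
    window_s: float = 300.0,
) -> bool:
    """Return True if CPU stayed above the threshold for the whole window.

    samples: (timestamp_seconds, cpu_percent) pairs, oldest first.
    Assumption: the alert fires only when every sample in the most recent
    window is above the threshold and the data covers the full window.
    """
    if not samples:
        return False
    window_start = samples[-1][0] - window_s
    # Not enough history to cover the full window yet.
    if samples[0][0] > window_start:
        return False
    recent = [cpu for ts, cpu in samples if ts >= window_start]
    return all(cpu > threshold_pct for cpu in recent)
```

With one sample per minute, six consecutive readings above 80% spanning 0 to 300 seconds would fire the trigger; a single dip below the threshold inside the window would not.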
Identify top CPU consumers
Run top -b -n 1 to capture a one-shot snapshot of the process list and identify the processes using the most CPU. Check whether the spike coincides with a recent deploy.
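A minimal sketch of how the snapshot could be turned into a ranked list of CPU consumers. It assumes the default procps column layout (a header row containing PID, %CPU, and COMMAND); ProcessUsage, parse_top_snapshot, capture_top_snapshot, and the sample output are illustrative, not part of Resolvify.

```python
import subprocess
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ProcessUsage:
    pid: int
    cpu_percent: float
    command: str


def capture_top_snapshot() -> str:
    """Run `top -b -n 1` (batch mode, one iteration) and return its output."""
    result = subprocess.run(
        ["top", "-b", "-n", "1"], capture_output=True, text=True, check=True
    )
    return result.stdout


def parse_top_snapshot(output: str, limit: int = 5) -> List[ProcessUsage]:
    """Return the `limit` processes with the highest %CPU, highest first."""
    lines = output.splitlines()
    # Locate the column header row; everything after it is per-process data.
    header_idx = next(
        (i for i, line in enumerate(lines)
         if {"PID", "%CPU", "COMMAND"} <= set(line.split())),
        None,
    )
    if header_idx is None:
        return []
    columns = lines[header_idx].split()
    pid_col = columns.index("PID")
    cpu_col = columns.index("%CPU")
    cmd_col = columns.index("COMMAND")

    processes = []
    for line in lines[header_idx + 1:]:
        # Cap the split so a final COMMAND column containing spaces stays whole.
        fields = line.split(None, len(columns) - 1)
        if len(fields) < len(columns):
            continue
        try:
            pid = int(fields[pid_col])
            # Some locales print a decimal comma.
            cpu = float(fields[cpu_col].replace(",", "."))
        except ValueError:
            continue
        processes.append(ProcessUsage(pid, cpu, fields[cmd_col].strip()))
    processes.sort(key=lambda p: p.cpu_percent, reverse=True)
    return processes[:limit]


# Made-up sample output, for illustration only.
SAMPLE_TOP_OUTPUT = """\
top - 10:00:00 up 1 day,  2:03,  1 user,  load average: 3.10, 2.50, 1.90
Tasks: 200 total,   2 running, 198 sleeping,   0 stopped,   0 zombie

    PID USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND
    987 www-data  20   0  128000  20000   5000 S   3.0   0.3   0:10.02 nginx
   1234 www-data  20   0  512000  90000  10000 R  92.3   1.1   1:23.45 node
   2222 deploy    20   0   64000  15000   4000 S   1.5   0.2   0:01.10 python3
"""
```

Calling parse_top_snapshot(SAMPLE_TOP_OUTPUT, limit=2) ranks the node process first. Correlating with a recent deploy would need deploy timestamps from your own tooling, which this sketch does not cover.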
Scale web tier (if load-driven)
If the CPU spike is load-driven and the tier is below its capacity limits, add 1–2 instances. Parameter: scale_count (default: 2).
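The scaling rule can be sketched as a small pure function. Only scale_count and its default of 2 come from the runbook; plan_scale_out, current_instances, and max_instances are hypothetical names, and treating the "capacity limit" as a maximum instance count is an assumption.

```python
def plan_scale_out(
    current_instances: int,
    max_instances: int,
    load_driven: bool,
    scale_count: int = 2,
) -> int:
    """Return how many instances to add (0 means do not scale).

    Assumptions: scale_count is restricted to the runbook's 1-2 range, and
    the capacity limit is a hard cap on the total instance count.
    """
    if not 1 <= scale_count <= 2:
        raise ValueError("scale_count must be 1 or 2 for this runbook")
    if not load_driven:
        return 0
    # Never plan past the capacity limit.
    headroom = max(0, max_instances - current_instances)
    return min(scale_count, headroom)
```

For example, a tier at 9 of 10 instances would add only one instance, and a tier already at its limit would add none, leaving the operator to escalate instead.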
Restart runaway process (if identified)
If a single process is consuming > 70% CPU and is known-safe to restart, restart it with a 30s grace period.
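One way to picture the restart step: a check against the 70% threshold and an allowlist, followed by a graceful stop that waits up to 30 seconds before forcing termination. The helper names and the allowlist are hypothetical, and the sketch assumes the process is a child held as a subprocess.Popen handle; in practice a service manager would usually perform the stop and the subsequent start.

```python
import subprocess
from typing import Collection

CPU_RESTART_THRESHOLD = 70.0  # percent, from the runbook
GRACE_PERIOD_S = 30.0         # seconds, from the runbook


def should_restart(
    command: str, cpu_percent: float, known_safe: Collection[str]
) -> bool:
    """True only if the process is over the threshold and on the allowlist."""
    return cpu_percent > CPU_RESTART_THRESHOLD and command in known_safe


def stop_with_grace(proc: subprocess.Popen, grace_s: float = GRACE_PERIOD_S) -> int:
    """Ask the process to exit, then force-kill it if the grace period expires.

    Returns the process exit code. Starting the replacement process is left
    to whatever supervises the service (not shown here).
    """
    proc.terminate()  # SIGTERM on POSIX
    try:
        return proc.wait(timeout=grace_s)
    except subprocess.TimeoutExpired:
        proc.kill()  # SIGKILL on POSIX
        return proc.wait()
```

Keeping the decision (should_restart) separate from the action (stop_with_grace) fits the approve-to-run mode: the decision can be shown to a human before anything is executed.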
Log outcome and close ticket
Update the ITSM ticket with the action taken. If the issue is not resolved, escalate to L2 with context.
Rollback: Revert the scale-out if CPU normalizes within 15 min; otherwise keep the added instances. Every execution is logged and auditable.
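The rollback rule and the logging requirement might look like the following sketch. The function names, action labels, and audit fields are all illustrative assumptions; the source does not describe Resolvify's actual log format or ITSM integration.

```python
import json
from datetime import datetime, timezone
from typing import Optional


def rollback_action(
    minutes_to_normalize: Optional[float], window_min: float = 15.0
) -> str:
    """Decide what to do with the added instances.

    minutes_to_normalize is None if CPU has not returned to normal.
    """
    if minutes_to_normalize is not None and minutes_to_normalize <= window_min:
        return "revert_scale"
    return "keep_scale"


def audit_record(
    runbook: str,
    action: str,
    approved_by: str,
    resolved: bool,
    now: Optional[datetime] = None,
) -> str:
    """Build a JSON audit entry; unresolved runs are flagged for L2."""
    timestamp = now or datetime.now(timezone.utc)
    return json.dumps(
        {
            "runbook": runbook,
            "action": action,
            "approved_by": approved_by,
            "resolved": resolved,
            "escalate_to": None if resolved else "L2",
            "timestamp": timestamp.isoformat(),
        },
        sort_keys=True,
    )
```

An entry like this could be attached to the ITSM ticket so the escalation to L2 carries the action taken and who approved it.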