How to Work with Runbooks
Runbooks are step-by-step remediation playbooks. This guide covers creating, configuring, approving, and executing runbooks in an enterprise workflow.
When to use runbooks
- After RCA: When you have a root cause and a clear fix (e.g. restart service, scale pod, clear cache). Generate a runbook from the incident or pick a template.
- Recurring fixes: When the same type of incident happens often — turn the fix into an approved runbook so next time you (or the AI) can run it with one click.
- Controlled automation: When you want automation but need approval gates (L1/L2) so humans approve before risky steps run.
Why approval matters
Approval ensures that high-risk or irreversible actions are reviewed before execution. Draft runbooks must be approved before they can be run from Automation; per-step approval can be required depending on your Settings & RBAC and autonomy level.
Where to find Runbooks
- In the left sidebar, click Runbooks.
- The Runbook library lists all runbooks (draft, pending approval, and approved).
![]()
Capture from: Tenant Plane → AUTOMATE → Runbooks. Add as public/img/runbook-library.png.
Creating a runbook
Option A: Create from scratch
- Click New Runbook (or Create runbook).
- Enter a Title (e.g., “Restart Oracle listener on host”).
- Optionally set Applicability so the runbook is suggested only for matching services or CIs:
- Service types — e.g.,
oracle,api-gateway - CI patterns — e.g.,
db-*,web-* - Labels — tags that match incident CIs or services
- Service types — e.g.,
- Add Steps (see Adding and editing steps below).
- Click Save. The runbook is saved as Draft.
Option B: Generate from an incident (AI)
- Open an Incident that has RCA results.
- In the incident detail, click Run RCA if not already done.
- After RCA completes, click Generate Runbook (or Suggest runbook).
- The AI proposes steps based on root cause and evidence. Review and edit steps as needed.
- Save the runbook. It remains in Draft until approved.
![]()
Capture from: Tenant Plane → Incidents → open incident → RCA. Add as public/img/runbook-generate-from-incident.png.
Adding and editing steps
Each step can be manual or automation (command/script).
- In the runbook editor, click Add step.
- For each step set:
- Description — What this step does (e.g., “Restart Oracle listener”).
- Action type — Manual or Command/automation.
- Command — For automation: the actual command (e.g.,
systemctl restart oracle-xeorlsnrctl stopthenlsnrctl start). Use variables for dynamic targets (see below). - Target — Where the command runs (e.g.,
{{target_ci}},{{oracle_host}}). These are filled at execution time from the incident or job variables. - Rollback command (optional) — Command to undo this step if it fails.
- Validation command (optional) — Command to verify success (exit 0 = success).
- Use Reorder to change step order. Save when done.
Using variables in steps
| Variable | Meaning | Example |
|---|---|---|
{{target_ci}} | Primary CI linked to the incident | Filled when running from an incident |
{{oracle_host}} | Host for Oracle operations | Set in job variables or from incident CIs |
{{service_name}} | Affected service name | From incident’s affected service |
When you execute a runbook from an incident, these variables are auto-populated from the incident’s CIs and services. For manual runs, provide them in the execution dialog.
![]()
Capture from: Tenant Plane → Runbooks → Edit runbook → step. Add as public/img/runbook-step-editor.png.
Approval workflow
- When the runbook is ready, click Submit for approval (or Request approval). Status becomes Pending.
- Approvers (e.g., SRE lead) open Runbooks or Automation → Approvals, select the runbook, and click Approve (or Reject).
- After approval, status becomes Approved. The runbook is then available for execution and can be suggested by RCA.
![]()
Capture from: Tenant Plane → AUTOMATE → Automation (Approvals). Add as public/img/runbook-approval.png.
Executing a runbook
From an incident
- Open the Incident.
- In the Diagnose or Remediate tab, find Suggested runbooks (from RCA) or Runbooks.
- Click Execute on the runbook you want to run.
- Review pre-filled variables (
target_ci,oracle_host,service_name). Add or override if needed. - Click Start execution. Monitor progress in the Automation / Jobs view.
From the Automation page
- Go to Automation (or Runbooks → execute from library).
- Select the runbook and click Run or Execute.
- Enter Variables (e.g.,
target_ci,oracle_host) for the run. - Submit. The job appears in the execution list; open it for live output and status.
Scheduled execution
Runbooks can be run on a schedule via Automation → Schedules. See Scheduling & Automation.
Example: End-to-end runbook
- Create runbook “Restart Oracle listener.”
- Steps: (1) Validate connectivity to
{{oracle_host}}, (2) Runlsnrctl stopthenlsnrctl starton{{oracle_host}}, (3) Runsqlplus -s / as sysdba <<< "SELECT 1 FROM DUAL"to validate. - Applicability: Service types:
oracle; CI patterns:db-*. - Submit for approval → Approve.
- When an incident affects an Oracle CI, RCA suggests this runbook; Execute uses the incident’s host/CI as
{{oracle_host}}and{{target_ci}}.
Next steps
- Scheduling & Automation — Run runbooks on a schedule.
- Monitoring Policies — Define what is monitored and when alerts fire.
- Runbooks (module reference) — Technical overview of runbooks.