RCA Lab

The RCA Lab is AtlasAI’s AI-powered root cause analysis workspace. It combines multi-signal evidence analysis, historical pattern matching via RAG, and topology-aware reasoning to produce ranked root cause hypotheses with explainable evidence chains.

Key Features

Multi-signal analysis — Analyzes metrics, logs, traces, alerts, topology, and change events simultaneously
RAG-powered reasoning — Retrieves context from past incidents, runbooks, and documentation to inform analysis
Topology-aware RCA — Walks service dependency graphs to trace failures upstream and downstream
Confidence scoring — Each hypothesis includes a confidence percentage based on evidence strength
Evidence chains — Transparent reasoning showing exactly which data points led to each conclusion
Comparative analysis — Compare the current incident to historical incidents with similar signatures
Interactive exploration — Drill into any hypothesis to see supporting and contradicting evidence
Feedback loop — Mark RCA results as correct, partially correct, or incorrect to improve future accuracy

How to Access

Navigate to RESPOND → RCA Lab in the left sidebar to access the standalone lab, or click Run RCA from within any incident (command palette: Cmd+K → “RCA Lab”).

Basic Usage

Open the RCA Lab from the sidebar
Select an incident to analyze, or create a new investigation
Review the auto-collected evidence — the lab pulls available metrics, logs, and topology for the affected service
Add additional evidence manually if the auto-collected data is insufficient
Click Analyze to run the reasoning engine
Review the ranked hypotheses in the Results panel
Click into a hypothesis to see the full evidence chain
Mark the correct root cause to improve future RCA accuracy