Netdata AI
Netdata AI is a set of analysis and troubleshooting capabilities built into Netdata Cloud. It turns high‑fidelity telemetry into explanations, timelines, and recommendations so teams resolve issues faster and document decisions with confidence.
Why it’s accurate and powerful
- Per‑second granularity: Every Netdata Agent collects metrics at 1‑second resolution, preserving short‑lived spikes and transient behavior.
- On‑device ML: Unsupervised models run on every agent, continuously scoring anomalies for every metric with zero configuration.
- Evidence‑based correlation: Netdata’s correlation engine relates metrics, anomalies, and events across nodes to form defendable root‑cause hypotheses.
- Full context: Reports and investigations combine statistical summaries, anomaly timelines, alert history, and dependency information.
Capabilities
1) Insights
Generates on‑demand, professional reports (see AI Insights):
- Infrastructure Summary – incident timelines, health, and prioritized actions
- Performance Optimization – bottlenecks, contention, and concrete tuning steps
- Capacity Planning – growth projections and exhaustion dates
- Anomaly Analysis – forensics on unusual behavior and likely causes
Each report includes an executive summary, evidence, and actionable recommendations. Reports are downloadable as PDFs and shareable with your team. You can also schedule reports.
2) Investigations
Ask open‑ended questions (“what changed here?”, “why did X regress?”) and get a researched answer using your telemetry — see the Investigations overview. Launch from the “Troubleshoot with AI” button (captures current scope) or from Insights → New Investigation. Create Custom Investigations and set up Scheduled Investigations.
3) Troubleshooting
- Alert Troubleshooting – one‑click analysis for any alert with a root‑cause hypothesis and supporting signals
- Anomaly Advisor – interactive exploration of how anomalies propagate across systems
- Metric Correlations – focus on the most relevant charts for any time window
See the Troubleshooting overview. From any view, use the Troubleshoot with AI button.
4) Anomaly Detection
Local, unsupervised ML runs on every agent, learning normal behavior and scoring anomalies for all metrics in real time. Anomaly ribbons appear on charts, and historical scores are stored alongside metrics for analysis. See ML Anomaly Detection, configure via ML Configuration, and review methodology in ML Accuracy.
5) MCP (Model Context Protocol)
Connect AI clients to Netdata’s MCP server to bring live observability into natural‑language workflows and optional automation. Options include MCP, Chat with Netdata, and MCP Clients like Claude Desktop, Cursor, VS Code, JetBrains IDEs, Claude Code, Gemini CLI, and the Netdata Web Client.
Usage and credits
- Eligible Spaces receive 10 free AI credits; each Insights report, investigation, or alert troubleshooting run consumes 1 AI credit.
- Additional usage is available via AI Credits. Track usage from Settings → Usage & Billing → AI Credits.
Note
- No model training on your data: information is used only to generate your outputs.
- Despite our best efforts to eliminate inaccuracies, AI responses may sometimes be incorrect, please think carefully before making important changes or decisions.
Do you have any feedback for this page? If so, you can open a new issue on our netdata/learn repository.