Netdata AI

Netdata AI is a set of analysis and troubleshooting capabilities built into Netdata Cloud. It turns high‑fidelity telemetry into explanations, timelines, and recommendations so teams resolve issues faster and document decisions with confidence.

Netdata AI overview

Why it’s accurate and powerful

Per‑second granularity: Every Netdata Agent collects metrics at 1‑second resolution, preserving short‑lived spikes and transient behavior.
On‑device ML: Unsupervised models run on every agent, continuously scoring anomalies for every metric with zero configuration.
Evidence‑based correlation: Netdata’s correlation engine relates metrics, anomalies, and events across nodes to form defendable root‑cause hypotheses.
Full context: Reports and investigations combine statistical summaries, anomaly timelines, alert history, and dependency information.

Capabilities

1) Conversations

Real-Time Conversations provide a live, interactive dialogue with Netdata AI. Ask rapid-fire questions, explore hypotheses, and get instant visualizations (Live Exhibits) embedded directly in the chat. Use Conversations for the exploratory phase of troubleshooting, then pivot to Investigations for comprehensive reports.

2) Insights

Generates on‑demand, professional reports (see AI Insights):

Infrastructure Summary – incident timelines, health, and prioritized actions
Performance Optimization – bottlenecks, contention, and concrete tuning steps
Capacity Planning – growth projections and exhaustion dates
Anomaly Analysis – forensics on unusual behavior and likely causes

Each report includes an executive summary, evidence, and actionable recommendations. Reports are downloadable as PDFs and shareable with your team. You can also schedule reports.

3) Investigations

Ask open‑ended questions ("what changed here?", "why did X regress?") and get a researched answer using your telemetry — see the Investigations overview. Launch from Insights → New Investigation. Create Custom Investigations and set up Scheduled Investigations.

4) Troubleshooting

Alert Troubleshooting – one‑click analysis for any alert with a root‑cause hypothesis and supporting signals
Anomaly Advisor – interactive exploration of how anomalies propagate across systems
Metric Correlations – focus on the most relevant charts for any time window

See the Troubleshooting overview.

5) Alerts Automation

Alerts Automation uses AI to suggest, generate, and test alert configurations. Describe what you want to monitor in plain English, and the AI generates the complete alert definition, tests it against historical data to show how it would have triggered, and lets you deploy it to your nodes — no need to learn alert configuration syntax or manually tune thresholds.

6) Anomaly Detection

Local, unsupervised ML runs on every agent, learning normal behavior and scoring anomalies for all metrics in real time. Anomaly ribbons appear on charts, and historical scores are stored alongside metrics for analysis. See ML Anomaly Detection, configure via ML Configuration, and review methodology in ML Accuracy.

7) MCP (Model Context Protocol)

Connect AI clients to Netdata’s MCP server to bring live observability into natural‑language workflows and optional automation. Options include MCP, Chat with Netdata, and MCP Clients like Claude Desktop, Cursor, VS Code, JetBrains IDEs, Claude Code, Gemini CLI, and the Netdata Web Client.

Usage and credits

Eligible Spaces receive 10 free AI credits; each Insights report, investigation, or alert troubleshooting run consumes 1 AI credit.
Additional usage is available via AI Credits. Track usage from Settings → Usage & Billing → AI Credits.

Note

No model training on your data: information is used only to generate your outputs.
Despite our best efforts to eliminate inaccuracies, AI responses may sometimes be incorrect, please think carefully before making important changes or decisions.

Do you have any feedback for this page? If so, you can open a new issue on our netdata/learn repository.

Why it’s accurate and powerful​

Capabilities​

1) Conversations​

2) Insights​

3) Investigations​

4) Troubleshooting​

5) Alerts Automation​

6) Anomaly Detection​

7) MCP (Model Context Protocol)​

Usage and credits​

Note​