Sign in →

Metering Health

Per-metric metering health — event volume, error rate, latency, and Z-score anomaly detection so you catch a broken meter before it breaks billing.

Updated 2026-06-15Suggest edits

Metering Health

Metering Health (Operations → Metering Health) tracks the health of each billable unit's metering, so a meter that suddenly stops, spikes, or starts erroring gets caught before it corrupts billing.

Health states

StatusTrigger
HEALTHYNormal operation
WARNINGDegraded — e.g. >5% error rate or >1000 ms latency
CRITICALStopped, or >20% error rate
NO_DATANo events received

A period selector (1h / 6h / 24h / 7d) re-scopes the view, with a manual Refresh. A health grid shows one card per metric; clicking a card opens a trend side-panel.

Per-metric signals

Each metric reports event count (24h), events in the last hour, error rate, average latency, P95/P99 latency, last-event time, and a recent-counts sparkline.

Anomaly detection

When detected, anomalies appear in an alert strip and table. Each anomaly has a type (SPIKE / DROP), severity (CRITICAL / HIGH / WARNING), sigma deviation, and observed vs expected values.

Metering Health focuses on a single metric's pipeline health. For pipeline-wide throughput and source health, see Ingestion.