Releases · m-lab/prometheus-support

Upgrade of github-receiver from v0.2 to v0.3.

Enable alert routing between ops-tracker and dev-tracker on a per-alert basis. All alerts now include a "repo" label.
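
As a rough illustration of per-alert routing, the sketch below chooses a tracker from an alert's "repo" label. It is not the github-receiver implementation. It assumes that the label's value names the target tracker, and the fallback to ops-tracker is a hypothetical default, since these notes do not say what happens when the label is absent or unrecognized.

```python
from typing import Mapping

# Tracker names taken from the release notes.
KNOWN_REPOS = {"ops-tracker", "dev-tracker"}

# Hypothetical default: the notes do not specify a fallback.
DEFAULT_REPO = "ops-tracker"


def pick_repo(labels: Mapping[str, str]) -> str:
    """Return the tracker that should receive an alert with these labels."""
    repo = labels.get("repo", DEFAULT_REPO)
    # Unknown values fall back to the default rather than being dropped.
    if repo not in KNOWN_REPOS:
        return DEFAULT_REPO
    return repo
```

Because the decision depends only on the alert's own labels, each alert can be sent to a different tracker without changing the receiver.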

Multiple updates and one new dashboard:

NDT_GlobalTestRate.json -- includes a three-week overlay to visually contrast past performance.
Ops_PlatformOverview.json -- restricts some queries to only platform instances of the node exporter.
Ops_SwitchOverview.json -- updates the selection query that identifies sites, so all sites are available even if they are currently offline.
Pipeline_Embargo.json -- a new dashboard for visually comparing scraper output to embargo input and embargo output to etl input.

New alerts:

SnmpScrapingDownAtSite -- corrects an earlier alert that would only fire if all SNMP metrics from all nodes were missing. This new alert fires when a single site stops collecting SNMP metrics.
VdlimitMetricsMissingForNode -- an integrity check to guarantee that all metrics used by mlab-ns are available.
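
The difference between the earlier SNMP alert and SnmpScrapingDownAtSite can be sketched as a global check versus a per-site check. This is a simplified model, not the actual alert expressions: the function names and the site names in the usage example are hypothetical, and each site is modeled as either reporting SNMP metrics or not.

```python
from typing import Iterable, List


def all_snmp_missing(reporting_sites: Iterable[str]) -> bool:
    """Earlier behavior (as modeled here): fire only when no site reports."""
    return len(set(reporting_sites)) == 0


def sites_missing_snmp(expected_sites: Iterable[str],
                       reporting_sites: Iterable[str]) -> List[str]:
    """New behavior (as modeled here): list each expected site that is not reporting."""
    return sorted(set(expected_sites) - set(reporting_sites))


# Hypothetical site names, for illustration only.
expected = ["abc01", "def02", "ghi03"]
reporting = ["abc01", "ghi03"]

old_fires = all_snmp_missing(reporting)            # one site down is not enough
new_fires = sites_missing_snmp(expected, reporting)  # one entry per silent site
```

With one of three sites silent, the global check stays quiet while the per-site check reports the single missing site, which is the gap the new alert is meant to close.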
