Alerts
Configure alerts for QNSP platform events.
Alerts
Configure alerts for QNSP platform events.
Scope: The YAML fragments below are illustrative patterns. They are not a guarantee that each rule is deployed in production. For cloud-portal bootstrap outages (
/api/session/bootstrap503), seedocs/operations/portal-bootstrap-observability.mdin the monorepo — ai-orchestrator auto-remediation and optionalALERT_WEBHOOK_URLare not wired to that path unless you add explicit alarms or integrations.
Alert types
Threshold alerts
Trigger when metric crosses threshold:
- alert: HighErrorRate
expr: rate(qnsp_errors_total[5m]) > 0.01
for: 5m
labels:
severity: critical
Anomaly alerts
Trigger on unusual patterns:
- alert: UnusualTraffic
expr: |
qnsp_requests_total >
avg_over_time(qnsp_requests_total[7d]) * 2
Availability alerts
Trigger on service unavailability:
- alert: ServiceDown
expr: up{job="qnsp"} == 0
for: 1m
Built-in alerts
| Alert | Condition | Severity |
|---|---|---|
| High error rate | >1% errors | critical |
| High latency | p99 > 1s | warning |
| Rate limit spike | >100/min | warning |
| Auth failures | >10/min | warning |
| Service unhealthy | health != ok | critical |
Alert destinations
Webhook
{
"type": "webhook",
"url": "https://alerts.example.com/webhook",
"headers": {"Authorization": "Bearer ..."}
}
{
"type": "email",
"recipients": ["ops@example.com"]
}
PagerDuty
{
"type": "pagerduty",
"routingKey": "..."
}
Slack
{
"type": "slack",
"webhookUrl": "https://hooks.slack.com/..."
}
Alert management
Alert management APIs are not shipped in this repo.
PDP / edge enforcement (deployed alarms)
When enable_pdp_service and enable_pdp_edge_enforcement_observability are true, Terraform provisions CloudWatch alarms described in docs/operations/pdp-edge-enforcement-observability.md (PDP snapshot stale and materialization degraded via log metric filters on the PDP log group). Edge edge_enforcement_degraded_privileged_denied_total is on the dashboard (alarms cannot use SEARCH on that signal in CloudWatch). Wire SNS with pdp_edge_enforcement_observability_alarm_sns_topic_arns in infra/terraform/variables.tf.