Grafana OnCall — Open Source Incident Response and On-Call Management
Manage on-call schedules and incident routing with Grafana OnCall. Integrates natively with Grafana alerting for automated escalations, multi-channel notifications, and team rotation management.
Agent-ready installation
This asset can be installed after choosing the runtime, reviewing the plan, and running the corresponding command.
npx -y tokrepo@latest install 017130a6-393a-11f1-9bc6-00163e2b0d79 --target codex
Run after confirming the plan with a dry run.
What it is
Grafana OnCall is an open-source incident response and on-call management tool. It integrates natively with Grafana alerting to route each alert to whoever is on call when it fires. Features include on-call schedules, escalation chains, multi-channel notifications (Slack, email, phone, SMS), and team rotation management.
Grafana OnCall targets DevOps and SRE teams already using Grafana for monitoring who want on-call management without paying for PagerDuty or Opsgenie.
How it saves time or tokens
Grafana OnCall eliminates the gap between monitoring and incident response. Alerts from Grafana flow directly into OnCall without webhook configuration. Escalation chains ensure that if the primary on-call does not acknowledge, the alert automatically routes to the next person.
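The escalation behavior described above can be illustrated with a small simulation. This is a minimal sketch, not OnCall's implementation: the `Step` class, the `"notify"`/`"wait"` labels, and the `notified_before_ack` function are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Step:
    kind: str                     # "notify" or "wait" (hypothetical labels)
    target: Optional[str] = None  # who to notify, for "notify" steps
    seconds: int = 0              # how long to pause, for "wait" steps


def notified_before_ack(steps: List[Step], ack_after: Optional[int]) -> List[str]:
    """Return the targets notified before the alert is acknowledged.

    ack_after is the number of seconds until someone acknowledges,
    or None if nobody ever does.
    """
    elapsed = 0
    notified: List[str] = []
    for step in steps:
        if ack_after is not None and elapsed >= ack_after:
            break  # acknowledged: stop escalating
        if step.kind == "notify":
            notified.append(step.target)
        elif step.kind == "wait":
            elapsed += step.seconds
    return notified


chain = [
    Step("notify", target="primary"),
    Step("wait", seconds=300),
    Step("notify", target="secondary"),
]

print(notified_before_ack(chain, ack_after=120))   # primary acknowledges in time
print(notified_before_ack(chain, ack_after=None))  # nobody acknowledges
```

With an acknowledgement after two minutes only the primary is paged; with no acknowledgement the chain continues to the secondary once the wait has elapsed.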
Schedule management with overrides and swaps handles the messy reality of PTO and shift changes without manual coordination.
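To make the idea of overrides concrete, here is a rough sketch of how an override can take precedence over a base rotation. It assumes a simple fixed-length rotation; the `Override` class, the `on_call_at` function, and the user names are all hypothetical and do not reflect OnCall's internal data model.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List


@dataclass
class Override:
    start: datetime
    end: datetime
    user: str


def on_call_at(when: datetime, rotation: List[str], rotation_start: datetime,
               shift_length: timedelta, overrides: List[Override]) -> str:
    # An override wins over the base rotation for the window it covers.
    for override in overrides:
        if override.start <= when < override.end:
            return override.user
    # Otherwise count whole shifts since the rotation began and wrap around.
    shifts_elapsed = (when - rotation_start) // shift_length
    return rotation[shifts_elapsed % len(rotation)]


rotation = ["alice", "bob", "carol"]
rotation_start = datetime(2024, 1, 1)
week = timedelta(days=7)

# bob is away for part of his week, so carol covers 10-12 January.
overrides = [Override(datetime(2024, 1, 10), datetime(2024, 1, 12), "carol")]

for day in (3, 9, 11, 16):
    when = datetime(2024, 1, day)
    print(when.date(), on_call_at(when, rotation, rotation_start, week, overrides))
```

The base rotation itself is never edited; the override is a separate record that is consulted first, which is why temporary coverage does not disturb later shifts.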
How to use
- Enable the OnCall plugin in your Grafana instance
- Create on-call schedules with team rotations
- Define escalation chains: who gets notified first, second, and third
- Connect notification channels (Slack, email, phone)
Example
# Terraform configuration for Grafana OnCall

# Schedule that holds the primary on-call rotation
resource "grafana_oncall_schedule" "primary" {
  name      = "Primary On-Call"
  type      = "calendar"
  time_zone = "America/New_York"
}

# Escalation chain that the steps below attach to
resource "grafana_oncall_escalation_chain" "critical" {
  name = "Critical Alerts"
}

# Step 1: notify whoever is on call in the primary schedule
resource "grafana_oncall_escalation" "step1" {
  escalation_chain_id          = grafana_oncall_escalation_chain.critical.id
  type                         = "notify_on_call_from_schedule"
  notify_on_call_from_schedule = grafana_oncall_schedule.primary.id
  position                     = 0
}

# Step 2: wait 5 minutes for an acknowledgement before the next step
resource "grafana_oncall_escalation" "step2" {
  escalation_chain_id = grafana_oncall_escalation_chain.critical.id
  type                = "wait"
  duration            = 300 # seconds
  position            = 1
}
Related on TokRepo
- Monitoring tools -- Monitoring and alerting tools
- DevOps tools -- Infrastructure and operations
Common pitfalls
- OnCall requires Grafana 9.0+ and the OnCall plugin; older Grafana versions are not supported
- Phone and SMS notifications require a Twilio or Grafana Cloud connection; self-hosted setups default to Slack and email
- Escalation chains without a final catch-all step can result in unacknowledged alerts during off-hours
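The last pitfall can be caught with a simple pre-deployment check. The following is a minimal sketch that assumes a chain is described as a list of dictionaries with `type` and `position` keys, mirroring the Terraform attributes in the example; the `chain_warnings` function and its heuristics are hypothetical, and a trailing `wait` step is used only as a rough proxy for "no final catch-all".

```python
from typing import Any, Dict, List


def chain_warnings(steps: List[Dict[str, Any]]) -> List[str]:
    """Return human-readable warnings for a chain that may drop alerts."""
    if not steps:
        return ["chain has no steps"]
    warnings: List[str] = []
    ordered = sorted(steps, key=lambda step: step["position"])
    # A chain that ends on a pause has nobody left to notify afterwards.
    if ordered[-1]["type"] == "wait":
        warnings.append("chain ends on a wait step")
    # A chain with no notification step never reaches a person.
    if not any(str(step["type"]).startswith("notify") for step in ordered):
        warnings.append("chain never notifies anyone")
    return warnings


# Same shape as the two-step example: notify the schedule, then wait.
two_steps = [
    {"type": "notify_on_call_from_schedule", "position": 0},
    {"type": "wait", "position": 1},
]

# Adding a further notification step after the wait clears the warning.
three_steps = two_steps + [
    {"type": "notify_on_call_from_schedule", "position": 2},
]

print(chain_warnings(two_steps))
print(chain_warnings(three_steps))
```

A check like this could run in CI against the planned configuration, so that a chain ending in a pause is flagged before it reaches production.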
Frequently asked questions
How does Grafana OnCall compare to PagerDuty?
PagerDuty is a mature, managed service with extensive integrations. Grafana OnCall is free, open source, and integrates natively with Grafana. OnCall covers core on-call needs; PagerDuty offers more advanced features like event intelligence and stakeholder communication.
Can OnCall receive alerts from systems other than Grafana?
Yes. OnCall provides webhook endpoints that accept alerts from any monitoring system. You can route alerts from Prometheus Alertmanager, Datadog, or custom systems via HTTP webhooks.
Does OnCall support multiple teams?
Yes. OnCall supports multiple schedules, escalation chains, and teams. Each team can have its own rotation, notification preferences, and escalation policies.
How do shift swaps and overrides work?
Team members can request shift swaps through the OnCall UI. The swap is approved by the other party and reflected in the schedule immediately. Overrides let managers assign temporary coverage without changing the rotation.
Is there a managed version of OnCall?
Yes. Grafana Cloud includes OnCall as a managed service with phone and SMS capabilities built in. The open-source version can be self-hosted and provides the same core features minus managed phone/SMS.
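As noted above, alerts from other monitoring systems reach OnCall over HTTP webhooks. The sketch below shows how a custom system might build such a request with the Python standard library. The URL, the token placeholder, and the payload field names (`title`, `message`, `state`) are assumptions for illustration; check the integration page in your own OnCall instance for the actual endpoint and expected fields.

```python
import json
import urllib.request


def build_alert_request(webhook_url: str, title: str, message: str,
                        state: str = "alerting") -> urllib.request.Request:
    """Build (but do not send) a JSON POST request carrying one alert."""
    # Field names here are placeholders, not a confirmed OnCall schema.
    payload = {"title": title, "message": message, "state": state}
    return urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical endpoint; replace with the URL shown for your integration.
request = build_alert_request(
    "https://oncall.example.com/integrations/v1/webhook/TOKEN/",
    title="Disk usage above 90%",
    message="Host db-1 is running out of space.",
)

print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))

# To actually deliver the alert:
# urllib.request.urlopen(request, timeout=10)
```

Building the request separately from sending it keeps the example safe to run and makes the payload easy to inspect before pointing it at a real endpoint.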
Related assets
Grafana — Open Source Data Visualization & Observability
Grafana is the leading open-source platform for monitoring and observability. Visualize metrics, logs, and traces from Prometheus, Loki, Elasticsearch, and 100+ data sources.
Plane — Open-Source AI Project Management
Open-source Jira/Linear alternative with AI-powered pages. Issues, sprints, modules, roadmaps, and real-time analytics. Self-hostable via Docker. AGPL-3.0, 47,500+ stars.
Huly — All-in-One Open Source Project Management Platform
Huly is an open-source alternative to Linear, Jira, Slack, and Notion. Project tracking, team chat, knowledge base, and HR tools in a single unified platform.
oncall-guide — Incident Response Subagent
Open-source Claude Code subagent for incident response — walks the oncall checklist autonomously: deploys, errors, rollback. Inspired by Boris Cherny.