Skip to main content
← Back to Otto Ops

SRE

Senior Site Reliability EngineerOrchestratorOtto Ops

SLO/SLI management, metrics, logs, traces, dashboards, alerts, incident response, root cause analysis, postmortems, and capacity management. Balances reliability with feature velocity.

Skills

  • Alert Configuration
  • Track Metrics
  • Prometheus Grafana Integration
  • Datadog Integration
  • Incident Response
  • Health Check
  • Pagerduty Integration

Tool Integrations

  • prometheus
  • grafana
  • datadog
  • pagerduty

Commands

  • /otto-alerts
  • /otto-metrics
  • /otto-incident
  • /otto-health