Job Title : Grafana Enterprise Engineer / Observability Engineer
Location : Hyderabad (Office-based)
Experience : 5–10 years
Employment Type : Full-time
Overview
We are looking for a highly skilled Grafana Enterprise Engineer with strong experience in observability, monitoring, performance optimization, and dashboarding across large-scale distributed systems. The ideal candidate will have deep hands-on expertise with Grafana Enterprise, Prometheus, Loki, Tempo, and other observability tools.
Key Responsibilities
Grafana Enterprise Administration
Install, configure, and manage Grafana Enterprise environments (on-prem / cloud).
Manage users, roles, permissions, and Grafana Enterprise features such as :
Enterprise plugins
Reporting
SSO integrations
Alerting & Incident management
Optimize performance of Grafana back-end services.
Observability Stack Management
Deploy, manage, and scale :
Prometheus / Mimir
Loki (log aggregation)
Tempo (tracing)
Alertmanager
Develop scraping strategies, retention policies, sharding, and federation setups.
Integrate data sources including InfluxDB, Elasticsearch, CloudWatch, Azure Monitor, and others.
Dashboards & Alerts
Architect and create advanced Grafana dashboards with templating, variables, and drill-down capabilities.
Build actionable alerting rules, automate alert routing / notifications, and reduce noise.
Work with teams to define SLIs / SLOs, performance metrics, and monitoring standards.
Automation & SRE Practices
Implement automation using Terraform, Helm, Ansible, or similar tools.
Develop monitoring-as-code templates for scalable deployments.
Participate in SRE practices including :
Incident response
Root cause analysis (RCA)
Performance tuning
Capacity planning
Collaboration
Work closely with application, DevOps, and cloud teams to onboard services into the monitoring ecosystem.
Train internal teams on dashboard usage, alerting, and observability best practices.
Required Skills
3+ years hands-on Grafana Enterprise experience.
Strong expertise in Prometheus, Loki, Tempo, or similar observability tools.
Strong skills in Dashboards, Alerting, Metrics, Logs.
Experience building and supporting high-availability observability platforms.
Strong Linux and scripting skills (Shell, Python).
Experience with Docker, Kubernetes, CI / CD tools.
Knowledge of cloud platforms : AWS / Azure / GCP.
Experience with SSO, LDAP, OAuth, or SAML integrations.
Preferred Skills
Experience with Grafana Mimir, Grafana Cloud, or Enterprise Metrics.
Exposure to log pipelines like Promtail, Fluentd, Fluent Bit, Vector.
Infrastructure-as-Code (Terraform) experience.
Certification in Cloud or Observability tools.
Enterprise Engineer • Shimoga, IN