Description
GSPANN is hiring a Program Manager with expertise in managing infrastructure to lead large-scale observability and infrastructure initiatives. The role focuses on driving end-to-end program delivery, automation, and reliability across CI / CD and DevOps environments.
Role and Responsibilities
- Lead and mentor a high-performing team of observability engineers in India, fostering collaboration, innovation, and accountability.
- Manage the end-to-end delivery of observability platforms to consistently meet Service-Level Agreements (SLAs) and business objectives.
- Define and execute the observability roadmap covering Application Performance Monitoring (APM), infrastructure monitoring, logging, distributed tracing, and synthetic monitoring.
- Implement proactive monitoring and automation practices to reduce Mean Time to Recovery (MTTR) and improve system reliability.
- Collaborate with DevOps, Site Reliability Engineering (SRE), application, and infrastructure teams to integrate observability into Continuous Integration / Continuous Deployment (CI / CD) pipelines.
- Partner with US-based stakeholders to gather requirements, prioritize initiatives, and deliver regular program updates.
- Oversee vendor management, licensing, and platform optimization for tools such as Datadog, New Relic, Dynatrace, or equivalent solutions.
- Establish governance frameworks, Key Performance Indicators (KPIs), and dashboards to track platform adoption, performance, and business impact.
- Drive best practices in incident management, Root Cause Analysis (RCA), anomaly detection, and Artificial Intelligence for IT Operations (AIOps).
Skills and Experience
- At least 10 years of experience in engineering program management, with a strong focus on observability, infrastructure, or DevOps.
- Proven success managing large-scale programs in observability or infrastructure domains.
- Deep understanding of modern observability tools, frameworks, and practices.
- Excellent leadership, communication, and stakeholder management capabilities.
- Experience managing cross-functional global teams and large-scale initiatives.
- Familiarity with CI / CD, DevOps, and SRE principles and methodologies.