Job Summary :
We are seeking an experienced Manager to lead complex, cross-functional initiatives across our DevOps in collaboration with platform engineering. This role will be instrumental in aligning operational priorities with engineering and business goals, driving initiatives related to infrastructure scalability, system reliability, incident response, automation, and cloud operations. You will be responsible for managing program delivery, establishing repeatable processes, and ensuring high visibility and accountability for all infrastructure and reliability programs.
Key Responsibilities :
- Lead and oversee end-to-end program delivery across multiple complex initiatives within Devops in collaboration with platform engineering.
- Drive planning, execution, and delivery of strategic programs supporting DevOps and platform functions, including reliability, observability & automation.
- Managed and guided an automation team of over 8 professionals in DevOps practices.
- Act as a strategic partner to engineering and product leadership to define program goals, scope, timelines, and success metrics.
- Coordinate efforts across engineering, product, security, compliance, and operations teams.
- Track and manage program risks, issues, and dependencies; ensure timely mitigation and escalation where necessary.
- Ensure alignment of engineering execution with product roadmaps and business priorities.
- Translate technical complexities and constraints into actionable program plans and communicate them effectively to stakeholders.
- Drive transparency through clear reporting, dashboards, and regular program reviews.
- Foster a culture of continuous improvement, agility, and cross-team collaboration.
- Establish and evolve program management best practices, tools, templates, and processes to support efficient delivery and communication.
- Manage programs involving CI / CD infrastructure, platform migrations, infrastructure-as-code, monitoring / logging platforms, and disaster recovery.
- Develop and manage roadmaps for technical operations initiatives with clear milestones and KPIs.
- Champion DevOps / SRE best practices and help drive cultural change around service ownership, operational excellence, and continuous improvement.
Qualifications : Required :
Bachelor's degree in Computer Science, Engineering, or related technical field.7+ years of program or project management experience, with 5+ years managing infrastructure, SRE, or DevOps-related programs.Strong understanding of SRE principles (SLIs, SLOs, error budgets) and DevOps practices (CI / CD, automation, infrastructure as code).Experience with cloud platforms (AWS, GCP, or Azure), Kubernetes, Terraform, monitoring / observability tools (Prometheus, Grafana, ELK, Datadog, etc.).Strong experience with Agile / Scrum or hybrid delivery methodologies.Proven ability to lead complex programs with multiple cross-functional stakeholders.Familiarity with incident management, operational playbooks, runbooks, and on-call practices.Preferred :
Hands-on engineering or DevOps / SRE experience earlier in career.Certification(s) : PMP, AWS / GCP certifications, or SRE-focused training.Experience working in high-availability, high-scale production environments (e.g., SaaS, FinTech, or eCommerce).Key Competencies :
Strategic mindset with deep operational awareness.Excellent communication and stakeholder management skills.Ability to simplify complex technical concepts for executive reporting.Strong leadership, people development, and cross-functional influencing skills.Bias for action and a relentless focus on continuous improvement.(ref : hirist.tech)