DevOps / SRE Engineering Leader Job Description
Role Summary :
We are looking for a highly accomplished DevOps / SRE Engineering Leader to oversee multi-cloud infrastructure, manage critical Kubernetes environments, lead a high-performing engineering team, and ensure enterprise-grade reliability, automation, and security across global cloud platforms. You will be responsible for strategic DevOps and SRE initiatives, platform stability, security compliance, and FinOps optimization in a dynamic, fast-paced technology environment.
Key Responsibilities :
- Lead DevOps and SRE strategy and execution across hybrid and multi-cloud platforms (AWS, Azure, Oracle, GCP).
- Manage critical SaaS infrastructure with 99.99% uptime requirements across global regions.
- Design and optimize CI / CD pipelines using Jenkins, GitLab CI, Azure DevOps, Helm, and GitOps.
- Implement Infrastructure as Code using Terraform, Ansible, and Python.
- Oversee container orchestration with Kubernetes (AKS / EKS), Docker, and Rancher.
- Define and enforce cloud security, compliance (SOC 2, PCI DSS, ISO 27001), and governance standards.
- Drive cloud cost optimization through FinOps best practices and tooling (e.g., Cast AI, Azure Cost Management).
- Lead observability, monitoring, and incident response using tools such as Prometheus, Grafana, Datadog, ELK Stack, and
Azure Monitor.
Manage stakeholder communication, project delivery, and resource planning aligned with business OKRs.Mentor and scale distributed engineering teams fostering a culture of technical excellence and accountability.Deliver high-scale platform modernization, cloud migration, and automation initiatives.Required Skills and Technologies :
Cloud Platforms : AWS, Azure, Oracle Cloud, GCP.CI / CD & DevOps : Jenkins, GitLab, Azure DevOps, Helm, GitOps.IaC Tools : Terraform, Ansible, Puppet, Shell scripting, Python.Containers & Orchestration : Docker, Kubernetes (AKS / EKS), Rancher, OpenShift.Monitoring & Observability : Grafana, Prometheus, ELK, Datadog, New Relic.Security & Compliance : SOC 2, PCI DSS, ISO 27001, IAM, PIM.Network & Infrastructure : Cisco, FortiGate, VMware, KVM, HPE, Veritas NetBackup.Project & Team Management : Agile, Scrum, Jira, Confluence, ITSM, ITIL.Qualifications :
Bachelor's or Master's degree in Computer Science, IT, Telecommunications, or a related field.Preferred Certifications :
Certified Kubernetes Administrator (CKA).AWS / Azure Certified Solutions Architect.(ISC) Certified in Cybersecurity (CC).ITIL, Cisco CCNP or Specialist-level certifications.Preferred Experience :
10+ years of experience in SRE / DevOps with at least 5 years in technical leadership roles.Proven success in large-scale cloud architecture and production operations.Experience working with global teams and international client stakeholders.Hands-on expertise in cloud migration, application modernization, and platform security.Soft Skills :
Strategic thinking & leadership.Strong communication & stakeholder management.Analytical problem-solving.Resilience and adaptability in dynamic environments.Commitment to continuous learning and innovation.(ref : hirist.tech)