Lead Systems Engineer (DevOps & SRE)

Epam • Chennai, India
30+ days ago
Description

Join our organization as a Lead Systems Engineer (DevOps & SRE) and play a crucial role in ensuring the reliability, scalability, capacity planning, and performance of our infrastructure and applications.

The ideal candidate will have a strong background in software engineering, system administration, containerization, and cloud technologies, and will lead the design, development, and maintenance of scalable and reliable infrastructure.

You will also be responsible for implementing and managing CI/CD pipelines, monitoring system performance and reliability, developing and maintaining automation tools, ensuring security and compliance, mentoring and guiding junior SREs and DevOps engineers, and staying up to date with the latest industry trends and technologies.

Technologies

  • CI/CD, Jenkins, Docker, Kubernetes, Terraform, Ansible, Python, Prometheus, Grafana, ELK stack, Splunk, Dynatrace, Datadog, or similar tools
  • SLI, SLO, SLA, and error budget concepts

Responsibilities

  • Lead the design, development, and maintenance of scalable and reliable infrastructure
  • Implement and manage CI/CD pipelines to ensure efficient and smooth software releases
  • Monitor system performance and reliability, proactively identifying and resolving issues
  • Develop and maintain automation tools to streamline infrastructure management and deployment processes
  • Collaborate with development teams to ensure best practices for software development, deployment, and operations
  • Ensure security and compliance across all infrastructure and operations
  • Mentor and guide junior SREs and DevOps engineers, fostering a culture of collaboration and continuous learning
  • Conduct root cause analysis of system failures and implement solutions to prevent recurrence
  • Optimize resource utilization to ensure cost-effective operations
  • Stay up to date with the latest industry trends and technologies, integrating them into our processes where appropriate

Requirements

  • 8+ years of experience in a DevOps/SRE role
  • Strong experience with cloud platforms (AWS, GCP, Azure)
  • Proficiency in infrastructure as code (IaC) tools (Terraform, CloudFormation, etc.)
  • Extensive experience with containerization and orchestration (Docker, Kubernetes)
  • Strong knowledge of CI/CD tools (Jenkins, GitLab CI, CircleCI, etc.)
  • Proficiency in scripting languages (Python, Bash, etc.)
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK stack, etc.)
  • Ability to participate in capacity planning and scalability assessments to support business growth and requirements
  • Strong understanding of SLI, SLO, SLA, and error budget concepts and their implementation
  • Willingness to provide on-call support and participate in incident management and response activities as needed
  • Solid understanding of networking and security principles
  • Excellent problem-solving skills and the ability to work under pressure
  • Strong communication and collaboration skills
  • English proficiency at B2 level or higher

We offer

  • Opportunity to work on technical challenges that may have an impact across geographies
  • Vast opportunities for self-development: online university, knowledge sharing opportunities globally, learning opportunities through external certifications
  • Opportunity to share your ideas on international platforms
  • Sponsored Tech Talks & Hackathons
  • Unlimited access to LinkedIn learning solutions
  • Possibility to relocate to any EPAM office for short and long-term projects
  • Focused individual development
  • Benefit package: health benefits, retirement benefits, paid time off, flexible benefits
  • Forums to explore passions beyond work (CSR, photography, painting, sports, etc.)