Talent.com
Epam
Lead Systems Engineer (DevOps & SRE)Epam • Chennai, India
Lead Systems Engineer (DevOps & SRE)

Lead Systems Engineer (DevOps & SRE)

Epam • Chennai, India
30+ days ago
Job description

Description

Join our organization as a Lead Systems Engineer (DevOps & SRE) and play a crucial role in ensuring the reliability, scalability, capacity planning, and performance of our infrastructure and applications.

The ideal candidate will have a strong background in software engineering, system administration, containerization, and cloud technologies, and will lead the design, development, and maintenance of scalable and reliable infrastructure.

You will also be responsible for implementing and managing CI/CD pipelines, monitoring system performance and reliability, developing and maintaining automation tools, ensuring security and compliance, mentoring and guiding junior SREs and DevOps engineers, and staying up-to-date with the latest industry trends and technologies.

#LI-DNI

Technologies

  • CI/CD, Jenkins, Docker, Kubernetes, Terraform, Ansible, Python, Prometheus, Grafana, ELK stack, Splunk, Dynatrace, Datadog or similar, SLI, SLO, SLA and Error Budget concepts

Responsibilities

  • Lead the design, development, and maintenance of scalable and reliable infrastructure
  • Implement and manage CI/CD pipelines to ensure efficient and smooth software releases
  • Monitor system performance and reliability, proactively identifying and resolving issues
  • Develop and maintain automation tools to streamline infrastructure management and deployment processes
  • Collaborate with development teams to ensure best practices for software development, deployment, and operations
  • Ensure security and compliance across all infrastructure and operations
  • Mentor and guide junior SREs and DevOps engineers, fostering a culture of collaboration and continuous learning
  • Conduct root cause analysis of system failures and implement solutions to prevent recurrence
  • Optimize resource utilization to ensure cost-effective operations
  • Stay up-to-date with the latest industry trends and technologies, integrating them into our processes where appropriate

Requirements

  • 8+ years of experience in a DevOps/SRE role
  • Strong experience with cloud platforms (AWS, GCP, Azure)
  • Proficiency in infrastructure as code (IaC) tools (Terraform, CloudFormation, etc.)
  • Extensive experience with containerization and orchestration (Docker, Kubernetes)
  • Strong knowledge of CI/CD tools (Jenkins, GitLab CI, CircleCI, etc.)
  • Proficiency in scripting languages (Python, Bash, etc.)
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK stack, etc.)
  • Ability to participate in capacity planning and scalability assessments to support business growth and requirements
  • Well aware of SLI, SLO, SLA and Error Budget concepts and their implementations and provide on-call support and participate in incident management & response activities as needed
  • Solid understanding of networking and security principles
  • Excellent problem-solving skills and the ability to work under pressure
  • Strong communication and collaboration skills
  • B2+ English level proficiency

We offer

  • Opportunity to work on technical challenges that may impact across geographies
  • Vast opportunities for self-development: online university, knowledge sharing opportunities globally, learning opportunities through external certifications
  • Opportunity to share your ideas on international platforms
  • Sponsored Tech Talks & Hackathons
  • Unlimited access to LinkedIn learning solutions
  • Possibility to relocate to any EPAM office for short and long-term projects
  • Focused individual development
  • Benefit package: Health benefits Retirement benefits Paid time off Flexible benefits
  • Forums to explore beyond work passion (CSR, photography, painting, sports, etc.)
Create a job alert for this search

Lead Systems Engineer (DevOps & SRE) • Chennai, India

Similar jobs

Infra Devops Engg

ScaleneWorksChennai, Tamil Nadu, India
Quick Apply

Infra DevOps Infrastructure Administrator Level 11, 10.Redhat Administration preferable.Good hands-on knowledge of Source Code Management (Version Control System) tools like Git, GitLab.Managing an... Show more

Lead Engineer / Sr. Lead Engineer - System Intelligence

Mahindra and Mahindra Limited [Automotive and Farm Equipment Business]chennai, tamil nadu, India

Analyze complex real-time vehicle data from telematics, ECU signals, EOL tests, and dealer management systems within UAT and GPH1 data streams to identify patterns, anomalies, and trends.Interpret ... Show more

 • Promoted

Sr Kubernetes Admin

ScaleneWorksChennai, Tamil Nadu, India
Quick Apply

Job Title: Sr Kubernetes Admin.Position: Senior Systems Engineer.Category: Software Development/ Engineering.Shift: Rotational Shift (Primarily - 7PM-4AM IST) - US Hours.Main location: Bangalore, C... Show more

Cloud Engineer / Solutions Architect – DevOps & IT

Water Weaver Solutionschennai, tamil nadu, India

Location: Chennai, In Office  .We are looking for a hands-on Cloud Engineer / Solutions Architect with 3-5 years of experience to own our cloud infrastructure, CI/CD pipelines, and overall IT envir... Show more

 • Promoted

DevOps/Site Reliability Engineer

Techtiniumchennai, tamil nadu, India

Techtinium Technologies is looking for a DevOps/Site Reliability Engineer who can ensure that a complex and growing cloud infrastructure is healthy, monitored, automated and is designed to scale.Yo... Show more

 • Promoted

Site Reliability Engineer

Karixchennai, tamil nadu, India

JD – Lead Site Reliability Engineer (SRE).We are looking for a Lead Site Reliability Engineer (SRE) with strong experience in managing production systems, distributed architectures, and cloud-nativ... Show more

 • Promoted

Aws cloud infrastructure lead

Sarbajira Software Pvt Ltdchennai, tamil nadu, India

Sarbajira Software is a trusted IT consulting company specializing in delivering tailored IT solutions for businesses of all sizes.With a team of experienced and certified professionals, we are com... Show more

 • Promoted

IT Systems Engineer

EndaceChennai, TN, IN
Quick Apply

Do you want to work for a world leader in network monitoring technology?.We are looking for a proactive IT Systems Engineer to provide efficient provisioning, maintenance and optimisation of infras... Show more

Cloud DevSecOps

Saaki Argus & Averil ConsultingChennai, Tamil Nadu, India
Quick Apply

A global Engineering, Research & Development (ERD) Services Company.Bangalore/Mysore (5 days Work from Office).DevOps, SRE, Observability, FinOPS, SecOps, Kubernetes, Monitoring and managed ser... Show more

L3/L4 Infra Support Engineer

Consolidated Analyticschennai, tamil nadu, India

Job description: L3/L4 Infra Support Engineer - Windows/Azure.Systems Administrator to join our Systems Team to help design, implement, maintain, and support our growing server infrastructure in th... Show more

 • Promoted

Cloud Lead

ScaleneWorksChennai, Tamil Nadu, India
Quick Apply

Position Name : Cloud Architect.Position Title : Manager Cloud Engineering.A Cloud Architect is responsible for designing, managing, and overseeing the cloud computing strategy of an organization.T... Show more

Senior DevOps Engineer

HirschChennai, TN, IN
Quick Apply

AWS, Terraform, CI/CD pipelines, and hosting applications on both Windows and Linux environments.The role includes building cloud infrastructure, automating deployments, and managing release proces... Show more

DevOps Engineer - Mid Level

Centific Global Technologies India Private LimitedIndia Office - Chennai

Senior Azure DevOps Engineer We're looking for a skilled Senior Azure DevOps Engineer with expertise in setting up CI/CD pipelines, handling deployment issues, automating processes in Azure DevOps,... Show more

Lead Site Reliability Engineer

Concentrixchennai, tamil nadu, in

As a Lead Site Reliability Engineer, you will own the reliability and availability of our production systems.You will champion SRE principles across engineering teams — defining SLOs, managing erro... Show more

 • Promoted

T24 Transact SRE Engineer (With min 5 years exp in T24 Transact SRE) Max notice period 45 days

Luxofttamil nadu, chennai, India

Note: This role is only for candidates who are having experience in T24 Temenos application SRE.Role : T24 Transact SRE Engineer Exp level: 5 to 9 years in T24 Transact SRE Location : Chennai, Bang... Show more

 • Promoted

Forward Deployed SRE

LocalOps Incchennai, tamil nadu, India

As a forward deployed DevOps engineer at LocalOps, you will be the face of our cloud operations for enterprise customers.You'll work directly with customer engineering teams during EST business hou... Show more

 • Promoted

Site Reliability Engineer (SRE) – Azure Virtual Desktop (AVD)

Insight Globalchennai, tamil nadu, in

We are looking for a highly skilled.Site Reliability Engineer (SRE).Azure compute, networking, storage, and identity.SRE / Azure Infrastructure roles.Azure Virtual Desktop (AVD/WVD).FSLogix (profil... Show more

 • Promoted • New!

BeyondTrust Engineer – SME/Level 3 Engineer

London Strategychennai, tamil nadu, in

We are looking for a experienced BeyondTrust SME / level 3 engineer to join our team and support the delivery of critical infrastructure and security programs.As a BeyondTrust Engineer, you will be... Show more

 • Promoted

AWS Cloud Infrastructure Lead

Sarbajira Software Pvt Ltdchennai, tamil nadu, India

Sarbajira Software is a trusted IT consulting company specializing in delivering tailored IT solutions for businesses of all sizes.With a team of experienced and certified professionals, we are com... Show more

 • Promoted

DevOps Lead Engineer (Contract)

Saaki Argus & Averil ConsultingChennai, Tamil Nadu, India
Quick Apply

CI/CD strategy, cloud automation, and deployment excellence for Azure-based enterprise platforms.This role requires hands-on leadership, strong automation skills, and deep knowledge of Azure servic... Show more