Talent.com
Lead Systems Engineer (DevOps & SRE)
Lead Systems Engineer (DevOps & SRE)Epam • Chennai, India
Lead Systems Engineer (DevOps & SRE)

Lead Systems Engineer (DevOps & SRE)

Epam • Chennai, India
30+ days ago
Job description

Description

Join our organization as a Lead Systems Engineer (DevOps & SRE) and play a crucial role in ensuring the reliability, scalability, capacity planning, and performance of our infrastructure and applications.

The ideal candidate will have a strong background in software engineering, system administration, containerization, and cloud technologies, and will lead the design, development, and maintenance of scalable and reliable infrastructure.

You will also be responsible for implementing and managing CI / CD pipelines, monitoring system performance and reliability, developing and maintaining automation tools, ensuring security and compliance, mentoring and guiding junior SREs and DevOps engineers, and staying up-to-date with the latest industry trends and technologies.

#LI-DNI

Technologies

  • CI / CD, Jenkins, Docker, Kubernetes, Terraform, Ansible, Python, Prometheus, Grafana, ELK stack, Splunk, Dynatrace, Datadog or similar, SLI, SLO, SLA and Error Budget concepts

Responsibilities

  • Lead the design, development, and maintenance of scalable and reliable infrastructure
  • Implement and manage CI / CD pipelines to ensure efficient and smooth software releases
  • Monitor system performance and reliability, proactively identifying and resolving issues
  • Develop and maintain automation tools to streamline infrastructure management and deployment processes
  • Collaborate with development teams to ensure best practices for software development, deployment, and operations
  • Ensure security and compliance across all infrastructure and operations
  • Mentor and guide junior SREs and DevOps engineers, fostering a culture of collaboration and continuous learning
  • Conduct root cause analysis of system failures and implement solutions to prevent recurrence
  • Optimize resource utilization to ensure cost-effective operations
  • Stay up-to-date with the latest industry trends and technologies, integrating them into our processes where appropriate
  • Requirements

  • 8+ years of experience in a DevOps / SRE role
  • Strong experience with cloud platforms (AWS, GCP, Azure)
  • Proficiency in infrastructure as code (IaC) tools (Terraform, CloudFormation, etc.)
  • Extensive experience with containerization and orchestration (Docker, Kubernetes)
  • Strong knowledge of CI / CD tools (Jenkins, GitLab CI, CircleCI, etc.)
  • Proficiency in scripting languages (Python, Bash, etc.)
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK stack, etc.)
  • Ability to participate in capacity planning and scalability assessments to support business growth and requirements
  • Well aware of SLI, SLO, SLA and Error Budget concepts and their implementations and provide on-call support and participate in incident management & response activities as needed
  • Solid understanding of networking and security principles
  • Excellent problem-solving skills and the ability to work under pressure
  • Strong communication and collaboration skills
  • B2+ English level proficiency
  • We offer

  • Opportunity to work on technical challenges that may impact across geographies
  • Vast opportunities for self-development : online university, knowledge sharing opportunities globally, learning opportunities through external certifications
  • Opportunity to share your ideas on international platforms
  • Sponsored Tech Talks & Hackathons
  • Unlimited access to LinkedIn learning solutions
  • Possibility to relocate to any EPAM office for short and long-term projects
  • Focused individual development
  • Benefit package : Health benefits Retirement benefits Paid time off Flexible benefits
  • Forums to explore beyond work passion (CSR, photography, painting, sports, etc.)
  • Create a job alert for this search

    Lead Systems Engineer DevOps SRE • Chennai, India

    Similar jobs
    Senior DevOps / Cloud Lead

    Senior DevOps / Cloud Lead

    Staffingine LLC • chennai, tamil nadu, in
    Interested candidates can share their resume at usha@staffingine.Senior DevOps / Cloud Lead (Hands-On) – Remote.We are hiring a Senior DevOps / Cloud Lead for Forge Global on a contract basis (6 Mo...Show more
    Last updated: 4 days ago • Promoted
    Openshift L3 Engineer

    Openshift L3 Engineer

    Protonlogics IT Solutions • chennai, tamil nadu, in
    Shift : UK / US Shift (Mandatory).Lead resolution of critical or complex OpenShift and Kubernetes issues.Perform deep-dive troubleshooting, root cause analysis, and preventive action planning.Oversee ...Show more
    Last updated: 1 day ago • Promoted
    Site Reliability Engineer (SRE) – Core IT Infrastructure

    Site Reliability Engineer (SRE) – Core IT Infrastructure

    TECEZE • Chennai, Tamil Nadu, India
    Site Reliability Engineer (SRE) – Core IT Infrastructure.Infrastructure Reliability & Operations.Design, implement, and maintain. Ensure reliability, performance, scalability, and security of.Monito...Show more
    Last updated: 13 days ago • Promoted
    Sr Kubernetes Admin

    Sr Kubernetes Admin

    ScaleneWorks • Chennai, Tamil Nadu, India
    Quick Apply
    Job Title : Sr Kubernetes Admin.Position : Senior Systems Engineer.Category : Software Development / Engineering.Shift : Rotational Shift (Primarily - 7PM-4AM IST) - US Hours. Main location : Bangalore, C...Show more
    Last updated: 30+ days ago
    Senior AWS Cloud Infrastructure Engineer

    Senior AWS Cloud Infrastructure Engineer

    SQ1 Security • Chennai, Tamil Nadu, India
    We are seeking an experienced and proactive.AWS Cloud Infrastructure Engineer.You will be responsible for maintaining and optimizing a diverse infrastructure that includes.EC2, RDS, VPNs, Transit G...Show more
    Last updated: 8 days ago • Promoted
    OutSystems Technical Lead

    OutSystems Technical Lead

    Xebia • Chennai, Tamil Nadu, India
    Lead the design and development of enterprise OutSystems applications.Define architecture and enforce coding standards.Mentor and manage OutSystems developers. Coordinate with Business, QA, and DevO...Show more
    Last updated: 3 days ago • Promoted
    SRE

    SRE

    Tata Consultancy Services • Chennai, Tamil Nadu, India
    Tata Consultancy Services is hiring for SRE for supporting forgerock application.Role : SRE for supporting forgerock application. Location : Chennai, Bangalore, Pune.SRE for supporting forgerock appl...Show more
    Last updated: 13 days ago • Promoted
    AWS Cloud DevOps Lead

    AWS Cloud DevOps Lead

    Luxoft • Chennai, Tamil Nadu, India
    We're seeking a solid and creative AWS Cloud DevOps eager to solve scale problems and work on cutting-edge and open-source technologies. In this project, you will have the opportunity to write code ...Show more
    Last updated: 10 days ago • Promoted
    Senior Systems Administrator - Redhat Enterprise Linux

    Senior Systems Administrator - Redhat Enterprise Linux

    Tata Consultancy Services • Chennai, Tamil Nadu, India
    Senior Systems Administrator - Redhat Enterprise Linux.Minimum of 8 years’ experience in Redhat systems (Linux, Satellite and Ansible) in a large multi-platform environment.Familiarity with securit...Show more
    Last updated: 13 days ago • Promoted
    Platform Engineer (Cloud & DevOps)

    Platform Engineer (Cloud & DevOps)

    Altrum AI • Chennai, IN
    Aligne’s AltrumAI platform empowers enterprises to unlock the full potential of Generative AI responsibly and confidently. We have built this SaaS platform to help organisations adopt AI with trust,...Show more
    Last updated: 7 days ago • Promoted
    Cloud Engineer

    Cloud Engineer

    Sharp Brains • chennai, tamil nadu, in
    We are looking for an experienced.The candidate will act as a technical SME, handle complex incidents, lead cloud implementations, and support business-critical workloads.Operations & Support (L3 L...Show more
    Last updated: 3 days ago • Promoted
    Site Reliability Engineering Lead

    Site Reliability Engineering Lead

    Dextara Datamatics • Chennai, Tamil Nadu, India
    Role : Site Reliability Engineering Lead.Experience Required : 7+ years of experience in SRE DevOps, or Cloud Infrastructure with minimum 2+ years in a lead / mentoring. Deep AWS expertise (EC2, S3, RDS...Show more
    Last updated: 2 days ago • Promoted
    System Administrator (Terraform Automatoin SME)

    System Administrator (Terraform Automatoin SME)

    Tata Consultancy Services • Chennai, Tamil Nadu, India
    Role - Terraform Automatoin SME.Terraform, Ansible, DevOps, GitLab.Architect and maintain Terraform / Cloud formation modules for multi-cloud infrastructure provisioning. Define and implement Terraf...Show more
    Last updated: 13 days ago • Promoted
    L3 / L4 Infra Support Engineer

    L3 / L4 Infra Support Engineer

    Consolidated Analytics • Chennai, Tamil Nadu, India
    Job description : L3 / L4 Infra Support Engineer - Windows / Azure.Systems Administrator to join our Systems Team to help design, implement, maintain, and support our growing server infrastructure in th...Show more
    Last updated: 20 days ago • Promoted
    DevSecOps

    DevSecOps

    JMAN Group • Chennai, Tamil Nadu, India
    JMAN Group is a fast-growing data engineering & data science consultancy.We work primarily with Private Equity Funds and their Portfolio Companies to create commercial value using Data & Artificial...Show more
    Last updated: 2 days ago • Promoted
    DevOps Lead Engineer (Contract)

    DevOps Lead Engineer (Contract)

    Saaki Argus & Averil Consulting • Chennai, Tamil Nadu, India
    Quick Apply
    Hiring : Azure DevOps Lead Engineer.CI / CD strategy, cloud automation, and deployment excellence for Azure-based enterprise platforms. This role requires hands-on leadership, strong automation skills,...Show more
    Last updated: 5 days ago
    Senior Site Reliability Engineer(Lead)

    Senior Site Reliability Engineer(Lead)

    ACL Digital • Chennai, IN
    Continuous monitoring of system performance and identify potential issues before they impact users.Experience working with Industry leading monitoring tools. Respond to incidents related to monitori...Show more
    Last updated: 14 days ago • Promoted
    Cloud / Infrastructure Engineer

    Cloud / Infrastructure Engineer

    Proglite • Chennai, Tamil Nadu, India
    The candidate is expected to own the operational stability and performance of hybrid cloud infrastructure (AWS, GCP and Nutanix). This involves leading automation efforts, architecting for reliabili...Show more
    Last updated: 20 days ago • Promoted