Talent.com
This job offer is not available in your country.
Site Reliability Engineer / Lead - CI / CD Pipeline

Site Reliability Engineer / Lead - CI / CD Pipeline

SolutionTech HRMumbai
5 days ago
Job description

Key Responsibilities :

  • Lead and mentor a team of SREs / DevOps Engineers, fostering a culture of ownership, reliability, and continuous improvement.
  • Own the availability, scalability, and performance of production systems and services.
  • Design and manage distributed systems and microservices architectures at scale.
  • Develop and implement incident response strategies, root cause analysis, and create actionable postmortems.
  • Drive improvements in infrastructure automation, CI / CD pipelines, and deployment strategies.
  • Collaborate with cross-functional teams including engineering, product, and QA to embed SRE best practices.
  • Implement observability tools (e.g., Prometheus, Grafana, ELK, Datadog) to monitor system performance and proactively detect issues.
  • Manage and optimize cloud infrastructure on AWS, including services such as EC2, ELB,

AutoScaling, S3, CloudFront, and CloudWatch.

  • Utilize Infrastructure-as-Code tools such as Terraform, CloudFormation, or Pulumi for provisioning and maintaining infrastructure.
  • Apply strong Linux, networking, load balancing, and security principles to ensure platform
  • resilience.

  • Leverage Docker and Kubernetes for container orchestration and scalable deployments.
  • Build internal tools and automation using Python, Go, or Bash scripting.
  • Support event-driven architectures leveraging Kafka or RabbitMQ for high-throughput, real-time systems.
  • Proactively contribute to reliability-focused architecture and design Skills & Experience :
  • 6 - 10 years of overall experience in backend engineering, infrastructure, DevOps, or SRE roles.
  • Minimum 3 years of experience leading SRE, DevOps, or Infrastructure teams.
  • Proven track record managing distributed systems and microservices at scale.
  • Deep understanding of Linux systems, networking fundamentals, load balancing, and infrastructure security.
  • Strong hands-on experience with AWS services : EC2, ELB, AutoScaling, CloudFront, S3, and CloudWatch.
  • Expert-level knowledge of Docker and Kubernetes in production environments.
  • Proficient with Infrastructure-as-Code tools : Terraform, CloudFormation, or Pulumi.
  • Hands-on experience with monitoring and observability tools : Prometheus, Grafana, ELK
  • Stack, or Datadog.

  • Strong scripting or programming skills in Python, Go, Bash, or similar languages.
  • Familiarity with Kafka or RabbitMQ for event-driven and messaging architectures.
  • Excellent incident management skills, including triage, RCA, and communication.
  • Ability to thrive in fast-paced environments and adapt to changing Qualifications :
  • Bachelors degree in Computer Science, Engineering, or a related field.
  • Experience in startup or high-growth environments.
  • Contributions to open-source DevOps or SRE tools are a plus.
  • Certifications in AWS, Kubernetes, or other cloud-native technologies are advantageous.
  • (ref : hirist.tech)

    Create a job alert for this search

    Reliability Pipeline • Mumbai

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    XebiaThane, IN
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 25 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Haysmumbai, maharashtra, in
    Required skills and qualifications.Experience : Proven experience in technical support or engineering, preferably in AI / ML / GenAI environments. Technical Proficiency : Expertise in GenAI models (e.GPT,...Show moreLast updated: 24 days ago
    • Promoted
    RELX - Site Reliability Engineer - IAC Terraform

    RELX - Site Reliability Engineer - IAC Terraform

    REED ELSEVIER INDIA (a part of RELX India Pvt Ltd)Mumbai
    Job Description : - Lead initiatives to identify and eliminate manual, repetitive tasks through automation and tooling.Develop s...Show moreLast updated: 17 days ago
    • Promoted
    Akasa Air - Site Reliability Engineer

    Akasa Air - Site Reliability Engineer

    SNV AVIATION PRIVATE LIMITED / Akasa AirMumbai
    As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems and infrastructure. This includes troubleshooting issues, developing and maintaini...Show moreLast updated: 18 days ago
    • Promoted
    Site Reliability Engineer - AWS / Azure Cloud Services

    Site Reliability Engineer - AWS / Azure Cloud Services

    DeqodeMumbai
    Profile : Site Reliability Engineer (SRE) Experience Required : 6+ Years Locations : Mumbai, Gurgaon, Ch...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConcordKalyan-Dombivli, IN
    Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 16 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    UplersKalyan-Dombivli, IN
    Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 23 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2mumbai city, maharashtra, in
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer - Docker / Kubernetes

    Site Reliability Engineer - Docker / Kubernetes

    hirezy.aiMumbai
    Technical Skills : - Programming : Proficiency in languages like Python, Bash, or Java is essential.Operating Systems : ...Show moreLast updated: 26 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub Servicesmumbai, maharashtra, in
    Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation. Collaborate with development teams to en...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer - Chaos Management

    Site Reliability Engineer - Chaos Management

    Xebiamumbai, maharashtra, in
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 6 days ago
    • Promoted
    MindCraft Software - Site Reliability Engineer - DevOps

    MindCraft Software - Site Reliability Engineer - DevOps

    MindCraft Software Pvt. Ltd.Thane
    SRE (Site Reliability Engineer) Exp : 5-7 years Location : Thane - 5+ years in SRE or DevOps roles supporting high-scale platforms (fint...Show moreLast updated: 15 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Kalyan-Dombivli, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - Observability Services

    Site Reliability Engineer - Observability Services

    TeamWare SolutionsMumbai
    Role Summary : We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong focus on observability.The ideal candidate will have 5-8 years of experie...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Site Reliability Engineer - Cloud Computing

    Lead Site Reliability Engineer - Cloud Computing

    NeemtreeMumbai
    Responsibilities : - Team Leadership : Manage and mentor a team of SREs, assigning tasks, providing technical guidance, and fostering a culture of collaboration and ...Show moreLast updated: 3 days ago
    • Promoted
    Docsumo - Senior DevOps / Site Reliability Engineer - Python

    Docsumo - Senior DevOps / Site Reliability Engineer - Python

    DocsumoMumbai
    About Docsumo : Docsumo is a Document Workflow platform that converts unstructured documents (like bank statements, financials, policies) into structured, actionable ...Show moreLast updated: 8 days ago
    • Promoted
    Azilen Technologies - Site Reliability Engineer - Cloud Technologies

    Azilen Technologies - Site Reliability Engineer - Cloud Technologies

    Azilen Technologies Pvt LtdMumbai
    About the job : Who you are : - Deployment of large distributed application in Production / Staging environment Show moreLast updated: 30+ days ago
    • Promoted
    Associate Platform Reliability Engineer (SRE)

    Associate Platform Reliability Engineer (SRE)

    Jefferiesmumbai, maharashtra, in
    Jefferies,’’ ‘‘we,’’ ‘‘us’’ or ‘‘our’’) is a U.Our largest subsidiary, Jefferies LLC, a U.Jefferies International Limited, a U. Our strategy focuses on continuing to build out our investment banking...Show moreLast updated: 21 days ago