Site Reliability Engineer - Observability Services

TeamWare Solutions, Mumbai
30+ days ago
Job description

Role Summary :

We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong focus on observability. The ideal candidate will have 5-8 years of experience in implementing and managing monitoring, logging, and alerting systems. This role requires expertise in the Kubernetes stack, as well as a solid foundation in coding and Infrastructure as Code to ensure the reliability and health of our systems.

Key Responsibilities :

  • Observability Implementation : Design and implement comprehensive observability solutions, including monitoring, logging, and alerting.
  • Kubernetes Stack Management : Work extensively with the Kubernetes stack and related tools such as Prometheus, Loki, Grafana, and Alertmanager to ensure system performance and reliability.
  • Coding & Automation : Apply proficiency in Python & Go to solve complex problems, automate tasks, and contribute to the development of tools and systems.
  • Infrastructure & CI/CD : Use Infrastructure as Code and manage CI/CD pipelines to ensure continuous and reliable deployments.
  • Troubleshooting : Apply strong troubleshooting and problem-solving skills to diagnose and resolve issues efficiently and proactively.

Required Skills :

  • Observability : Expertise in all aspects of observability, including Monitoring, Logging, and Alerting.
  • Kubernetes Stack : Deep knowledge and hands-on experience with Prometheus, Loki, Grafana, and Alertmanager.
  • Programming : Strong coding skills in Python and Go, sufficient to solve technical challenges.
  • DevOps : Experience with CI/CD pipelines and Infrastructure as Code (IaC).
  • Problem-Solving : Strong troubleshooting and problem-solving abilities.
  • Cloud : Experience with AWS is mandatory.

Nice to Have Skills :

  • Incident Management : Familiarity with PagerDuty.
  • Integrations : Experience with the Zoom Developer Platform.

Education & Experience :

  • Education : A Bachelor's degree in Computer Science, Information Technology, or a related field is preferred.
  • Experience : 5-8 years of experience in a Site Reliability or DevOps engineering role, with a focus on observability.

(ref : hirist.tech)
