Talent.com
Senior Site Reliability Engineer

iVoyant • Kalyan-Dombivli, IN
15 hours ago
Job description

One of our clients is looking for an experienced Senior Site Reliability Engineer (SRE) to join their team and work on mission-critical SaaS cloud products.

Key Responsibilities :

Reliability and Performance Management :

  • Design, implement, and maintain highly available, scalable, and resilient cloud-native architectures for mission-critical SaaS products.
  • Develop and implement SLOs, SLIs, and SLAs to measure and improve service reliability.
  • Continuously optimize system performance and resource utilization across multiple cloud platforms.
  • Fine-tune and optimize application performance by analyzing code, traces, and database queries.

Incident Management and Troubleshooting :

  • Lead incident response efforts, effectively troubleshooting complex issues to minimize downtime and impact.
  • Reduce Mean Time to Recover (MTTR) through proactive monitoring, automated alerting, and efficient problem-solving techniques.
  • Conduct thorough Root Cause Analysis (RCA) for all major incidents and implement preventive measures.

Observability and Monitoring :

  • Design and implement end-to-end observability solutions across our distributed systems.
  • Develop and maintain comprehensive monitoring strategies using tools such as the ELK Stack, Prometheus, and Grafana.
  • Create and optimize product status dashboards to provide real-time visibility into system health and performance.

Automation and Infrastructure as Code (IaC) :

  • Implement Infrastructure as Code practices using tools like Terraform.
  • Develop and maintain automated deployment pipelines and CI/CD workflows.
  • Create self-healing systems and automate routine operational tasks to reduce manual intervention.

Cloud-Agnostic Architecture :

  • Design and implement cloud-agnostic solutions that can operate efficiently across multiple cloud providers.
  • Develop expertise in event-driven architecture and related technologies (e.g., Apache Kafka / Event Hubs, Redis, MongoDB Atlas, IoT Hub).
  • Implement and manage containerized applications using Kubernetes across different cloud environments.

Continuous Improvement :

  • Regularly review and refine operational practices to enhance efficiency and reliability.
  • Stay updated with the latest industry trends and technologies in SRE, cloud computing, and DevOps.
  • Contribute to the development of internal tools and frameworks to support SRE practices.

Requirements :

  • Strong knowledge of the Azure cloud platform and its associated services.
  • Expertise in observability tools (ELK Stack, Dynatrace, Prometheus).
  • Expertise in containerization technologies such as Docker and Kubernetes.
  • Understanding of event-driven architecture and database technologies (MongoDB Atlas, Azure SQL, PostgreSQL).
  • Proficiency in IaC tools such as Terraform and GitHub Actions.
  • Proficiency in one or more programming languages: Python, .NET, or Java.
  • Strong understanding of networking concepts, load balancing, and security practices.