Talent.com
Senior Site Reliability Engineer

Senior Site Reliability Engineer

iVoyantPune, IN
1 day ago
Job description

One of our clients is looking for an experienced Senior Site Reliability Engineer (SRE) - Mission-Critical SaaS Cloud Products to join their team.

Key Responsibilities :

Reliability and Performance Management :

  • Design, implement, and maintain highly available, scalable, and resilient cloud-native architectures for mission-critical SaaS products.
  • Develop and implement SLOs, SLIs, and SLAs to measure and improve service reliability.
  • Continuously optimize system performance and resource utilization across multiple cloud platforms.
  • Finetune / Optimize Application performance by analyzing the code, traces and database queries.

Incident Management and Troubleshooting :

  • Lead incident response efforts, effectively troubleshooting complex issues to minimize downtime and impact.
  • Reduce Mean Time to Recover (MTTR) through proactive monitoring, automated alerting, and efficient problem-solving techniques.
  • Conduct thorough Root Cause Analysis (RCA) for all major incidents and implement preventive measures.
  • Observability and Monitoring :

  • Design and implement end-to-end observability solutions across our distributed systems.
  • Develop and maintain comprehensive monitoring strategies using tools like ELK Stack, Prometheus, Grafana.
  • Create and optimize product status dashboards to provide real-time visibility into system health and performance.
  • Automation and Infrastructure as Code (IaC) :

  • Implement Infrastructure as Code practices using tools like Terraform.
  • Develop and maintain automated deployment pipelines and CI / CD workflows.
  • Create self-healing systems and automate routine operational tasks to reduce manual intervention.
  • Cloud-Agnostic Architecture :

  • Design and implement cloud-agnostic solutions that can operate efficiently across multiple cloud providers.
  • Develop expertise in event-driven architecture and related technologies (e.g., Apache Kafka / EventHub, Redis, Mongo Atlas, IoTHub).
  • Implement and manage containerized applications using Kubernetes across different cloud environments.
  • Continuous Improvement :

  • Regularly review and refine operational practices to enhance efficiency and reliability.
  • Stay updated with the latest industry trends and technologies in SRE, cloud computing, and DevOps.
  • Contribute to the development of internal tools and frameworks to support SRE practices.
  • Requirements :

  • Strong knowledge of cloud platforms - Azure and their associated services.
  • Expert in Observability tools (ELK Stack, Dynatrace, Prometheus)
  • Expertise in containerization technologies such as Docker and Kubernetes
  • Understanding of Event-driven architecture and database technologies (Mongo Atlas, Azure SQL, Postgres DB)
  • Proficient in IaaC tools such as - Terraform and GitHub Actions.
  • Proficiency in one or more programming languages - Python / .Net / Java
  • Strong understanding of networking concepts, load balancing, and security practices.
  • Create a job alert for this search

    Senior Site Reliability Engineer • Pune, IN

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    AllianzPune
    Site Reliability Engineer (SRE) - One Identity Access Management The primary objective of the Site Reliability Engineer (SRE) specializing in One Identity Access Mana...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialPune
    The Software Engineering team delivers next-generation application enhancements and new products for a changing world.Working at the cutting edge, we design and develop software for platforms, peri...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Tech Lead - Site Reliability

    Sr. Tech Lead - Site Reliability

    ConfidentialPune
    We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to join our team.This role involves ensuring the reliability, scalability, and efficiency of cloud infrastructure and applicat...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    o9 Solutions, Inc.pune, maharashtra, in
    Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show moreLast updated: 11 days ago
    • Promoted
    Rosemallow Technologies - Site Reliability Engineer

    Rosemallow Technologies - Site Reliability Engineer

    ROSEMALLOW TECHNOLOGIES PRIVATE LIMITEDPune
    Job Title : Site Reliability Engineer (SRE).Department : Technology / Infrastructure / DevOps.Employment Type : Full-time.Job Summary : Show moreLast updated: 30+ days ago
    • Promoted
    Reveille Technologies - Site Reliability Engineer - DevOps

    Reveille Technologies - Site Reliability Engineer - DevOps

    Reveille TechnologiesPune
    Job Summary : We are seeking a proactive and skilled Site Reliability Engineer (SRE) to join our team on a Contract-to-Hire (C2H) basis.The ideal c...Show moreLast updated: 30+ days ago
    • Promoted
    Qualys - Senior Site Reliability Engineer - DevOps

    Qualys - Senior Site Reliability Engineer - DevOps

    QUALYS SECURITY TECHSERVICES PRIVATE LIMITEDPune
    About the job : Come work at a place where innovation and teamwork come together to support the most exciting missions in the world! <...Show moreLast updated: 30+ days ago
    • Promoted
    CrelioHealth - Site Reliability Engineer - CI / CD Pipeline

    CrelioHealth - Site Reliability Engineer - CI / CD Pipeline

    CRELIANT SOFTWARE PRIVATE LIMITEDPune
    Job Role : Site Reliability Engineer.Job Summary : We are seeking a Senior DevOps & SRE Engineer to join our team and help us build,...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SFS Group India Pvt. Ltd.Pune, Maharashtra, India
    Act as the Site Reliability Engineer for global operations, ensuring system stability, scalability, and efficiency through advanced automation, observability, and proactive infrastructure managemen...Show moreLast updated: 6 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Pune, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Site Reliability Enginee

    Principal Site Reliability Enginee

    ConfidentialPune
    As a Principal Site Reliability Engineer, you will be responsible for developing sophisticated systems and software based on the customer s business goals, needs and general business environment.Yo...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - Linux

    Site Reliability Engineer - Linux

    Persistent SystemsPune, Maharashtra, India
    We are looking for a versatile and experienced Linux & Cloud Infrastructure Engineer to join our technology team.This role involves managing and optimizing cloud infrastructure, automating system c...Show moreLast updated: 11 days ago
    • Promoted
    Site Reliability Engineer - OpenShift

    Site Reliability Engineer - OpenShift

    ConfidentialPune
    Applies software engineering principles to the operations domain.Contributes to a service's codebase, writes automation that aids in the management of a service, and performs operational engineerin...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer -Lead position

    Site Reliability Engineer -Lead position

    ConfidentialKolkata, Pune
    Overall 8 - 11 years of experience with relevant 5-8 years of strong experience in Site Reliability Engineering.Experience in running production environment by monitoring availability and taking a ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgePune, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 2 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TechVeritoPune, Maharashtra, India
    As a SRE Engineer, you will have a strong background in cloud infrastructure management and deployment, with expertise in AWS cloud, DevOps tools, and Kubernetes ecosystem.The primary focus of this...Show moreLast updated: 25 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmapune, maharashtra, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 10 days ago
    • Promoted
    TCS Is Hiring For Site Reliability Engineering (SRE)

    TCS Is Hiring For Site Reliability Engineering (SRE)

    Tata Consultancy ServicesPune, Maharashtra, India
    Exp Range- 8-10 years Location- Pune / Kochi / Indore (Must have) - To Detect the Incidents and act proactively escalate using the built in dashboards. Hands on using Dynatrace dashboards and creatio...Show moreLast updated: 4 days ago