Talent.com
No longer accepting applications
Senior Site Reliability Engineer

Senior Site Reliability Engineer

iVoyantIndia, India
7 days ago
Job description

One of our clients is looking for an experienced Senior Site Reliability Engineer (SRE) - Mission-Critical SaaS Cloud Products to join their team.

Key Responsibilities :

Reliability and Performance Management :

  • Design, implement, and maintain highly available, scalable, and resilient cloud-native architectures for mission-critical SaaS products.
  • Develop and implement SLOs, SLIs, and SLAs to measure and improve service reliability.
  • Continuously optimize system performance and resource utilization across multiple cloud platforms.
  • Finetune / Optimize Application performance by analyzing the code, traces and database queries.

Incident Management and Troubleshooting :

  • Lead incident response efforts, effectively troubleshooting complex issues to minimize downtime and impact.
  • Reduce Mean Time to Recover (MTTR) through proactive monitoring, automated alerting, and efficient problem-solving techniques.
  • Conduct thorough Root Cause Analysis (RCA) for all major incidents and implement preventive measures.
  • Observability and Monitoring :

  • Design and implement end-to-end observability solutions across our distributed systems.
  • Develop and maintain comprehensive monitoring strategies using tools like ELK Stack, Prometheus, Grafana.
  • Create and optimize product status dashboards to provide real-time visibility into system health and performance.
  • Automation and Infrastructure as Code (IaC) :

  • Implement Infrastructure as Code practices using tools like Terraform.
  • Develop and maintain automated deployment pipelines and CI / CD workflows.
  • Create self-healing systems and automate routine operational tasks to reduce manual intervention.
  • Cloud-Agnostic Architecture :

  • Design and implement cloud-agnostic solutions that can operate efficiently across multiple cloud providers.
  • Develop expertise in event-driven architecture and related technologies (e.g., Apache Kafka / EventHub, Redis, Mongo Atlas, IoTHub).
  • Implement and manage containerized applications using Kubernetes across different cloud environments.
  • Continuous Improvement :

  • Regularly review and refine operational practices to enhance efficiency and reliability.
  • Stay updated with the latest industry trends and technologies in SRE, cloud computing, and DevOps.
  • Contribute to the development of internal tools and frameworks to support SRE practices.
  • Requirements :

  • Strong knowledge of cloud platforms - Azure and their associated services.
  • Expert in Observability tools (ELK Stack, Dynatrace, Prometheus)
  • Expertise in containerization technologies such as Docker and Kubernetes
  • Understanding of Event-driven architecture and database technologies (Mongo Atlas, Azure SQL, Postgres DB)
  • Proficient in IaaC tools such as - Terraform and GitHub Actions.
  • Proficiency in one or more programming languages - Python / .Net / Java
  • Strong understanding of networking concepts, load balancing, and security practices.
  • Create a job alert for this search

    Senior Site Reliability Engineer • India, India

    Related jobs
    • Promoted
    Senior AppDynamics Observability SME

    Senior AppDynamics Observability SME

    Dexian IndiaNagpur, IN
    Position Title : Senior AppDynamics Observability SME.IT operations, system administration, or engineering.Ansible, Jenkins, Terraform, Python to develop configuration, deployment, and orchestration...Show moreLast updated: 7 days ago
    • Promoted
    Technical Lead

    Technical Lead

    ThumoNagpur, IN
    Founding Engineer @ Thumo (Africa’s first super-app).We’re building Africa’s super-app, starting with food delivery.M funding round led by Soma Capital with top Silicon Valley angels, we’re hiring ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site reliability engineer

    Site reliability engineer

    CapgeminiIndia, India, India
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 8 hours ago
    • Promoted
    Senior site reliability engineer- elk expert

    Senior site reliability engineer- elk expert

    IVedha Inc.India, India, India
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering PracticeLocation : India (Remote) - Must be available to work in the EST (US / Canada) Time Zone. Role Summary : Are you a Senio...Show moreLast updated: 1 day ago
    • Promoted
    Rotating Equipment Reliability Consultant / Trainer

    Rotating Equipment Reliability Consultant / Trainer

    EC-Energy EventsNagpur, IN
    EC-Energy Events is looking for an experienced Rotating Equipment Reliability Consultant / Trainer to join our growing pool of experts supporting technical conferences, training programs, and worksho...Show moreLast updated: 29 days ago
    • Promoted
    Resident Engineer – Kubernetes & Portworx

    Resident Engineer – Kubernetes & Portworx

    CMK Resources, Inc.Nagpur, IN
    CMK Resources Resident Engineer – Kubernetes & Portworx (3 openings).Help Shape the Future of Kubernetes Storage.Our client's largest and most strategic customer is moving VMware-based workloads to...Show moreLast updated: 30+ days ago
    • Promoted
    DevOps / Platform Engineer

    DevOps / Platform Engineer

    iVedha Inc.Nagpur, IN
    Hiring a seasoned DevOps / Platform Engineer to drive automation, platform reliability, and robust.Design, deploy, and manage CI / CD pipelines and infrastructure automation, leveraging AI for.Implemen...Show moreLast updated: 30+ days ago
    • Promoted
    Emulation Engineer / Lead

    Emulation Engineer / Lead

    eInfochips (An Arrow Company)Nagpur, IN
    Role : Emulation Engineer / Lead.Job Location : Noida, Chennai, Bangalore, Hyderabad, Ahmedabad.You must be having BS or MS in Electrical OR Electronics engineering. Minimum 4+ Years of Emulation Expe...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiIndia, India
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 7 days ago
    • Promoted
    Senior MLOps Engineer

    Senior MLOps Engineer

    Mitchell Martin Inc.Nagpur, IN
    Include, but are not limited to, the following : .Own productionizing models—from tracked experiments to governed releases—ensuring resilient services with clear SLOs, runbooks, and fast, safe rollba...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Engineer - Protocols

    Senior Engineer - Protocols

    RecroNagpur, IN
    As a Software Engineer, you will play a key role in enhancing our cloud-scale NAS platform.Your responsibilities will include : . Collaborating on requirements analysis, design reviews to evolve Nasun...Show moreLast updated: 18 days ago
    • Promoted
    Delinea Implementation Engineer

    Delinea Implementation Engineer

    K&K Talents - IndiaNagpur, IN
    This position is with one of our.Title : Delinea Implementation Engineer.Employment Type : Full-time Permanent.Delinea Implementation Engineer. Delinea (formerly Thycotic & Centrify) Privileged Access...Show moreLast updated: 10 days ago
    • Promoted
    IT Senior Engineer

    IT Senior Engineer

    KPG99 INCNagpur, IN
    Support the migration of applications to AWS (cloud migration currently underway).Must have strong hands-on experience with AWS,. NET, and cloud-based architectures.Full stack capability required, i...Show moreLast updated: 10 days ago
    • Promoted
    AI Exploration Engineer

    AI Exploration Engineer

    Mitchell Martin Inc.Nagpur, IN
    Design and execute machine learning experiments to evaluate emerging AI technologies and frameworks.Prototype and assess end-to-end AI solutions to inform product and platform strategy.Formulate hy...Show moreLast updated: 30+ days ago
    • Promoted
    Deployment Engineer

    Deployment Engineer

    AvocaNagpur, IN
    Build, launch & optimize AI agents that power the next generation of home-service customer experiences.Avoca is the all-in-one AI lead-conversion platform. Our technology boosts booking rates, slash...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeIndia
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 10 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.India, India
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Design Verification Engineer

    Senior Design Verification Engineer

    IgnitariumNagpur, IN
    We are seeking a skilled Design Verification Engineer with hands-on experience in live projects.If you have a passion for developing functional verification environments, excellent debugging skills...Show moreLast updated: 18 days ago