Talent.com
This job offer is not available in your country.
Site Reliability Engineer

Site Reliability Engineer

Resource AlgorithmDelhi, India
18 hours ago
Job description

Senior SRE (Engineering & Reliability)

Job Summary :

We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems.

As an SeniorSRE, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving automation, monitoring, and incident response strategies. This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems.

Experience : 7+ years

Key Responsibilities :

Reliability & Performance :

  • Lead efforts to maintain high availability and reliability of critical services.
  • Define and monitor SLIs, SLOs, and SLAs to ensure business requirements are met.
  • Proactively identify and resolve performance bottlenecks and system inefficiencies. Incident

Management & Response :

  • Establish and improve incident management processes and on-call rotations.
  • Lead incident response and root cause analysis for high-priority outages.
  • Drive post-incident reviews and ensure actionable insights are implemented.
  • Automation & Tooling :

  • Develop and implement automated solutions to reduce manual operational tasks.
  • Enhance system observability through metrics, logging, and distributed tracing tools (e.g.,
  • Prometheus, Grafana, Elastic APM).

  • Optimize CI / CD pipelines for seamless deployments.
  • Collaboration :

  • Partner with software engineering teams to improve the reliability of applications and infrastructure.
  • Work closely with product / engineering teams to design scalable and robust systems.
  • Ensure seamless integration of monitoring and alerting systems across teams. Leadership &
  • Team Building :

  • Manage, mentor, and grow a team of SREs.
  • Promote SRE best practices and foster a culture of reliability and performance across the organization.
  • Drive performance reviews, skills development, and career progression for team members.
  • Capacity Planning & Cost Optimization :

  • Perform capacity planning and implement autoscaling solutions to handle traffic spikes.
  • Optimize infrastructure and cloud costs while maintaining reliability and performance.
  • Skills & Qualifications :

    Required Skills :

  • Technical Expertise : o Experience with cloud platforms (AWS / Azure / GCP) and Kubernetes.
  • Hands-on knowledge of infrastructure-as-code tools like Terraform / Helm / Ansible.

    o Proficiency in Java o Expertise in distributed systems, databases, and load balancing.

    Monitoring & Observability :

    Proficient with tools like Prometheus, Grafana,, Elastic APM, or New relic.

    o Understanding of metrics-driven approaches for system monitoring and alerting.

  • Automation & CI / CD :
  • o Hands-on experience with CI / CD pipelines (e.g., Jenkins, Azure Pipelines etc).

    o Skilled in automation frameworks and tools for infrastructure and application deployments.

  • Incident Management :
  • o Proven track record in handling incidents, post-mortems, and implementing solutions to prevent recurrence.

    Leadership & Communication Skills :

  • Strong people management and leadership skills with the ability to inspire and motivate teams.
  • Excellent problem-solving and decision-making skills.
  • Clear and concise communication, with the ability to translate technical concepts for non-technical stakeholders.
  • Preferred Qualifications :

  • Experience with database optimization, Kafka, or other messaging systems.
  • Knowledge of autoscaling techniques
  • Previous experience in an SRE, DevOps, or infrastructure engineering leadership role.
  • Understanding of compliance and security best practices in distributed systems.
  • Create a job alert for this search

    Site Reliability Engineer • Delhi, India

    Related jobs
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Meerut, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    o9 Solutions, Inc.Delhi, India
    Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    RecRootsDelhi, India
    The core premise for the SRE lies in treating operational issues as a software problem.We code our way out of problems where operations are concerned, addressing availability, scalability, latency,...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WhiteLotus Talent PartnersDelhi, India
    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure power...Show moreLast updated: 1 day ago
    • Promoted
    AWS Site Reliability Engineer

    AWS Site Reliability Engineer

    HTC Global ServicesDelhi, India
    HTC – A brief profile Established in 1990, HTC Inc.Troy, Michigan, is a leading global Information Technology solution and BPO provider. HTC assists clients across multiple industry verticals, offer...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    IntraEdgeDelhi, India
    Job Title : Site Reliability Engineer (SRE) – Production Support.We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and clou...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    QualityKiosk TechnologiesDelhi, India
    QualityKiosk Technologies is one of the world's largest independent Quality Engineering (QE) providers and digital transformation enablers, helping companies build and manage applications for optim...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TechVeritoDelhi, India
    As a SRE Engineer, you will have a strong background in cloud infrastructure management, migration and deployment, with expertise in Google Cloud Platform (GCP), DevOps tools, and Kubernetes ecosys...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TalentiserDelhi, India
    YOUR IMPACT : Reliability, Automation, and Observability As a hybrid Site Reliability Engineer / DevOps Engineer, you'll be a key driver in ensuring the stability, performance, and scalability of our ...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer / Lead

    Site Reliability Engineer / Lead

    CoforgeNoida, Uttar Pradesh, India
    Skills : Docker, Prometheus, grafana, ELK, DataDog.We at Coforge are hiring a highly skilled and experienced.You will lead a team of SREs, collaborate with development and operations teams, and impl...Show moreLast updated: 11 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Endpoint ClinicalDelhi, India
    Endpoint is an interactive response technology (IRT®) systems and solutions provider that supports the life sciences industry. Since 2009, we have been working with a single vision in mind, to help ...Show moreLast updated: 1 day ago
    • Promoted
    Software Engineer, Site Reliability Engineering (Ecoh Core)

    Software Engineer, Site Reliability Engineering (Ecoh Core)

    EcohDelhi, IN
    Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.Strong problem-solving and analytical skills. Ability to debug, optimize code, and automate routine tasks.E...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    PoshmarkDelhi, India
    We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Sonata SoftwareGhaziabad, IN
    We're Hiring : Senior Site Reliability Engineer.Onsite (Office : Hyderabad – Mandatory from Day 1).Senior Site Reliability Engineer (SRE). This is a high-impact role where you’ll design scalable archi...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub ServicesDelhi, India
    Key Responsibilities Manage and scale production systems hosted on.Google Cloud Platform (GCP) Implement.SRE best practices : monitoring, alerting, SLAs, SLOs, and error budgets Automate operationa...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SynechronDelhi, India
    Good-day, We have immediate opportunity for Senior Site Reliability Engineer.Senior Site Reliability Engineer Job Location : Synechron. Notice : Immediate Joiner About Company : At Synechron, we belie...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ACL DigitalDelhi, India
    Service Management : Maintain application uptime / performance, manage system enhancements and defects, oversee daily operational activities, and ensure continuous improvement and adherence to ITIL be...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TrantorDelhi, India
    Job Title - Site Reliability Engineer Role- Contract (9 Months- Extendable) Exp- 5+ years Loc- Bangalore ( Hybrid) Notice- Immediate joiner only. Duties : Responsible for maintaining and scaling prod...Show moreLast updated: 7 days ago