Talent.com
Site Reliability Engineer
Site Reliability EngineerHRhelpdesk • Gurgaon, Haryana, India
Site Reliability Engineer

Site Reliability Engineer

HRhelpdesk • Gurgaon, Haryana, India
12 days ago
Job description

About the company : Company is a rapidly growing, private equity backed SaaS product company and provides cloud-based solutions.

Job Summary : As a Site Reliability Engineer (SRE), you will be responsible for building and maintaining the infrastructure, tools, and pipelines that keep our systems running smoothly. You will collaborate closely with DevOps, engineering, and product teams to design and deploy reliable, scalable, and automated systems. You will also improve the application code for user-facing bugs, ensuring enhanced performance and resilience.

RESPONSIBILITIES : Comfortable with work shift aligned with U.S. time zone (7 pm to 3 am IST)

1. CI / CD Pipeline Management :

  • Design, implement, and maintain robust CI / CD pipelines for automated software deployment.
  • Collaborate with DevOps and engineering teams to integrate testing, monitoring, and security checks into pipelines.
  • Continuously improve deployment processes to ensure smooth and error-free production releases.

2. Monitoring and Observability :

  • Create and manage comprehensive logging dashboards in Datadog to monitor system health, performance, and logs.
  • Set up alerting mechanisms to proactively identify and respond to system issues.
  • Analyze and visualize key performance metrics to drive improvements.
  • 3. Collaborate on Architectural Solutions :

  • Work closely with DevOps and engineering teams to design scalable, resilient, and secure infrastructure.
  • Ensure solutions adhere to best practices for performance, security, and maintainability.
  • 4. Code Optimization and Bug Fixing :

  • Improve application code to resolve user-facing bugs and enhance system resilience.
  • Troubleshoot and fix issues that impact the performance or availability of production systems.
  • Contribute to the continuous improvement of the codebase, focusing on optimizing performance and reliability.
  • 5. Automation and Continuous Improvement :

  • Automate repetitive tasks related to infrastructure management, monitoring, and troubleshooting.
  • Identify and propose innovative solutions to improve system efficiency and performance. 6. Custom Node.js CLI Tool Development :
  • Develop and automate custom Node.js CLI tools to enhance operational workflows and streamline repetitive tasks.
  • Implement automated solutions to optimize system processe
  • Requirements

    MUST HAVES :

  • Experience Level : 6-8 years
  • Comfortable with work shift aligned with U.S. time zone (7 pm to 3 am IST)
  • Prior experience working in cross-functional teams
  • Systems architecture and design skills
  • Proficiency in scripting languages such as Bash, Python, or PowerShell.
  • Experience with CI / CD tools such as Github Actions or similar platforms.
  • Build and deployment automation experience especially in a containerized world
  • Proficiency with common ops tools (ECS, Logstash, Datadog + Kibana, EKS etc)
  • Experience with AWS or Azure
  • Comfort maintaining live production systems
  • Strong communication and collaboration skills, with the ability to work effectively in a fast-paced team environment.
  • Create a job alert for this search

    Site Reliability Engineer • Gurgaon, Haryana, India

    Related jobs
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Confidential • Gurugram, Gurgaon / Gurugram, India
    Site Reliability Engineer (SRE).The ideal candidate will have hands-on experience managing large-scale, distributed systems in production, with a deep understanding of reliability, scalability, and...Show more
    Last updated: 26 days ago • Promoted
    Senior Site Reliability Engineer-III

    Senior Site Reliability Engineer-III

    Confidential • Gurgaon / Gurugram
    Define and enforce SLOs, SLIs, and error budgets across microservices.Architect an observability stack (metrics, logs, traces) and derive operational insights. Automate toil and manual operations th...Show more
    Last updated: 26 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Synamedia • Gurgaon, Haryana, India
    JOB DESCRIPTION At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed. We are backed by the Permira funds and Sky.This is the ...Show more
    Last updated: 16 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Xebia • Gurgaon, India
    Performance & Reliability Engineer ( Senior, Lead , Principal & Manager).Location : Pune, Chennai, Bangalore & Gurgaon.Role : Performance & Reliability Engineer. Job Location : Gurgaon, Chennai, Pune, ...Show more
    Last updated: 15 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ISS STOXX • Gurgaon, Haryana, India
    This role is critical in ensuring the reliability scalability and performance of our systems while driving automation and continuous improvement. Assist the Principal SRE in driving the architecture...Show more
    Last updated: 12 days ago • Promoted
    Senior Site Reliability Engineer (C# / Python)

    Senior Site Reliability Engineer (C# / Python)

    Entech • Gurgaon, Haryana, India
    Entech is hiring a Senior Software Site Reliability Engineer (C# / Python) for a long-term remote opportunity with our client. You’ll ensure enterprise systems are reliable, scalable, and performan...Show more
    Last updated: 7 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Confidential • Gurgaon / Gurugram, India
    Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives.The work you do with our team will directly improve health outcomes by connect...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaService • Gurgaon, Haryana, India
    About InstaService InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ ...Show more
    Last updated: 20 days ago • Promoted
    Macquarie - Site Reliability Engineering Manager

    Macquarie - Site Reliability Engineering Manager

    Macquarie • Gurgaon
    Description : We have an exciting opportunity to join our Commodities and Global Markets (CGM) Trade Validation team as a Site Reliability Engineer who can lead a tea...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Yum! India Global Services Private Limited • Gurugram, Haryana, India
    Design, test, implement, deploy, and support continuous integration pipelines that build and deploy to cloud-based environments (development, stage / testing, production). In this role, you will help ...Show more
    Last updated: 4 days ago • Promoted
    Site Reliability Engineer III

    Site Reliability Engineer III

    Zinnia • Gurgaon, Haryana, India
    Zinnia is the leading technology platform for accelerating life and annuities growth.With innovative enterprise solutions and data insights Zinnia simplifies the experience of buying selling and ad...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Manager

    Site Reliability Manager

    dunnhumby • Gurugram, Haryana, India
    Europe, Asia, Africa, and the Americas working for transformative, iconic brands such as Tesco, Coca-Cola, Meijer, Procter & Gamble and Metro. Customer Data Science, empowering businesses everywhere...Show more
    Last updated: 4 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    o9 Solutions, Inc. • Gurgaon, Haryana, India
    Be part of something revolutionary At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show more
    Last updated: 13 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    S&P Global • Gurgaon, Haryana, India
    Grade Level (for internal use) : .S&P Global provides innovative products and services that enhance transparency reduce risk and improve operational efficiency. Our customers include banks hedge f...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer III

    Site Reliability Engineer III

    Confidential • Gurugram, Gurgaon / Gurugram, India
    Zinnia is the leading technology platform for accelerating life and annuities growth.With innovative enterprise solutions and data insights, Zinnia simplifies the experience of buying, selling, and...Show more
    Last updated: 26 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Confidential • Gurugram, Gurgaon / Gurugram, India
    Grade Level (for internal use) : .S&P Global provides innovative products and services that enhance transparency, reduce risk, and improve operational efficiency. Our customers include banks, hedge fu...Show more
    Last updated: 30+ days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Confidential • Gurugram, Gurgaon / Gurugram, India
    Join our software, system, and test engineering group as a.Lead Site Reliability Engineer.AWS infrastructure, automating CI / CD pipelines, and ensuring scalable, reliable deployments.You will levera...Show more
    Last updated: 12 days ago • Promoted
    Manager, Site Reliability Engineering

    Manager, Site Reliability Engineering

    Cvent • Gurgaon, Haryana, India
    Cvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform.We build teams that...Show more
    Last updated: 29 days ago • Promoted