Talent.com
Site Reliability Engineer
Site Reliability EngineerHydrolix • gurgaon, India
Site Reliability Engineer

Site Reliability Engineer

Hydrolix • gurgaon, India
4 hours ago
Job description

About the job

At Hydrolix, we are revolutionizing the world of data management and analytics with our innovative cloud data platform, purpose-built for petabyte-scale datasets. Our mission is to help organizations drastically reduce data costs while increasing their data retention.We are looking for a Site Reliability Engineer (SRE) with 8 to 10+ years experience to join our dynamic Services team. In this role, you will contribute to the reliability and scalability of our cutting-edge platform, ensuring exceptional solutions tailored to our customers’ unique needs. This is a highly technical, hands-on role that requires deep expertise in system reliability and automation.

Key Responsibilities

  • Infrastructure Reliability : Deploy, maintain, and ensure a highly reliable fleet of Kubernetes clusters and Hydrolix deployments across multiple cloud platforms.
  • Service Optimization : Design, implement, and maintain systems and processes to enhance the reliability, availability, and performance of our services.
  • CI / CD Management : Build and optimize CI / CD tools and processes to ensure efficient and reliable deployments.
  • Monitoring and Incident Response : Develop and manage monitoring, alerting, and incident response strategies to minimize downtime and enable rapid recovery.
  • Root Cause Analysis : Conduct comprehensive root cause analyses for system failures, implementing long-term preventive measures.
  • Automation and Efficiency : Automate repetitive tasks and optimize system performance to improve operational efficiency.
  • On-Call Support : Participate in covering weekday business hours and once-monthly weekend shifts.

Collaboration and Customer Engagement

  • Cross-Functional Teamwork : Work closely with software engineering, infrastructure, and product teams to integrate reliability practices into every stage of the development lifecycle.
  • Reliability Advocacy : Champion SRE best practices and foster a culture of operational excellence across the organization.
  • Global Team Collaboration : Collaborate with a distributed team of engineers worldwide to provide round-the-clock support.
  • Customer Support : Interface with customers to address and resolve reported incidents, ensuring a seamless user experience.
  • Qualifications and Skills

  • SRE Expertise : Proven experience as a Site Reliability Engineer or similar role, with a history of supporting complex distributed systems.
  • Observability Tools : Experience with monitoring and debugging tools like Prometheus, Vector, Grafana, Superset, or Kibana.
  • Cloud Platforms : Proficiency in at least one major cloud platform (AWS, GCP, Azure, or Linode).
  • UI Development Experience : Hands-on experience building internal tooling using modern frontend frameworks (e.g., React, Vue, or Angular etc), enabling improved visibility, and operational workflows for engineering teams.
  • Database Knowledge : Experience with SQL databases; familiarity with PostgreSQL is a plus but not required.
  • Programming / Scripting Skills : Proficiency in Unix scripting and programming languages such as Python or Go
  • Linux Expertise : Strong experience with Linux systems, including performance tuning and system-level troubleshooting.
  • Communication Skills : Excellent written and verbal communication skills, with the ability to convey technical concepts clearly to diverse audiences, including customers and cross-functional teams.
  • Hydrolix provides equal employment opportunities without regard to an applicant’s race, sex, pregnancy, sexual orientation, gender identity or expression, genetic information, national origin, age, physical or mental disability, medical condition, religion, marital status or veteran status.Applicants with disabilities may be entitled to reasonable accommodation under the terms of the Americans with Disabilities Act and certain state or local laws. A reasonable accommodation is a change in the way things are normally done which will ensure an equal employment opportunity without imposing undue hardship on Hydrolix. Please inform us if you need assistance completing any forms or to otherwise participate in the application process.

    Create a job alert for this search

    Site Reliability Engineer • gurgaon, India

    Related jobs
    Site Reliability Engineer

    Site Reliability Engineer

    Yum! India Global Services Private Limited • Gurugram, Haryana, India
    Design, test, implement, deploy, and support continuous integration pipelines that build and deploy to cloud-based environments (development, stage / testing, production). In this role, you will help ...Show more
    Last updated: 10 days ago • Promoted
    Senior Lead Engineer - Full Stack

    Senior Lead Engineer - Full Stack

    REA • Gurgaon, India
    Senior Lead Engineer Full Stack.In 1995, in a garage in Melbourne, Australia, REA Group was born from a simple question : Can we change the way the world experiences property?Could we? Yes.Fast for...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Insight Global • gurgaon, India
    Contract with Insight Global Client.Show more
    Last updated: 22 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ISS STOXX • Gurgaon, Haryana, India
    This role is critical in ensuring the reliability scalability and performance of our systems while driving automation and continuous improvement. Assist the Principal SRE in driving the architecture...Show more
    Last updated: 18 days ago • Promoted
    Customer Engineer

    Customer Engineer

    Lepton Software • Gurugram, Haryana, India
    Lepton Software is a leading provider of Location Intelligence, Smart Mapping, and GIS-based software solutions, empowering enterprises across Telecom, BFSI, Government, Retail, and Automotive sect...Show more
    Last updated: 20 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Infinova Global Corporate Services LLP • gurugram, India
    Infinova is an emerging player in intelligent business transformation, dedicated to helping organizations scale smarter and achieve sustainable success. We are building a foundation that combines st...Show more
    Last updated: 4 hours ago • Promoted • New!
    SDE-2 (Full-Stack Engineer)

    SDE-2 (Full-Stack Engineer)

    Mechademy • Gurugram, Haryana, India
    Mechademy is a pioneering company combining decades of expertise in turbomachinery with cutting-edge machine learning algorithms through its IoT platform, Turbomechanica®.The platform leverages dat...Show more
    Last updated: 27 days ago • Promoted
    Lead Engineer - Full Stack [T500-19146]

    Lead Engineer - Full Stack [T500-19146]

    REA Cyber City • Gurugram, Haryana, India
    In 1995, in a garage in Melbourne, Australia, REA Group was born from a simple question : “Can we change the way the world experiences property?”. Fast forward 30 years, REA Group is a market leader ...Show more
    Last updated: 28 days ago • Promoted
    Manager, Site Reliability Engineering

    Manager, Site Reliability Engineering

    Cvent • Gurugram, Haryana, India
    Cvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform.We build teams that...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer III

    Site Reliability Engineer III

    Zinnia • Gurgaon, Haryana, India
    Zinnia is the leading technology platform for accelerating life and annuities growth.With innovative enterprise solutions and data insights Zinnia simplifies the experience of buying selling and ad...Show more
    Last updated: 30+ days ago • Promoted
    Engineer

    Engineer

    Samtel Avionics • Gurugram, Haryana, India
    Software Development - Embedded Engineer.Strong proficiency in C programming for embedded systems.Experience in device driver development (UART, I2C, SPI, PCIe). Good understanding of RTOS concepts ...Show more
    Last updated: 6 days ago • Promoted
    Lead Engineer - Full Stack

    Lead Engineer - Full Stack

    REA • Gurgaon, India
    In 1995, in a garage in Melbourne, Australia, REA Group was born from a simple question : Can we change the way the world experiences property?. Fast forward 30 years, REA Group is a market leader in...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    S&P Global • Gurgaon, Haryana, India
    Grade Level (for internal use) : .S&P Global provides innovative products and services that enhance transparency reduce risk and improve operational efficiency. Our customers include banks hedge f...Show more
    Last updated: 30+ days ago • Promoted
    Full Stack Developer - Senior Systems Engineer

    Full Stack Developer - Senior Systems Engineer

    AutoZone • Gurugram, Haryana, India
    AutoZone is the nation's leading retailer and a leading distributor of automotive replacement parts and accessories with more than 6,000 stores in the US, Puerto Rico, Mexico, and Brazil.Each store...Show more
    Last updated: 30+ days ago • Promoted
    Senior Technical Engineering Manager, Software Development Engineer in Test

    Senior Technical Engineering Manager, Software Development Engineer in Test

    Alkami Technology • Gurugram, Haryana, India
    As a Senior Technical Engineering Manager, SDET at Alkami, you will lead, mentor, and scale a team of Software Development Engineers in Test (SDETs) responsible for delivering high-quality, scalabl...Show more
    Last updated: 28 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Xebia • Gurgaon, Gurgaon (district)
    Performance & Reliability Engineer ( Senior, Lead , Principal & Manager).Location : Pune, Chennai, Bangalore & Gurgaon.Role : Performance & Reliability Engineer. Job Location : Gurgaon, Chennai, Pune, ...Show more
    Last updated: 17 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Alp Consulting Ltd. • gurgaon, India
    Show more
    Last updated: 21 hours ago • Promoted • New!
    Sr Staff SDET

    Sr Staff SDET

    Alkami • Gurgaon, India
    Guide the work of a group of SDETs in one or more functional areas with responsibility for all aspects of test automation, including framework enhancements and being an evangelist of quality.Review...Show more
    Last updated: 30+ days ago • Promoted