Talent.com
Site Reliability Engineer
Site Reliability EngineerHydrolix • India
Site Reliability Engineer

Site Reliability Engineer

Hydrolix • India
4 hours ago
Job description

About the job

At Hydrolix, we are revolutionizing the world of data management and analytics with our innovative cloud data platform, purpose-built for petabyte-scale datasets. Our mission is to help organizations drastically reduce data costs while increasing their data retention.We are looking for a Site Reliability Engineer (SRE) with 8 to 10+ years experience to join our dynamic Services team. In this role, you will contribute to the reliability and scalability of our cutting-edge platform, ensuring exceptional solutions tailored to our customers’ unique needs. This is a highly technical, hands-on role that requires deep expertise in system reliability and automation.

Key Responsibilities

  • Infrastructure Reliability : Deploy, maintain, and ensure a highly reliable fleet of Kubernetes clusters and Hydrolix deployments across multiple cloud platforms.
  • Service Optimization : Design, implement, and maintain systems and processes to enhance the reliability, availability, and performance of our services.
  • CI / CD Management : Build and optimize CI / CD tools and processes to ensure efficient and reliable deployments.
  • Monitoring and Incident Response : Develop and manage monitoring, alerting, and incident response strategies to minimize downtime and enable rapid recovery.
  • Root Cause Analysis : Conduct comprehensive root cause analyses for system failures, implementing long-term preventive measures.
  • Automation and Efficiency : Automate repetitive tasks and optimize system performance to improve operational efficiency.
  • On-Call Support : Participate in covering weekday business hours and once-monthly weekend shifts.

Collaboration and Customer Engagement

  • Cross-Functional Teamwork : Work closely with software engineering, infrastructure, and product teams to integrate reliability practices into every stage of the development lifecycle.
  • Reliability Advocacy : Champion SRE best practices and foster a culture of operational excellence across the organization.
  • Global Team Collaboration : Collaborate with a distributed team of engineers worldwide to provide round-the-clock support.
  • Customer Support : Interface with customers to address and resolve reported incidents, ensuring a seamless user experience.
  • Qualifications and Skills

  • SRE Expertise : Proven experience as a Site Reliability Engineer or similar role, with a history of supporting complex distributed systems.
  • Observability Tools : Experience with monitoring and debugging tools like Prometheus, Vector, Grafana, Superset, or Kibana.
  • Cloud Platforms : Proficiency in at least one major cloud platform (AWS, GCP, Azure, or Linode).
  • UI Development Experience : Hands-on experience building internal tooling using modern frontend frameworks (e.g., React, Vue, or Angular etc), enabling improved visibility, and operational workflows for engineering teams.
  • Database Knowledge : Experience with SQL databases; familiarity with PostgreSQL is a plus but not required.
  • Programming / Scripting Skills : Proficiency in Unix scripting and programming languages such as Python or Go
  • Linux Expertise : Strong experience with Linux systems, including performance tuning and system-level troubleshooting.
  • Communication Skills : Excellent written and verbal communication skills, with the ability to convey technical concepts clearly to diverse audiences, including customers and cross-functional teams.
  • Hydrolix provides equal employment opportunities without regard to an applicant’s race, sex, pregnancy, sexual orientation, gender identity or expression, genetic information, national origin, age, physical or mental disability, medical condition, religion, marital status or veteran status.Applicants with disabilities may be entitled to reasonable accommodation under the terms of the Americans with Disabilities Act and certain state or local laws. A reasonable accommodation is a change in the way things are normally done which will ensure an equal employment opportunity without imposing undue hardship on Hydrolix. Please inform us if you need assistance completing any forms or to otherwise participate in the application process.

    Create a job alert for this search

    Site Reliability Engineer • India

    Related jobs
    Site Reliability Engineer

    Site Reliability Engineer

    Pagos Consultants • India, India
    This team will play a pivotal role in spearheading innovation.As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its future d...Show more
    Last updated: 4 days ago • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaService • India, India
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show more
    Last updated: 25 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Synechron • Pune, Republic Of India, IN
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5 to 9 years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology special...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer Rotation Shift

    Site Reliability Engineer Rotation Shift

    Synechron • Pune, Republic Of India, IN
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5-8 years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology specialist...Show more
    Last updated: 20 days ago • Promoted
    Sr Site Reliability Engineer

    Sr Site Reliability Engineer

    Media.net • Republic Of India, IN
    Net is a leading, global ad tech company that focuses on creating the most transparent and efficient path for advertiser budgets to become publisher revenue. Our proprietary contextual technology is...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2 • India
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show more
    Last updated: 4 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Insight Global • India
    Contract with Insight Global Client.Join our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building and maintaining highly reliable, automated, and scalable systems.Y...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Capgemini • India, India
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer 2

    Site Reliability Engineer 2

    PhonePe • India
    Headquartered in India, its flagship product, the PhonePe digital payments app, was launched in Aug 2016.As of April 2025, PhonePe has over 60 Crore (600 Million) registered users and a digital pay...Show more
    Last updated: 4 hours ago • Promoted • New!
    Aws Site Reliability Engineer

    Aws Site Reliability Engineer

    HTC Global Services • Chennai, Republic Of India, IN
    Troy, Michigan, is a leading global Information Technology solution and BPO provider.HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in, e-business, data ...Show more
    Last updated: 26 days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Datum Technologies Group • Chennai, Republic Of India, IN
    Job Title : Lead Site Reliability Engineer (SRE).Duration : Contract to Hire (On the Payroll of Datum Technology Group).Location : Chennai || Mumbai || Gurugram. Interview Process : Virtual (2 Rounds) +...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    HRhelpdesk • Indore, Republic Of India, IN
    Company is a rapidly growing, private equity backed SaaS product company and provides cloud-based solutions.As a Site Reliability Engineer (SRE), you will be responsible for building and maintainin...Show more
    Last updated: 16 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Infinova Global Corporate Services LLP • India
    Infinova is an emerging player in intelligent business transformation, dedicated to helping organizations scale smarter and achieve sustainable success. We are building a foundation that combines st...Show more
    Last updated: 4 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Alp Consulting Ltd. • India
    Show more
    Last updated: 4 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Worldline • Pune, Republic Of India, IN
    Worldline helps businesses of all shapes and sizes to accelerate their growth journey - quickly, simply, and securely.We are the innovators at the heart of the payments technology industry, shaping...Show more
    Last updated: 22 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Grootan Technologies • Chennai, Republic Of India, IN
    Site Reliability Engineer (SRE).In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications.You will leverage your e...Show more
    Last updated: 16 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PhonePe • Pune, Republic Of India, IN
    Troubleshoot issues across the entire stack - hardware, software, application, and network.Work to improve the reliability and performance of the next generation of distributed systems.Work to impr...Show more
    Last updated: 26 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Media.net • Republic Of India, IN
    Net is a leading, global ad tech company that focuses on creating the most transparent and efficient path for advertiser budgets to become publisher revenue. Our proprietary contextual technology is...Show more
    Last updated: 9 days ago • Promoted