Talent.com
Site Reliability Engineer
Site Reliability EngineerHydrolix • Bangalore, IN
Site Reliability Engineer

Site Reliability Engineer

Hydrolix • Bangalore, IN
1 day ago
Job description

About the job

At Hydrolix, we are revolutionizing the world of data management and analytics with our innovative cloud data platform, purpose-built for petabyte-scale datasets. Our mission is to help organizations drastically reduce data costs while increasing their data retention.We are looking for a Site Reliability Engineer (SRE) with 8 to 10+ years experience to join our dynamic Services team. In this role, you will contribute to the reliability and scalability of our cutting-edge platform, ensuring exceptional solutions tailored to our customers’ unique needs. This is a highly technical, hands-on role that requires deep expertise in system reliability and automation.

Key Responsibilities

  • Infrastructure Reliability : Deploy, maintain, and ensure a highly reliable fleet of Kubernetes clusters and Hydrolix deployments across multiple cloud platforms.
  • Service Optimization : Design, implement, and maintain systems and processes to enhance the reliability, availability, and performance of our services.
  • CI / CD Management : Build and optimize CI / CD tools and processes to ensure efficient and reliable deployments.
  • Monitoring and Incident Response : Develop and manage monitoring, alerting, and incident response strategies to minimize downtime and enable rapid recovery.
  • Root Cause Analysis : Conduct comprehensive root cause analyses for system failures, implementing long-term preventive measures.
  • Automation and Efficiency : Automate repetitive tasks and optimize system performance to improve operational efficiency.
  • On-Call Support : Participate in covering weekday business hours and once-monthly weekend shifts.

Collaboration and Customer Engagement

  • Cross-Functional Teamwork : Work closely with software engineering, infrastructure, and product teams to integrate reliability practices into every stage of the development lifecycle.
  • Reliability Advocacy : Champion SRE best practices and foster a culture of operational excellence across the organization.
  • Global Team Collaboration : Collaborate with a distributed team of engineers worldwide to provide round-the-clock support.
  • Customer Support : Interface with customers to address and resolve reported incidents, ensuring a seamless user experience.
  • Qualifications and Skills

  • SRE Expertise : Proven experience as a Site Reliability Engineer or similar role, with a history of supporting complex distributed systems.
  • Observability Tools : Experience with monitoring and debugging tools like Prometheus, Vector, Grafana, Superset, or Kibana.
  • Cloud Platforms : Proficiency in at least one major cloud platform (AWS, GCP, Azure, or Linode).
  • UI Development Experience : Hands-on experience building internal tooling using modern frontend frameworks (e.g., React, Vue, or Angular etc), enabling improved visibility, and operational workflows for engineering teams.
  • Database Knowledge : Experience with SQL databases; familiarity with PostgreSQL is a plus but not required.
  • Programming / Scripting Skills : Proficiency in Unix scripting and programming languages such as Python or Go
  • Linux Expertise : Strong experience with Linux systems, including performance tuning and system-level troubleshooting.
  • Communication Skills : Excellent written and verbal communication skills, with the ability to convey technical concepts clearly to diverse audiences, including customers and cross-functional teams.
  • Hydrolix provides equal employment opportunities without regard to an applicant’s race, sex, pregnancy, sexual orientation, gender identity or expression, genetic information, national origin, age, physical or mental disability, medical condition, religion, marital status or veteran status.Applicants with disabilities may be entitled to reasonable accommodation under the terms of the Americans with Disabilities Act and certain state or local laws. A reasonable accommodation is a change in the way things are normally done which will ensure an equal employment opportunity without imposing undue hardship on Hydrolix. Please inform us if you need assistance completing any forms or to otherwise participate in the application process.

    Create a job alert for this search

    Site Reliability Engineer • Bangalore, IN

    Related jobs
    Site Reliability Engineer

    Site Reliability Engineer

    Synamedia • Bengaluru, Karnataka, India
    At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed. We are backed by the Permira funds and Sky.This is the age of infinite ...Show more
    Last updated: 22 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Reyika • Bengaluru, Karnataka, India
    Senior Site Reliability Engineer / Reliability Architect.Pune,Bengalore,Chennai,Pune,Noida.Reliability Architect with over 9 years of experience in proactive monitoring, automation, and observabili...Show more
    Last updated: 13 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Signzy • Bengaluru, Karnataka, India
    Signzy is an AI-powered RPA platform for financial services.No matter how complex your workflow or operational complexity, Signzy can completely automate your back-operations decision-making proces...Show more
    Last updated: 8 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Delta Electronics India • Bengaluru, Karnataka, India
    Define and monitor Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to balance reliability with feature velocity and ensure optimal system availability.Respond to...Show more
    Last updated: 10 days ago • Promoted
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    Tata Consultancy Services • Bengaluru, Karnataka, India
    Senior Site Reliability Engineer (SRE).Senior Site Reliability Engineer (SRE).Desired Experience Range : 7 - 10 yrs.Notice Period : Immediate to 90Days only. We are currently planning to do a Virtual....Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Pagos Consultants • Bengaluru, IN
    This team will play a pivotal role in spearheading innovation.As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its future d...Show more
    Last updated: 5 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Synechron • Bengaluru, Karnataka, India
    We have immediate opportunity for Senior Site Reliability Engineer.Senior Site Reliability Engineer.At Synechron, we believe in the power of digital to transform businesses for the better.Our globa...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Enterprise Minds, Inc • Bengaluru, Karnataka, India
    Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call).Site Reliability Engineer (SRE).If you thrive in fast-paced environments, excel in incident management, and love buildin...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    o9 Solutions, Inc. • Bengaluru, Karnataka, India
    Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show more
    Last updated: 18 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ACL Digital • Bengaluru, Karnataka, India
    Overall work experience of 4 - 6 years.Deep understanding and Experience of AWS Cloud is a must.Experience with automation tools such as Terraform. Proficient at building Docker container configurat...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    GREYTIP SOFTWARE PRIVATE LIMITED • Bengaluru, Karnataka, India
    The ideal candidate will have hands-on experience in.You will play a key role in ensuring the reliability, availability, and performance of our production systems. Monitor production systems using e...Show more
    Last updated: 15 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2 • Bengaluru, Karnataka, India
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WhiteLotus Talent Partners • Bengaluru, Karnataka, India
    L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Capgemini • Bangalore, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Glocomms • Bengaluru, Karnataka, India
    We are currently looking for an SRE Lead - to join our customer - an IT consultancy with urgent projects on board.This will be a 6 month contract initially with an option to extend further.Assess a...Show more
    Last updated: 8 days ago • Promoted
    Staff Site Reliability Engineer (Observability)

    Staff Site Reliability Engineer (Observability)

    Palo Alto Networks • Bengaluru, Karnataka, India
    At Palo Alto Networks® everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and m...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Media.net • Bengaluru, Karnataka, India
    Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms. HQ is based in New York, and the Global H...Show more
    Last updated: 30+ days ago • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Rakuten India • Bengaluru, Karnataka, India
    Design, develop SLA, SLO, SLI of services within the Business Unit.Involve in whole process of Development, Production System Operation including system maintenance, monitoring, automation, backend...Show more
    Last updated: 30+ days ago • Promoted