Talent.com
Site Reliability Engineer

Prometheus consulting, Hyderabad
1 day ago
Job description

WHAT YOU'LL DO:

  • Support, maintain, and enhance the reliability, scalability, and performance of our Azure-based Data Analytics Platform.
  • Collaborate closely with Data Engineers, Developers, and Architects to operationalize solutions in Synapse, Fabric, Databricks, and related Azure services.
  • Design and implement monitoring, alerting, and observability strategies to ensure end-to-end visibility of data services and pipelines.
  • Drive automation for provisioning, deployment, scaling, and recovery of critical services using Infrastructure-as-Code (IaC).
  • Implement CI/CD pipelines tailored for data workloads (e.g., notebook deployments, schema evolution, integration testing).
  • Ensure system compliance with enterprise security, privacy, and data governance policies.
  • Participate in incident response, troubleshooting, and root cause analysis to improve system resilience.
  • Optimize cost, performance, and service availability through best-practice configurations and usage monitoring.
  • Contribute to SRE playbooks and knowledge bases for operational excellence.
  • Act as a technical mentor and advocate for reliability engineering within data and product teams.

WHAT YOU'LL NEED:

  • Proven experience as an SRE or DevOps engineer supporting large-scale data platforms, preferably in an enterprise Azure environment.
  • Strong hands-on expertise with Azure Data Services, especially:
    1. Azure Synapse Analytics
    2. Microsoft Fabric
    3. Azure Databricks
    4. Azure Data Lake Storage
    5. Azure Data Factory / Synapse Pipelines

  • Deep understanding of data architecture principles, data pipeline orchestration, and distributed data processing.
  • Proficiency in Infrastructure-as-Code tools like Terraform, Bicep, or ARM templates.
  • Solid scripting experience (e.g., PowerShell, Python, or Bash) for automation tasks.
  • Familiarity with CI/CD tools (e.g., Azure DevOps, GitHub Actions) and containerization.
  • Expertise in monitoring and logging solutions such as Azure Monitor, Log Analytics, and Application Insights, and third-party tools like Prometheus/Grafana or Honeycomb.
  • Knowledge of cloud security and data governance best practices.
  • Strong analytical and problem-solving skills, with the ability to work collaboratively in a cross-functional team.
  • Excellent communication skills to engage technical and non-technical stakeholders.
  (ref: hirist.tech)
