Talent.com
No longer accepting applications
Site Reliability Engineer

Datum Technologies Group, Coimbatore, Republic Of India, IN
18 hours ago
Job description

Job Title: Site Reliability Engineer (SRE) – Azure & AI

Experience: 7+ years

Work Mode: Hybrid

Work Location: Chennai / Mumbai / Gurgaon

Job Summary:

We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud environments using GitHub / Azure DevOps, and hands-on experience in AI model deployment and scaling. This role involves working closely with engineering teams to deliver reliable, secure, and scalable cloud infrastructure that supports AI workloads and enterprise applications.

Key Responsibilities:

  • Design, build, and maintain scalable cloud infrastructure on Microsoft Azure.
  • Automate infrastructure provisioning and deployment using Terraform, Argo, and Helm.
  • Manage and optimize Azure Kubernetes Service (AKS) for AI and microservices workloads.
  • Support AI model hosting using frameworks such as Hugging Face Transformers, vLLM, or llama.cpp on Azure OpenAI, VMs, or GPUs.
  • Implement CI/CD pipelines using GitHub Actions and integrate with JFrog Artifactory.
  • Monitor and maintain system performance and reliability using Grafana, ensuring proactive issue resolution.
  • Collaborate with development teams to align infrastructure with application requirements.
  • Enforce networking and information security best practices.
  • Manage and optimize caching and data layer performance using Redis.

Required Skills & Technologies:

  • Azure Cloud Services (including Azure OpenAI)
  • AI Model Hosting & Infrastructure
  • GitHub (CI/CD, workflows)
  • Azure Kubernetes Service (AKS)
  • Argo, Helm, Terraform
  • Docker, JFrog, Grafana
  • Networking & Security, Redis