Talent.com
Lead Site Reliability Engineer

Lead Site Reliability Engineer

Futurism Technologies, INC.vellore, India
20 hours ago
Job description

Job Title : Site Reliability Engineering (SRE) Lead

Location : Hinjewadi Phase-1 (WFO)

Experience : 7+ years of experience

Shift Time : 11 : 00 AM to 8 : 00 PM

Working Days : Monday to Friday

About the Role

We are seeking a highly skilled and experienced SRE Lead to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS and Azure. You will lead a team responsible for building and maintaining automated deployment pipelines, infrastructure as code, and observability systems using GitHub Actions, Terraform, and Datadog.

As the SRE Leader, you will collaborate closely with development, operations, and security teams to ensure our services are highly available, secure, and performant, while fostering a culture of automation, monitoring, and continuous improvement.

Key Responsibilities

  • Lead and mentor a team of SRE engineers to design, build, and maintain reliable, scalable, and secure cloud infrastructure across AWS and Azure.
  • Architect and implement Infrastructure as Code (IaC) solutions primarily using Terraform to manage multi-cloud environments efficiently.
  • Develop, maintain, and optimize CI / CD pipelines leveraging GitHub Actions to enable fast and reliable software delivery.
  • Establish and drive best practices in site reliability, monitoring, alerting, and incident response using Datadog and other observability tools.
  • Collaborate with software engineering teams to improve system reliability through automation, load testing, and performance tuning.
  • Define and track SLOs, SLIs, and error budgets; lead incident retrospectives and continuous improvement initiatives.
  • Manage cloud resource costs and optimize usage across multiple cloud providers.
  • Promote a DevOps culture emphasizing automation, continuous deployment, and proactive incident management.
  • Stay current with the latest industry trends and technologies in cloud, automation, and SRE practices.

Required Skills

  • 7+ years of experience in Site Reliability Engineering, DevOps, or cloud infrastructure roles.
  • Implement dashboards to monitor and track SLOs, SLIs, and error budgets; lead incident retrospectives and continuous improvement initiatives.
  • Proven experience leading and mentoring engineering teams.
  • Strong hands-on experience with AWS and Azure cloud platforms.
  • Expert in Infrastructure as Code using Terraform with multi-cloud deployments.
  • Proficient in building and managing CI / CD pipelines using GitHub Actions.
  • Deep knowledge of monitoring and observability tools, especially Datadog.
  • Solid understanding of networking, security, container orchestration (Kubernetes is a plus), and cloud-native architectures.
  • Strong scripting and automation skills (Python, Bash, or similar).
  • Experience with incident management, root cause analysis, and capacity planning.
  • Excellent communication, leadership, and collaboration skills.
  • Technical Skills

  • IAC : Terraform
  • CICD : Git Action, Git workflow and ArgoCD
  • Observability : Datadog, Prometheus and Fluent bit
  • POD Orchestration : EKS and EKS Faregate
  • Cloud : AWS and Azzure
  • Preferred

  • Certifications such as AWS Certified DevOps Engineer, Azure DevOps Engineer, or HashiCorp Terraform Associate.
  • Experience with Kubernetes and service mesh technologies.
  • Familiarity with chaos engineering and resilience testing.
  • Knowledge of security best practices in cloud environments.
  • Create a job alert for this search

    Site Reliability Engineer • vellore, India

    Related jobs
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeVellore, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 14 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Vellore, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Nebula Tech Solutionsvellore, tamil nadu, in
    SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer- Elk Expert

    Senior Site Reliability Engineer- Elk Expert

    iVedha Inc.Vellore, Republic Of India, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 15 days ago
    • Promoted
    Senior MLOps Engineer

    Senior MLOps Engineer

    Mitchell Martin Inc.Vellore, IN
    Include, but are not limited to, the following : .Own productionizing models—from tracked experiments to governed releases—ensuring resilient services with clear SLOs, runbooks, and fast, safe rollba...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Sustenance Engineer - Storage

    Lead Sustenance Engineer - Storage

    DDNVellore, IN
    This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a globa...Show moreLast updated: 30+ days ago
    • Promoted
    MLOps Lead Engineer

    MLOps Lead Engineer

    RecroVellore, IN
    Experience with Azure services such as Azure AI services, Azure Search, Azure ML, Databricks, Azure Kubernetes Service, and AWS services like AWS SageMaker, AWS Bedrock and AWS Lambda.Exposure to G...Show moreLast updated: 22 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmavellore, tamil nadu, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 21 days ago
    • Promoted
    Lead - Cloud Reliability Engineer

    Lead - Cloud Reliability Engineer

    Searce Incvellore, tamil nadu, in
    The ‘process-first’ AI-native modern tech consultancy that's rewriting the rules.As an engineering-led consultancy, we are dedicated to relentlessly improving the real business outcomes.Our solvers...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer / Senior Cloud Engineer

    Senior Site Reliability Engineer / Senior Cloud Engineer

    CloudHirevellore, tamil nadu, in
    The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture.Repo...Show moreLast updated: 21 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiVellore, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 11 days ago
    • Promoted
    Technical Lead

    Technical Lead

    ThumoVellore, IN
    Founding Engineer @ Thumo (Africa’s first super-app).We’re building Africa’s super-app, starting with food delivery.M funding round led by Soma Capital with top Silicon Valley angels, we’re hiring ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade Globalvellore, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 1 day ago
    • Promoted
    Deployment Engineer

    Deployment Engineer

    AvocaVellore, IN
    Build, launch & optimize AI agents that power the next generation of home-service customer experiences.Avoca is the all-in-one AI lead-conversion platform. Our technology boosts booking rates, slash...Show moreLast updated: 30+ days ago
    • Promoted
    Delinea Implementation Engineer

    Delinea Implementation Engineer

    K&K Talents - Indiavellore, tamil nadu, in
    This position is with one of our.Title : Delinea Implementation Engineer.Employment Type : Full-time Permanent.Delinea Implementation Engineer. Delinea (formerly Thycotic & Centrify) Privileged Access...Show moreLast updated: 13 days ago
    • Promoted
    Project Manager Civil

    Project Manager Civil

    STAT ConsultancyChittoor, Andhra Pradesh, India
    This is a full-time on-site (30 kms from CMC Vellore,TN) job role for Project Manager- Civil, who will be responsible for overseeing & forecasting all construction related activities on-site and en...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    Tata Consultancy Servicesvellore, India
    Senior Site Reliability Engineer (SRE).Senior Site Reliability Engineer (SRE).Desired Experience Range : 7 - 10 yrs.Notice Period : Immediate to 90Days only. We are currently planning to do a Virtual....Show moreLast updated: 11 days ago
    • Promoted
    • New!
    Hiring HighByte Senior Lead Engineers.

    Hiring HighByte Senior Lead Engineers.

    Cognizantvellore, India
    A HighByte Senior Lead Engineer is responsible for Leading HighByte deployment for multiple sites, defining the Infrastructure and Env specifications, creating the required documents and SOPs, depl...Show moreLast updated: 20 hours ago