Talent.com
No longer accepting applications
Site Reliability Engineer

Site Reliability Engineer

HireAlphakottayam, India
1 day ago
Job description

Job Description- Site Reliability Engineer

Experience- 8+ Years

Responsibilities :

  • Ensure high availability, performance, and scalability of mission-critical systems and services.
  • Lead the design and implementation of resilient and fault-tolerant infrastructure.
  • Drive incident response, root cause analysis, and postmortem culture. Mentor others in incident practices.
  • Write and maintain operational documentation, runbooks, and architecture diagrams.
  • Drive and promote protocols on production readiness and operational excellence.
  • Own and evolve infrastructure automation using Terraform or similar tools to remove as much as possible any human intervention.
  • Help automate infrastructure provisioning and other engineering processes by working on automations built on top of an engineering platform written in GitHub Actions.
  • Build internal platforms, tools, and frameworks to improve developer productivity and service reliability.
  • Work closely with software engineers, platform teams, and product managers to align on company goals.
  • Coach and up-skill other engineering team members

Skills and Qualifications :

  • 8–12+ years in SRE, DevOps, or related infrastructure-focused roles.
  • Understand large-scale complex systems from a reliability perspective.
  • Design, implement and maintain processes and tools.
  • Passion for producing clean, standards-compliant, secure code.
  • Bringing a developer mindset and applying it to infrastructure
  • Strong experience with Linux / Unix systems.
  • Deep experience with Kubernetes.
  • Deep experience with tools like Terraform, Ansible, Helm.
  • Strong coding skills in scripts for automating the execution of certain tasks with a programming language like Python, Bash or any other scripting language.
  • Experience with at least one relational and non-relational databases (ex : PostgreSQL, MySQL, MongoDB, Redis, ElasticSearch).
  • Ability to identify time consuming and error prone manual tasks and then build / leverage tooling to automate them.
  • Ability to identify root causes of instability in a large-scale distributed system across stacks.
  • Experience leading high-severity incident responses and postmortems
  • Nice to haves / Pluses :

  • Experience with cloud-based solutions such as Amazon AWS, Google Cloud, or Microsoft Azure.
  • Experience supporting scalable DBs like PostgreSQL, or MongoDB in production.
  • Understanding of cost
  • Create a job alert for this search

    Site Reliability Engineer • kottayam, India

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiKottayam, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 11 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    iSoftStoneErnākulam, Republic Of India, IN
    Greetings from ISoftStone Inc!.This is Rajlaxmi from the HR department of ISoftStone Inc.We are looking for a SRE / Devops. Location- Bangalore / Hybrid (2-3 days WFO).Bachelors degree in computer scie...Show moreLast updated: 17 hours ago
    • Promoted
    Senior Site Reliability Engineer- Elk Expert

    Senior Site Reliability Engineer- Elk Expert

    iVedha Inc.Kottayam, Republic Of India, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 16 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Kottayam, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeAlappuzha, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 14 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmakottayam, kerala, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 22 days ago
    • Promoted
    • New!
    Site Reliability Engineer - Azure

    Site Reliability Engineer - Azure

    PhonePeAlleppey, Republic Of India, IN
    We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production serv...Show moreLast updated: 13 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupAlappuzha, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 23 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Nebula Tech Solutionsmount, kerala, in
    SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show moreLast updated: 2 days ago
    • Promoted
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade Globalalappuzha, kerala, in
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 2 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ACL DigitalAlappuzha, Kerala, India
    ACL Digital (An Alten Group Company) hiring for Site Reliability Engineer.Interested candidates can reach out at.Experience : 5+ Years Location : Devarabisanahalli, Bengaluru Notice Period : Less than...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Sr Engineer, Site Reliability T500-21295

    Sr Engineer, Site Reliability T500-21295

    TMUS Global SolutionsAlleppey, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 14 hours ago
    • Promoted
    Senior Site Reliability Engineer (Sre) – Datadog Observability

    Senior Site Reliability Engineer (Sre) – Datadog Observability

    Jade GlobalAlleppey, Republic Of India, IN
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer- Cloud Platform

    Senior Site Reliability Engineer- Cloud Platform

    ConfidentialCochin / Kochi / Ernakulam
    As a Senior Site Reliability Engineer, you will be responsible for : .Demonstrating best practices pertaining to Cloud DevOps development along with a willingness to continually learn Cloud native te...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionsalappuzha, kerala, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 19 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Atomic NorthKochi, Kerala, India
    Hybrid ( Bengaluru, Bhopal, Gurgaon, Hyderabad, Jaipur, Mumbai, Pune , Chennai) Shift : .What You’ll Do Design, build, and manage secure, scalable AWS infrastructure (EC2, Lambda, EKS, S3, RDS, IAM, ...Show moreLast updated: 13 hours ago
    • Promoted
    Senior Site Reliability Engineer / Senior Cloud Engineer

    Senior Site Reliability Engineer / Senior Cloud Engineer

    CloudHiremount, kerala, in
    The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture.Repo...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidentialCochin / Kochi / Ernakulam
    Job Title - Site Reliability Engineer + Specialist + Global Song.Management Level : 9,Specialist .Must have skills : Python, Go, or Java. Good to have skills : Expertise with cloud platfor...Show moreLast updated: 30+ days ago