Talent.com
Senior Site Reliability Engineer
OneAdvanced • India
3 days ago
Job description

We’re looking for a Senior SRE Automation Engineer to lead and drive automation across the operations lifecycle. The ideal candidate will be responsible for identifying and implementing automation opportunities to reduce manual intervention, minimise service tickets, and enable self-service capabilities.

Key Responsibilities

  • Design and implement automation pipelines to eliminate manual operational tasks and improve service efficiency.
  • Automate all manual patching activity.
  • Integrate AI/ML-based tools for incident detection, root cause analysis, and automated remediation to enhance platform resilience.
  • Build and maintain self-healing scripts and workflows using Infrastructure-as-Code (IaC) and event-driven automation frameworks.
  • Analyze recurring incidents to identify patterns and opportunities for automation and optimization.
  • Identify and automate standard operating procedures and repetitive day-to-day tasks to reduce ticket volume and manual intervention.
  • Lead service improvement initiatives through automation to improve overall team performance and customer satisfaction.
  • Own and continuously improve observability and alerting strategies to support proactive operations.
  • Effectively communicate with users to build trust and drive timely resolution of issues within SLA.
  • Collaborate with cross-functional teams to resolve complex problems and align on operational goals.
  • Handle escalations and critical incidents in a fast-paced environment with clear communication and swift action.
  • Mentor junior engineers, fostering a DevOps-first culture and encouraging skill development.
  • Demonstrate strong analytical and troubleshooting skills, including real-time issue identification and resolution in live environments.
  • Maintain thorough and accurate documentation of automation implementations, including known gaps and future opportunities.

Required Skills & Experience

  • Excellent analytical and problem-solving skills to diagnose, troubleshoot, and resolve complex technical issues.
  • Automation experience, e.g. automated patching.
  • Proficient in scripting and programming languages such as Python, Go, and Bash.
  • Strong hands-on experience with automation frameworks and tools including Terraform, Ansible, Chef, and Puppet.
  • Familiarity with automation scripting tools for infrastructure and operations (e.g. Python, Terraform, Ansible).
  • Experience working with AI-driven operations tools and AIOps platforms such as Moogsoft, BigPanda, Dynatrace, or custom ML-based pipelines.
  • In-depth knowledge of CI/CD, GitOps, and event-driven systems for modern DevOps practices.
  • Solid background in Linux systems and containerized environments like Docker and Kubernetes.
  • Proven experience in designing resilient, self-healing systems for high availability and operational efficiency.
  • Deep understanding of cloud platforms and technologies, including Microsoft Azure and Amazon Web Services (AWS), as well as on-premises and data center environments.
  • Experience integrating with LLMs for operational tasks or incident summarization.
  • Certifications in cloud platforms or DevOps tools (e.g., AWS Certified DevOps Engineer).
  • Exposure to service mesh, service discovery, or modern networking stacks.