Talent.com
SRE Engineer

Confidential, Pune, India
8 days ago
Job description

We are seeking a Senior Site Reliability Engineer to support the infrastructure platforms powering our Supply Chain systems. This role blends traditional platform engineering with site reliability practices, ensuring that systems are stable, secure, and deployment-ready across on-prem and cloud environments. The ideal candidate will have strong systems administration experience, scripting ability, and an interest in driving reliability through automation, monitoring, and proactive platform support. Occasional on-call support is expected on a rotating basis.

Key Responsibilities:

Platform Configuration & Environment Readiness

  • Install, configure, and maintain platform components (Windows / Linux servers, file systems, middleware, etc.) across development, test, and production environments.
  • Prepare environments for application deployments and platform-level changes.

Incident Management & Root Cause Analysis

  • Respond to service outages with urgency and lead post-incident reviews to prevent recurrence. Drive root cause analysis (RCA) and corrective and preventive action (CAPA).
  • Develop incident playbooks and automate common response actions.

System Monitoring, Reliability & Uptime

  • Monitor system health using tools like LogicMonitor and Splunk; respond to alerts and incidents with a root cause and resolution mindset.
  • Proactively identify and address system bottlenecks and performance issues.
  • Improve system performance and reliability through configuration tuning and monitoring enhancements.
  • Help define SLAs, SLOs, and other critical KPIs, and work toward continuous improvement.
  • Track backup and restore efficiency, and record RPO and RTO as metrics.

Scripting & Automation

  • Develop and maintain scripts (e.g., PowerShell, Bash, Python) to automate health checks, administrative tasks, and environment validation.
  • Contribute to efforts that reduce manual support and increase consistency across platforms.

Deployment & Change Coordination

  • Collaborate with application teams and infrastructure engineers to validate system readiness for deployments and major changes.
  • Ensure platform-level changes follow Medline's change control, documentation, and testing procedures.

Security & Compliance

  • Apply system security best practices; ensure patching, access management, and configuration policies are in place and audit-ready.
  • Participate in ITGC, SOX, and security reviews to maintain operational compliance.

Documentation & Knowledge Sharing

  • Maintain accurate runbooks, technical documentation, and troubleshooting guides.
  • Share knowledge across the team to support 24x7 platform operations and reduce key-person risk.

High Availability Testing

  • Design and execute tests that simulate failures (e.g., node failures, network partitions) to verify system resilience.
  • Collaborate with development and infrastructure teams to ensure redundancy and fault tolerance are in place.

Continuous Improvement

  • Identify opportunities to improve observability, reduce noise, and increase system resilience.
  • Collaborate with SREs and automation engineers to advocate for platform improvements, capacity management, and performance optimization.

Qualifications:

  • Education: Bachelor's degree in Computer Science, Information Technology, Engineering, Supply Chain, or a related field, or equivalent work experience.
  • Experience:
    • 8+ years of overall experience in IT
    • 5+ years of experience in platform support, systems administration, or infrastructure engineering
  • Skills:
    • Proficiency in both Windows and Linux system administration
    • Scripting experience using PowerShell, Bash, or similar tools
    • Experience with monitoring tools such as LogicMonitor and Splunk
    • Familiarity with DevOps principles and automation practices
    • Experience supporting enterprise applications and deployment processes
    • Willingness to participate in rotating on-call support
    • Experience performing high availability and disaster recovery drills, and with related tooling
    • Knowledge of tracking, reporting, and continuously improving SLAs and related metrics

Skills Required

  Linux System Administration, PowerShell, Bash, Splunk, Python, LogicMonitor
