Talent.com
Systems Reliability Specialist

Systems Reliability Specialist

GREYTIP SOFTWARE PRIVATE LIMITEDBengaluru, Republic Of India, IN
4 days ago
Job description

About the Role

We are looking for a skilled Site Reliability Engineer II to join our SRE team. The ideal candidate will have hands-on experience in production monitoring, alert handling, and L1 production support . You will play a key role in ensuring the reliability, availability, and performance of our production systems.

Key Responsibilities

  • Monitor production systems using enterprise monitoring tools and dashboards.
  • Respond to alerts promptly and take appropriate first-level actions.
  • Provide L1 production support , including initial triage, log analysis, and escalation to relevant teams as needed.
  • Participate in incident management, including documentation, communication, and coordination during production incidents.
  • Perform basic troubleshooting for application, infrastructure, and platform issues.
  • Ensure adherence to SLAs, SLOs, and operational best practices.
  • Contribute to runbooks, knowledge base articles, and incident postmortems.
  • Collaborate with engineering and DevOps teams for incident resolution and improvements.
  • Participate in on-call rotations as required.

Required Skills & Qualifications

  • 2–5 years of experience in SRE, Production Support, DevOps, or similar roles.
  • Hands-on experience with production monitoring tools (e.G., Prometheus, Grafana, Datadog, New Relic, Splunk, CloudWatch, etc.).
  • Strong understanding of alerting systems , incident lifecycle, and on-call processes.
  • Basic troubleshooting knowledge in Linux / Unix , networking fundamentals, and cloud environments.
  • Familiarity with logging tools (e.G., ELK, Splunk, Cloud Logging).
  • Ability to communicate clearly during incidents and coordinate with cross-functional teams.
  • Strong analytical, problem-solving, and time-management skills.
  • Good to Have

  • Experience with cloud platforms (AWS / Azure / GCP).
  • Basic scripting skills (Python, Shell, Bash).
  • Exposure to CI / CD pipelines and DevOps practices.
  • Understanding of SLOs, SLIs, and reliability engineering principles.
  • Create a job alert for this search

    System Specialist • Bengaluru, Republic Of India, IN

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ReyikaBengaluru, Karnataka, India
    Senior Site Reliability Engineer / Reliability Architect.Pune,Bengalore,Chennai,Pune,Noida.Reliability Architect with over 9 years of experience in proactive monitoring, automation, and observabili...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    London Stock Exchange GroupBangalore, India
    Engineer, Site Reliability Engineering.We are evolving our Reliability Engineering team to move beyond support and operations. As a Senior Engineer in Site Reliability, you will be part of a diverse...Show moreLast updated: 30+ days ago
    • Promoted
    Systems Reliability Specialist

    Systems Reliability Specialist

    Andor TechBengaluru, Republic Of India, IN
    IT services and consulting firm.AI-enabled IT services, application support, analytics, and test automation.With a presence across India, the USA, Europe, and the UAE, AndorTech partners with.Globa...Show moreLast updated: 1 day ago
    • Promoted
    Systems Reliability Engineer

    Systems Reliability Engineer

    ReyikaBengaluru, Republic Of India, IN
    Senior Site Reliability Engineer / Reliability Architect.Pune,Bengalore,Chennai,Pune,Noida.Reliability Architect with over 9 years of experience in proactive monitoring, automation, and observabili...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    JRD SystemsBengaluru, Karnataka, India
    Site Reliability Engineer (Windows / Cloud / Automation).We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments.T...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Awign Experthosur, tamil nadu, in
    Position : SRE Observability Engineer.Mandatory Skills : Observability, Grafana and Writing queries using Prometheus and Loki. We are seeking a highly experienced and driven Senior Observability Engin...Show moreLast updated: 17 hours ago
    • Promoted
    Systems Platform Specialist

    Systems Platform Specialist

    PaychexBengaluru, Republic Of India, IN
    NASDAQ : PAYX) is a leading provider of integrated human capital management.Industry expertise since 1971 (53 Years).Largest HR company for small to medium-sized businesses.Product development compa...Show moreLast updated: 9 days ago
    • Promoted
    Principal Systems Reliability Engineer

    Principal Systems Reliability Engineer

    Delta Air LinesBengaluru, Republic Of India, IN
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    GREYTIP SOFTWARE PRIVATE LIMITEDBengaluru, Karnataka, India
    The ideal candidate will have hands-on experience in.You will play a key role in ensuring the reliability, availability, and performance of our production systems. Monitor production systems using e...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ACL DigitalBengaluru, Karnataka, India
    Service Management : Maintain application uptime / performance, manage system enhancements and defects, oversee daily operational activities, and ensure continuous improvement and adherence to ITIL be...Show moreLast updated: 30+ days ago
    • Promoted
    Reliability Systems Engineer

    Reliability Systems Engineer

    super.moneyBengaluru, Republic Of India, IN
    Site Reliability Engineer (SRE) Level 3.A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and...Show moreLast updated: 17 days ago
    • Promoted
    System Reliability Engineer

    System Reliability Engineer

    Andromeda SecurityBengaluru, Karnataka, India
    We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in DevOps technologies and cloud infrastructure. The ideal candidate will have hands-on experience with Kuberne...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WhiteLotus Talent PartnersBengaluru, Karnataka, India
    L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    super.moneyBengaluru, Karnataka, India
    Site Reliability Engineer (SRE) Level 3.A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and...Show moreLast updated: 17 days ago
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Rakuten IndiaBengaluru, Karnataka, India
    Design, develop SLA, SLO, SLI of services within the Business Unit.Involve in whole process of Development, Production System Operation including system maintenance, monitoring, automation, backend...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PhonePehosur, tamil nadu, in
    SRE We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production ...Show moreLast updated: 16 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Landmark GroupBengaluru, India
    Ensure reliability and high availability of Java and microservices-based applications through proactive monitoring and automation. Define and track SLIs / SLOs to maintain service performance and stab...Show moreLast updated: 8 days ago
    • Promoted
    Reliability Monitoring Systems Specialist

    Reliability Monitoring Systems Specialist

    Tata ElectronicsKolār, Republic Of India, IN
    Tata Electronics (a wholly owned subsidiary of Tata Sons Pvt.India’s first AI-enabled state-of-the-art Semiconductor Foundry. This facility will produce chips for applications such as power manageme...Show moreLast updated: 30+ days ago