Talent.com
Site Reliability Engineer
Site Reliability EngineerGREYTIP SOFTWARE PRIVATE LIMITED • kollam, India
No longer accepting applications
Site Reliability Engineer

Site Reliability Engineer

GREYTIP SOFTWARE PRIVATE LIMITED • kollam, India
17 hours ago
Job description

About the Role

We are looking for a skilled Site Reliability Engineer II to join our SRE team. The ideal candidate will have hands-on experience in production monitoring, alert handling, and L1 production support . You will play a key role in ensuring the reliability, availability, and performance of our production systems.

Key Responsibilities

  • Monitor production systems using enterprise monitoring tools and dashboards.
  • Respond to alerts promptly and take appropriate first-level actions.
  • Provide L1 production support , including initial triage, log analysis, and escalation to relevant teams as needed.
  • Participate in incident management, including documentation, communication, and coordination during production incidents.
  • Perform basic troubleshooting for application, infrastructure, and platform issues.
  • Ensure adherence to SLAs, SLOs, and operational best practices.
  • Contribute to runbooks, knowledge base articles, and incident postmortems.
  • Collaborate with engineering and DevOps teams for incident resolution and improvements.
  • Participate in on-call rotations as required.

Required Skills & Qualifications

  • 2–5 years of experience in SRE, Production Support, DevOps, or similar roles.
  • Hands-on experience with production monitoring tools (e.g., Prometheus, Grafana, Datadog, New Relic, Splunk, CloudWatch, etc.).
  • Strong understanding of alerting systems , incident lifecycle, and on-call processes.
  • Basic troubleshooting knowledge in Linux / Unix , networking fundamentals, and cloud environments.
  • Familiarity with logging tools (e.g., ELK, Splunk, Cloud Logging).
  • Ability to communicate clearly during incidents and coordinate with cross-functional teams.
  • Strong analytical, problem-solving, and time-management skills.
  • Good to Have

  • Experience with cloud platforms (AWS / Azure / GCP).
  • Basic scripting skills (Python, Shell, Bash).
  • Exposure to CI / CD pipelines and DevOps practices.
  • Understanding of SLOs, SLIs, and reliability engineering principles.
  • Create a job alert for this search

    Site Reliability Engineer • kollam, India

    Related jobs
    Equifax - Senior Site Reliability Engineer - IAC Terraform

    Equifax - Senior Site Reliability Engineer - IAC Terraform

    Equifax • Trivandrum
    About the job Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distr...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer II

    Site Reliability Engineer II

    Confidential • Thiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    The world's top banks use Zafin's integrated platform to drive transformative customer value.Powered by an innovative AI-powered architecture, Zafin's platform seamlessly unifies data from across t...Show more
    Last updated: 17 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Confidential • Thiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    Job Title : Senior Site Reliability Engineer (SRE II).Location : Thiruvananthapuram, KL (Hybrid 3 days Onsite).We're looking for an experienced. Senior Site Reliability Engineer.The ideal candidate ha...Show more
    Last updated: 13 days ago • Promoted
    Founding MLOps Engineer

    Founding MLOps Engineer

    Vectorial AI • Kollam, IN
    Vectorial is a simulation engine platform powered by millions of synthetic users—state-of-the-art models that capture real human behavior—to deliver instant, nuanced validation across the entire pr...Show more
    Last updated: 7 days ago • Promoted
    Site Reliability Engineer - DevOps

    Site Reliability Engineer - DevOps

    Aim Plus Staffing Solutions • Thiruvananthapuram
    Mandatory skills : We are seeking a highly skilled Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and CI / CD automation to lead cloud infra...Show more
    Last updated: 11 days ago • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Confidential • Thiruvananthapuram / Trivandrum
    As a Site Reliability Engineer (SRE) you will be responsible for improving the overall reliability of applications by ensuring its availability, performance, and scalability.Should be able to gathe...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Confidential • Thiruvananthapuram / Trivandrum, India
    Site Reliability Engineering (SRE).Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that ...Show more
    Last updated: 17 days ago • Promoted
    Lead Engineer

    Lead Engineer

    Hyqoo • Kollam, IN
    Design, deploy, and manage AWS cloud infrastructure, including EC2 instances, S3 buckets, VPCs, RDS databases, and Lambda functions. Assist in the design, implementation, and maintenance of backup, ...Show more
    Last updated: 8 days ago • Promoted
    Lead Test Engineer

    Lead Test Engineer

    dSPACE • Thiruvananthapuram, Kerala, India
    For our office in Trivandrum, dSPACE is looking for creative and motivated professionals.You will work on exciting and technologically cutting-edge product development projects in the areas of code...Show more
    Last updated: 30+ days ago • Promoted
    Delinea Implementation Engineer

    Delinea Implementation Engineer

    K&K Talents - India • Thiruvananthapuram, IN
    This position is with one of our.Title : Delinea Implementation Engineer.Employment Type : Full-time Permanent.Delinea Implementation Engineer. Delinea (formerly Thycotic & Centrify) Privileged Access...Show more
    Last updated: 26 days ago • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaService • Thiruvananthapuram, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show more
    Last updated: 11 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CodeKarma • Alleppey, Republic Of India, IN
    About InstaServiceInstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ st...Show more
    Last updated: 23 hours ago • Promoted
    Site Project Manager

    Site Project Manager

    CP Kukreja Architects • Alappuzha, IN
    Project Manager – Civil (Site-Based).Type of project : Township and Development Project.The Project Manager – He will be responsible for overseeing all aspects of the project.This includes coordinat...Show more
    Last updated: 16 hours ago • Promoted • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdge • Kollam, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show more
    Last updated: 26 days ago • Promoted
    Senior DevOps & Database Reliability Engineer – 100% Remote

    Senior DevOps & Database Reliability Engineer – 100% Remote

    Hyly.AI • Thiruvananthapuram, IN
    Remote
    AI, we’re building the first AI + Data Fabric for the multifamily industry, transforming how clients manage, secure, and scale their marketing and operational data. As the industry moves toward a co...Show more
    Last updated: 5 days ago • Promoted
    Compliance Engineer - Safety and Quality Compliance (Remote)

    Compliance Engineer - Safety and Quality Compliance (Remote)

    Certivo • Kollam, IN
    Remote
    Certivo is an AI-first platform that assembles, validates, and keeps regulatory.We turn messy supplier documents into.You’ll be the company’s point of truth for. Your work directly determines whethe...Show more
    Last updated: 24 days ago • Promoted
    Sitecore Developer

    Sitecore Developer

    Technocratic Solutions • Kollam, IN
    CDN routing, optimizing content delivery,.Show more
    Last updated: 16 hours ago • Promoted • New!
    Equifax - Site Reliability Engineer

    Equifax - Site Reliability Engineer

    Equifax • Trivandrum
    Site Reliability Engineering (SRE) at Equifax SRE is a discipline that combines software and systems engineering for building and running large-scale, distrib...Show more
    Last updated: 30+ days ago • Promoted