No longer accepting applications

Site Reliability Engineer

GREYTIP SOFTWARE PRIVATE LIMITED • Kollam, IN

1 day ago

Job description

About the Role

We are looking for a skilled Site Reliability Engineer II to join our SRE team. The ideal candidate will have hands-on experience in production monitoring, alert handling, and L1 production support. You will play a key role in ensuring the reliability, availability, and performance of our production systems.

Key Responsibilities

Monitor production systems using enterprise monitoring tools and dashboards.
Respond to alerts promptly and take appropriate first-level actions.
Provide L1 production support, including initial triage, log analysis, and escalation to relevant teams as needed.
Participate in incident management, including documentation, communication, and coordination during production incidents.
Perform basic troubleshooting for application, infrastructure, and platform issues.
Ensure adherence to SLAs, SLOs, and operational best practices.
Contribute to runbooks, knowledge base articles, and incident postmortems.
Collaborate with engineering and DevOps teams for incident resolution and improvements.
Participate in on-call rotations as required.

Required Skills & Qualifications

2–5 years of experience in SRE, Production Support, DevOps, or similar roles.

Hands-on experience with production monitoring tools (e.g., Prometheus, Grafana, Datadog, New Relic, Splunk, CloudWatch).

Strong understanding of alerting systems, incident lifecycle, and on-call processes.

Basic troubleshooting knowledge in Linux/Unix, networking fundamentals, and cloud environments.

Familiarity with logging tools (e.g., ELK, Splunk, Cloud Logging).

Ability to communicate clearly during incidents and coordinate with cross-functional teams.

Strong analytical, problem-solving, and time-management skills.

Good to Have

Experience with cloud platforms (AWS/Azure/GCP).

Basic scripting skills (Python, Shell, Bash).

Exposure to CI/CD pipelines and DevOps practices.

Understanding of SLOs, SLIs, and reliability engineering principles.
