Talent.com
No longer accepting applications
Site Reliability Engineer

Site Reliability Engineer

VXI Global Solutionskollam, India
1 day ago
Job description

We are looking for a Site Reliability Engineer with 3+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have hands-on experience with Prometheus , Grafana , along with exposure to SolarWinds . You should be comfortable working with metrics, logs, and traces , and be able to correlate telemetry data to proactively detect, diagnose, and resolve performance issues.

Key Responsibilities :

  • Design and maintain observability pipelines using OpenTelemetry, Prometheus, and Grafana.
  • Build dashboards and alerts to monitor system health, application performance, and business KPIs.
  • Integrate observability solutions with Google Cloud Platform services and SolarWinds.
  • Correlate logs, metrics, and traces to troubleshoot incidents and reduce MTTR.
  • Collaborate with SREs, DevOps, and development teams to improve end-to-end system observability.
  • Implement best practices for telemetry data collection, enrichment, storage, and visualization.

Requirements :

  • Strong experience with Prometheus and Grafana for monitoring and alerting.
  • Proficiency in OpenTelemetry for instrumenting distributed systems.
  • Working knowledge of observability tools in Google Cloud (e.g., Cloud Monitoring, Logging, Trace).
  • Exposure to SolarWinds for network and infrastructure monitoring.
  • Solid understanding of telemetry data types : metrics, logs, and traces.
  • Ability to correlate and analyze multi-source observability data.
  • Scripting skills (Python, Bash) and familiarity with Infrastructure-as-Code is a plus.
  • Preferred Qualifications :

  • Experience in Site Reliability Engineering or Platform Engineering roles.
  • Knowledge of SLIs / SLOs and performance benchmarking.
  • Experience with APM tools (e.g., Datadog, New Relic) is a plus.
  • Create a job alert for this search

    Site Reliability Engineer • kollam, India

    Related jobs
    • Promoted
    • New!
    Senior Site Reliability Engineer (C / Python)

    Senior Site Reliability Engineer (C / Python)

    EntechAlleppey, Republic Of India, IN
    Senior Software Site Reliability Engineer (C# / Python).You’ll ensure enterprise systems are reliable, scalable, and performant - driving improvements, leading SRE initiatives, and mentoring teams on...Show moreLast updated: 21 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    PeoplefyThiruvananthapuram, Kerala, India
    We’re looking for an SRE who can.Define SLIs / SLOs for Tier-0 / Tier-1 services & review quarterly.Change gating via CI / CD based on error budgets. Azure Monitor / Grafana / Prometheus / App Insights da...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Datum Technologies GroupKollam, IN
    Job Title : Site Reliability Engineer (SRE) – AWS.AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog.We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experi...Show moreLast updated: 8 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    Job Title : Senior Site Reliability Engineer (SRE II).Location : Thiruvananthapuram, KL (Hybrid 3 days Onsite).We're looking for an experienced. Senior Site Reliability Engineer.The ideal candidate ha...Show moreLast updated: 16 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    People Prime WorldwideAlappuzha, IN
    Our client is a French multinational information technology (IT) services and consulting company, headquartered in Paris, France. Founded in 1967, It has been a leader in business transformation for...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer (SRE) / DevOps Engineer

    Site Reliability Engineer (SRE) / DevOps Engineer

    Stoopa AIAlappuzha, IN
    AI is building next-generation AI-driven platforms for ports and is focused on reliability, speed, and intelligent automation. As we scale our next generation smart port product Turi, we are hiring ...Show moreLast updated: 10 hours ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    ConfidentialThiruvananthapuram / Trivandrum
    As a Site Reliability Engineer (SRE) you will be responsible for improving the overall reliability of applications by ensuring its availability, performance, and scalability.Should be able to gathe...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialThiruvananthapuram / Trivandrum, India
    Site Reliability Engineering (SRE).Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that ...Show moreLast updated: 20 days ago
    • Promoted
    Site Reliability Engineer - DevOps

    Site Reliability Engineer - DevOps

    Aim Plus Staffing SolutionsThiruvananthapuram
    Mandatory skills : We are seeking a highly skilled Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and CI / CD automation to lead cloud infra...Show moreLast updated: 14 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PhonePeThiruvananthapuram, IN
    SRE We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production ...Show moreLast updated: 16 days ago
    • Promoted
    • New!
    Site Reliability Engineer II

    Site Reliability Engineer II

    ConfidentialIndia, Thiruvananthapuram / Trivandrum, Thiruvananthapuram
    The world's top banks use Zafin's integrated platform to drive transformative customer value.Powered by an innovative AI-powered architecture, Zafin's platform seamlessly unifies data from across t...Show moreLast updated: 5 hours ago
    • Promoted
    Lead Site Reliability Specialist

    Lead Site Reliability Specialist

    PeoplefyThiruvananthapuram, Republic Of India, IN
    We’re looking for an SRE who can.Define SLIs / SLOs for Tier-0 / Tier-1 services & review quarterly.Change gating via CI / CD based on error budgets. Azure Monitor / Grafana / Prometheus / App Insights da...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Awign ExpertThiruvananthapuram, IN
    Position : SRE Observability Engineer.Mandatory Skills : Observability, Grafana and Writing queries using Prometheus and Loki. We are seeking a highly experienced and driven Senior Observability Engin...Show moreLast updated: 10 hours ago
    • Promoted
    Senior Site Reliability Engineer (C# / Python)

    Senior Site Reliability Engineer (C# / Python)

    EntechKollam, IN
    Senior Software Site Reliability Engineer (C# / Python).You’ll ensure enterprise systems are reliable, scalable, and performant - driving improvements, leading SRE initiatives, and mentoring teams on...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    Voya IndiaKollam, IN
    We are seeking a strategic and technically adept leader to drive the scalability, resilience, and operational excellence of our enterprise systems. This role will set the vision for site reliability...Show moreLast updated: 9 hours ago
    • Promoted
    Principal Reliability Engineer

    Principal Reliability Engineer

    PeoplefyThiruvananthapuram, Republic Of India, IN
    We’re looking for an SRE who can.Define SLIs / SLOs for Tier-0 / Tier-1 services & review quarterly.Change gating via CI / CD based on error budgets. Azure Monitor / Grafana / Prometheus / App Insights da...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    KarixAlappuzha, IN
    We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Serv...Show moreLast updated: 10 hours ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServiceKollam, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 14 days ago