Talent.com
GCP Reliability Engineer

GCP Reliability Engineer

Dexian IndiaBengaluru, Republic Of India, IN
5 days ago
Job description

Role Description

We are seeking an experienced and motivated engineer to join the Observability fleet which focuses on delivering tools in private and public cloud environments. The role focuses on developing and modernizing Observability platforms for cloud-native and hybrid applications, with a primary focus on Google Cloud Platform (GCP). This role involves designing, integrating, and maintaining solutions for collecting, transporting, and visualizing telemetry (tracing, metrics, and logging) to improve the reliability and uptime of our applications. You will closely collaborate with software developers, SRE, infrastructure, and security teams to drive automation and implement best-in-class observability solutions supporting both development and operations in a hybrid cloud environment.

Role & Responsibilities :

  • Build and support the modernization and integration of observability tools in private and public cloud offerings (GCP, AKS, EKS)
  • Design, implement, and automate telemetry, logging, and monitoring solutions—including dashboards, alerts, and CI / CD integration.
  • Enable teams to leverage observability data for reliability, performance, and security use cases;

provide actionable recommendations.

  • Collaborate with DevOps, SRE, and security teams to share best practices and support adoption of observability standards.
  • Mentor and upskill client teams through knowledge transfer and participate in on call activities as required.
  • Required Skills

  • At least 5 years of relevant experience in Observability, Logging, and Monitoring in enterprise environments.
  • Hands-on experience with observability tools such as Grafana, Prometheus, Loki, Cortex, Tempo, ElasticSearch, Datadog, Splunk, or equivalents
  • Experience working with container technologies (Docker, Kubernetes) and orchestration platforms (GKE or similar).
  • Proficiency in setting up and configuring dashboards, alerts, and alarms on Grafana and / or GCP Monitoring.
  • Experience in integrating observability tools with CI / CD pipelines and automating through scripting (Python, Bash, JSON, YAML, Terraform or similar).
  • Excellent communication, presentation, and problem-solving skills.
  • Proficiency with Linux operating systems and databases (MySQL, DB2, MSSQL, or similar).
  • Solid understanding of how enterprise service delivery components interact (web servers, application servers, databases, web services, storage, security).
  • Create a job alert for this search

    Reliability Engineer • Bengaluru, Republic Of India, IN

    Related jobs
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Delta Air LinesBengaluru, India
    Execute on the Incident, Change Management, Problem Management processes.Building and supporting a reliable application suite for the environment in order to meet the development and maintenance re...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ITC InfotechBengaluru, Karnataka, India
    Must-Have Requirements Experience : 5–8 years in SRE and / or DevOps roles Programming Skills : Proficiency in at least one coding language — preferably Python or C++ Platform Support : Experience...Show moreLast updated: 23 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmahosur, tamil nadu, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 25 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupBengaluru, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 4 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Core Minds Tech SOlutionsHosur
    Job Description : - Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions&l...Show moreLast updated: 30+ days ago
    • Promoted
    GCP Observability Engineer

    GCP Observability Engineer

    Dexian IndiaBengaluru, Karnataka, India
    We are seeking an experienced and motivated engineer to join the Observability fleet which focuses on delivering tools in private and public cloud environments. The role focuses on developing and mo...Show moreLast updated: 30+ days ago
    • Promoted
    Platform Reliability Engineer

    Platform Reliability Engineer

    Tata Consultancy ServicesBengaluru, Republic Of India, IN
    Minimum 5 mandate details are mandate with two or 3 liners.CI-CD (Jenkins, Bitbucket Pipelines).Minimum 2 mandate details are mandate with two or 3 liners. Location : Hyderabad & Bangalore.Show moreLast updated: 24 days ago
    • Promoted
    Reliability Engineer

    Reliability Engineer

    lululemonBengaluru, Republic Of India, IN
    Setting the bar in technical fabrics and functional design, we create transformational products and experiences that support people in moving, growing, connecting, and being well.We owe our success...Show moreLast updated: 24 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Sonata SoftwareBengaluru, Republic Of India, IN
    In today's market, there is a unique duality in technology adoption.On one side, extreme focus on cost containment by clients, and on the other, deep motivation to modernize their Digital storefron...Show moreLast updated: 26 days ago
    • Promoted
    Lead - Cloud Reliability Engineer

    Lead - Cloud Reliability Engineer

    Searce Inchosur, tamil nadu, in
    The ‘process-first’ AI-native modern tech consultancy that's rewriting the rules.As an engineering-led consultancy, we are dedicated to relentlessly improving the real business outcomes.Our solvers...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Delta Air LinesBengaluru, India
    Execute on the Incident, Change Management, Problem Management processes.Building and supporting reliable applications that meet development and maintenance requirements. Provide consultation and di...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ACL DigitalBengaluru, Karnataka, India
    Service Management : Maintain application uptime / performance, manage system enhancements and defects, oversee daily operational activities, and ensure continuous improvement and adherence to ITIL be...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiBengaluru, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 14 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    super.moneyBengaluru, Karnataka, India
    Site Reliability Engineer (SRE) Level 3.A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and...Show moreLast updated: 5 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionshosur, tamil nadu, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer (GCP)

    Site Reliability Engineer (GCP)

    ConfidentialBengaluru / Bangalore
    As SRE with Data Engineering expertise, you will be responsible for managing, maintaining, and troubleshooting GCP data pipelines. The ideal candidate will have extensive experience in cloud data en...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer II

    Site Reliability Engineer II

    RecRootsBangalore Urban, Karnataka, India
    Key Job Responsibilities and Duties : .The core premise for the SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned addressing...Show moreLast updated: 30+ days ago