Talent.com
This job offer is not available in your country.
Rosemallow Technologies - Site Reliability Engineer - Observability Services

Rosemallow Technologies - Site Reliability Engineer - Observability Services

ROSEMALLOW TECHNOLOGIES PRIVATE LIMITEDCoimbatore
30+ days ago
Job description

Job Title : Site Reliability Engineer (SRE)

Location : Coimbatore, Pune

Interview Mode : 2 rounds (F2F)

Department : Technology / Infrastructure / DevOps

Employment Type : Full-time

Job Summary :

We are seeking an experienced Site Reliability Engineer (SRE) who will play a critical role in ensuring the reliability, performance, and scalability of our payment systems.

The ideal candidate will possess deep expertise in DevOps automation, enterprise monitoring, and cloud platforms, along with a strong background in Card Payment systems.

This role requires hands-on technical skills, a passion for problem-solving, and the ability to collaborate across teams in a fast-paced, dynamic environment.

Key Responsibilities : & Performance :

  • Ensure the reliability, availability, and performance of critical payment platforms and services.
  • Drive root cause analysis (RCA) and implement long-term solutions to prevent recurrence of incidents.
  • Manage capacity planning, scalability, and performance tuning across cloud and on-prem environments.
  • Lead and participate in the on-call rotation, providing timely support and issue resolution.

DevOps Automation & CI / CD :

  • Design, implement, and maintain CI / CD pipelines using Jenkins, GitHub, and other DevOps tools.
  • Automate infrastructure deployment, configuration, and monitoring, following Infrastructure as Code (IaC) principles.
  • Enhance automation for routine operational tasks, incident response, and self-healing capabilities.
  • Monitoring & Observability :

  • Implement and manage enterprise monitoring solutions including Splunk, Dynatrace, Prometheus, and Grafana.
  • Build real-time dashboards, alerts, and reporting to proactively identify system anomalies.
  • Continuously improve observability, logging, and tracing across all environments.
  • Cloud Platforms & Infrastructure :

  • Work with AWS, Azure, and PCF (Pivotal Cloud Foundry) environments, managing cloud-native services and infrastructure.
  • Design and optimize cloud architecture for reliability and cost-efficiency.
  • Collaborate with cloud security and networking teams to ensure secure and compliant infrastructure.
  • Payment Systems Expertise :

  • Apply your understanding of Card Payment systems to ensure platform reliability and compliance.
  • Troubleshoot payment-related issues, ensuring minimal impact on transaction flows and customer experience.
  • Collaborate with product and development teams to ensure alignment with business objectives.
  • (ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Coimbatore