Talent.com
This job offer is not available in your country.
Zoop.One - Site Reliability Engineer - DevOps

Zoop.One - Site Reliability Engineer - DevOps

ZOOPPune
30+ days ago
Job description

Role : Site Reliability Engineer.

Location : Pune (on-site).

Experience : 3+ years.

Someone who has experience setting up an in-house monitoring platform with 99.99% uptime SLA using Victoria Metrics & Prometheus in Multi Region.

Site Reliability Engineer Zoop.

The Opportunity :

We're seeking a Senior Site Reliability Engineer to elevate and standardize our reliability engineering practices. This role offers the opportunity to shape and optimize SRE practices in a high-growth fintech environment while working with cutting-edge technologies and critical identity verification services.

Key Responsibilities :

Standardization & Optimization :

  • Assess and standardize existing monitoring and observability practices across NewRelic and Prometheus.
  • Refine and formalize SLIs / SLOs for all solution offerings.
  • Optimize current alerting strategies to improve signal-to-noise ratio.
  • Document and standardize incident management processes.
  • Create comprehensive runbooks for all critical services.

Reliability Engineering :

  • Drive improvements to achieve and maintain 99.95% uptime for critical services.
  • Optimize API response times to strengthen our "Fastest Platform" positioning.
  • Implement advanced chaos engineering practices.
  • Enhance existing automation and self-healing capabilities.
  • Standardize disaster recovery and business continuity procedures.
  • Infrastructure Excellence :

  • Optimize our GCP / Kubernetes infrastructureand AWS where applicablefor enhanced reliability.
  • Standardize Infrastructure as Code (IaC) practices across teams.
  • Identify and automate remaining manual operational tasks.
  • Build advanced tooling for monitoring, deployment, and troubleshooting.
  • Drive cloud cost optimization initiatives.
  • Prepare for potential self?hosting scenarios, including operating Grafana, Prometheus, VictoriaMetrics, and log stacks such as Loki and Elastic.
  • Security & Compliance :

  • Ensure all reliability practices meet ISO 27001 : 2022, ISO 27017 : 2015, ISO 27018 : 2019, ISO 27701 : 2019, and SOC 2 Type II requirements (with a pragmatic, risk?based approach).
  • Enhance security monitoring and anomaly detection.
  • Standardize secure CI / CD practices across the organization.
  • Implement comprehensive audit and compliance reporting.
  • Collaboration & Process Improvement :

  • Partner with the Platform team to enhance and standardize existing SRE workflows.
  • Collaborate with 50+ developers to strengthen reliability culture.
  • Lead blameless post?mortems and drive systematic improvements.
  • Establish SRE best practices and knowledge's haring sessions.
  • Build a roadmap for eventual SRE team expansion.
  • Technical Requirements :

    Must?Have Skills :

  • Experience : 3+ years in SRE, DevOps, or similar roles with a focus on standardizing and scaling practices.
  • Cloud Expertise : Deep hands?on experience with Google Cloud Platform (GCP) and Amazon Web Services (AWS).
  • Container Orchestration : Advanced Kubernetes and Docker skills in production environments.
  • Programming : Proficiency in at least two of Go, Python, TypeScript, plus strong Shell's cripting abilities.
  • Operating Systems : Expert?level Linux knowledge and tuning.
  • Monitoring : Expert?level knowledge of Prometheus and NewRelic.
  • IaC : Strong experience with Terraform or similar tools.
  • Process Excellence : Proven track record of standardizing SRE practices.
  • Preferred Qualifications :

  • Experience in fintech, banking, or other high's ecurity environments.
  • Knowledge of ISO 27001, SOC 2, and related compliance requirements.
  • Experience optimizing API reliability at scale (millions of requests / day).
  • Background in maturing existing SRE practices.
  • Familiarity with identity verification or fraud detection systems.
  • GCP Professional Cloud Architect or DevOps Engineer certification.
  • Experience running self?hosted observability stacks (Grafana, Prometheus, VictoriaMetrics, Loki, Elastic).
  • (ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Pune

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    UplersPune, IN
    Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 25 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Birlasoftpune, maharashtra, in
    Be primarily responsible for providing production, operations support and application administration to business and web applications, 3rd party applications and related ecosystems.The application ...Show moreLast updated: 24 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    AllianzPune
    Site Reliability Engineer (SRE) - One Identity Access Management The primary objective of the Site Reliability Engineer (SRE) specializing in One Identity Access Mana...Show moreLast updated: 24 days ago
    • Promoted
    Dynamisch - DevOps / Site Reliability Engineer

    Dynamisch - DevOps / Site Reliability Engineer

    Dynamisch IT Pvt ltd.Pune
    Job Title : DevOps & Site Reliability Engineer Experience : 4+ Yrs Qualification : B.SC IT / MCAShow moreLast updated: 30+ days ago
    • Promoted
    Spotnana - Site Reliability Engineer - Cloud Infrastructure

    Spotnana - Site Reliability Engineer - Cloud Infrastructure

    SpotnanaPune
    Lets build whats next, together.Were on a mission to modernize the infrastructure of the $1.Our Travel-as-a-Service platform is designed to make every trip better, whether youre booking for work, b...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ExasoftPune, IN
    Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 20 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    TechVeritopune, maharashtra, in
    As a SRE Engineer, you will have a strong background in cloud infrastructure management, migration and deployment, with expertise in Google Cloud Platform (GCP), DevOps tools, and Kubernetes ecosys...Show moreLast updated: 17 hours ago
    • Promoted
    Site Reliability Engineer - Chaos Management

    Site Reliability Engineer - Chaos Management

    Xebiapune, maharashtra, in
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 8 days ago
    • Promoted
    Senior DevOps / Site Reliability Engineer

    Senior DevOps / Site Reliability Engineer

    Petals CareersPune
    About the Role : We're looking for a highly skilled and experienced Senior DevOps / SRE Engineer to join our team.In this role, you'll be responsible for bui...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer - Cloud Platforms

    Site Reliability Engineer - Cloud Platforms

    LanceSoft, IncPune
    Role and Responsibilities : Reporting to Engineering, the Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Soluti...Show moreLast updated: 19 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    XebiaPune, IN
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 27 days ago
    • Promoted
    Reveille Technologies - Site Reliability Engineer - DevOps

    Reveille Technologies - Site Reliability Engineer - DevOps

    Reveille TechnologiesPune
    Job Summary : We are seeking a skilled and proactive Site Reliability Engineer (SRE) with a strong DevOps mindset and hands-on experience in applicat...Show moreLast updated: 30+ days ago
    • Promoted
    Qualys - Senior Site Reliability Engineer - DevOps

    Qualys - Senior Site Reliability Engineer - DevOps

    QUALYS SECURITY TECHSERVICES PRIVATE LIMITEDPune
    About the job : Come work at a place where innovation and teamwork come together to support the most exciting missions in the world! <...Show moreLast updated: 22 days ago
    • Promoted
    Rosemallow Technologies - Site Reliability Engineer

    Rosemallow Technologies - Site Reliability Engineer

    ROSEMALLOW TECHNOLOGIES PRIVATE LIMITEDPune
    Job Title : Site Reliability Engineer (SRE).Department : Technology / Infrastructure / DevOps.Employment Type : Full-time.Job Summary : Show moreLast updated: 26 days ago
    • Promoted
    CrelioHealth - Site Reliability Engineer - CI / CD Pipeline

    CrelioHealth - Site Reliability Engineer - CI / CD Pipeline

    CRELIANT SOFTWARE PRIVATE LIMITEDPune
    Job Role : Site Reliability Engineer.Job Summary : We are seeking a Senior DevOps & SRE Engineer to join our team and help us build,...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConcordPune, IN
    Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 18 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Pune, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Luxoft Indiapune, maharashtra, in
    We are looking for an experienced technical developer to work for one of our client from the banking industry.Project goal is to maintain and develop solutions. Design, develop, and improve the digi...Show moreLast updated: 18 days ago