Site Reliability Engineer III - System Architecture
HyreSnap • Bangalore
30+ days ago
Job description
Responsibilities:
Architect and lead the design of scalable, reliable infrastructure solutions.
Implement strategies for high availability, scalability, and low-latency performance.
Define service-level objectives (SLOs) and service-level indicators (SLIs) to track performance and reliability.
Drive incident management by identifying root causes and providing long-term solutions.
Mentor junior engineers and foster a collaborative, learning-focused environment.
Design advanced monitoring and alerting systems for proactive system management.
Architect and optimize network topologies (hybrid cloud, multi-cloud, and on-prem) to support ultra-low-latency trading and compliance-driven workloads.
Configure and manage cloud and on-prem networking components (VPCs, Shared VPCs, Private Service Connect, Cloud NAT, and Global Load Balancers) for secure and compliant transaction flows.
Implement secure connectivity solutions (VPNs, Interconnect, Direct Connect, and service meshes) to meet fintech regulatory requirements and standards.
Develop and maintain DNS, load-balancing, and traffic-routing strategies to ensure millisecond-level latency for real-time transactions.
Evolve Infrastructure as Code (IaC) practices and principles to automate infrastructure provisioning.
Collaborate on reliability roadmaps, performance benchmarks, and disaster recovery plans tailored for low-latency and high-throughput workloads.
Manage Kubernetes clusters at scale, integrating service meshes like Istio or Linkerd.
Implement chaos engineering principles to strengthen system resilience.
Influence technical direction, reliability culture, and organizational strategies.
Requirements:
6-9 years of experience in SRE, DevOps, or system architecture roles with large-scale production systems.
Extensive experience managing and scaling high-traffic, low-latency fintech systems, ensuring reliability, compliance, and secure transaction processing.
Proven expertise in the networking stack, with hands-on experience in BGP, OSPF, DNS, HTTP(S), TCP/IP, MPLS, and VPN protocols.
Advanced knowledge of GCP networking (VPC design, Shared VPC, Private Service Connect, Global Load Balancers, Cloud DNS, Cloud NAT, Network Intelligence Center, and Service Mesh).
Strong background in managing complex multi-cloud environments (AWS, GCP, Azure) with a focus on secure and compliant architectures in regulated industries.
Hands-on expertise in Terraform and Infrastructure-as-Code (IaC) for repeatable, automated deployments.
Expertise in Kubernetes, container orchestration, and microservices, with production experience in regulated fintech environments.
Advanced programming and scripting skills in Python, Go, or Java, applied to automation, risk reduction, and financial system resilience.
Proficiency with monitoring and logging tools (Prometheus, Mimir, Grafana, Loki) to ensure real-time visibility into trading, payments, and transaction flows.
Strong understanding of networking, load balancing, and DNS management across multi-cloud and hybrid infrastructures.
Experience implementing end-to-end observability solutions (metrics, logs, and traces) to monitor and optimize transaction throughput while adhering to latency SLAs.
Leadership skills with experience mentoring teams, fostering a culture of reliability, and partnering with cross-functional stakeholders in product teams.
Strong communication, critical thinking, and incident management abilities, especially in high-stakes production incidents involving customer transactions.
Bachelor's or Master's degree in Computer Science, Engineering, or equivalent experience.
(ref: hirist.tech)