Site Reliability Engineer III - System Architecture
HyreSnapBangalore
10 days ago
Job description
Responsibilities :
Architect and lead the design of scalable, reliable infrastructure solutions.
Implement strategies for high availability, scalability, and low-latency performance.
Define service-level objectives (SLOs) and service-level indicators (SLIs) to track performance and reliability.
Drive incident management by identifying root causes and providing long-term solutions.
Mentor junior engineers and foster a collaborative, learning-focused environment.
Design advanced monitoring and alerting systems for proactive system management.
Architect and optimize network topologies (hybrid cloud, multi-cloud, and on-prem) to support ultra-low-latency trading and compliance-driven workloads.
Configure and manage cloud and on-prem networking components (VPCs, Shared VPCs, Private Service Connect, Cloud NAT, and Global Load Balancers for secure and compliant transaction flows.
Implement secure connectivity solutions (VPNs, Interconnect, Direct Connect, and service meshes) to meet fintech regulatory requirements and standards.
Develop and maintain DNS, load-balancing, and traffic-routing strategies to ensure millisecond-level latency for real-time transactions.
Evolve Infrastructure as Code (IaC) practices and principles to automate infrastructure provisioning.
Collaborate on reliability roadmaps, performance benchmarks, and disaster recovery plans tailored for low-latency and high-throughput workloads.
Manage Kubernetes clusters at scale, integrating service meshes like Istio or Linkerd.
Implement chaos engineering principles to strengthen system resilience.
Influence technical direction, reliability culture, and organizational strategies.
Requirements :
6-9 years of experience in SRE, DevOps, or system architecture roles with large-scale production systems.
Extensive experience managing and scaling high-traffic, low-latency fintech systems, ensuring reliability, compliance, and secure transaction processing.
Proven expertise in the networking stack, with hands-on experience in BGP, OSPF, DNS, HTTP(S), TCP / IP, MPLS, and VPN protocols.
Advanced knowledge of GCP networking (VPC design, Shared VPC, Private Service Connect, Global Load Balancers, Cloud DNS, Cloud NAT, Network Intelligence Center, and Service Mesh).
Strong background in managing complex multi-cloud environments (AWS, GCP, Azure) with a focus on secure and compliant architectures in regulated industries.
Hands-on expertise in Terraform and Infrastructure-as-Code (IaC) for repeatable, automated deployments.
Expertise in Kubernetes, container orchestration, and microservices, with production experience in regulated fintech environments.
Advanced programming and scripting skills in Python, Go, or Java, applied to automation, risk reduction, and financial system resilience.
Proficiency with monitoring and logging tools (Prometheus, Mimir, Grafana, Loki) to ensure real-time visibility into trading, payments, and transaction flows.
Strong understanding of networking, load balancing, and DNS management across multi-cloud and hybrid infrastructures.
Implemented end-to-end observability solutions (metrics, logs, and traces) to monitor and optimize transaction throughput, adhering to latency SLAs.
Leadership skills with experience mentoring teams, fostering a culture of reliability, and partnering with cross-functional stakeholders in product teams.
Strong communication, critical thinking, and incident management abilities, especially in high-stakes production incidents involving customer transactions.
Bachelor's or Master's degree in Computer Science, Engineering, or equivalent experience.
(ref : hirist.tech)
Create a job alert for this search
Site Reliability Engineer • Bangalore
Related jobs
Promoted
Site Reliability Engineer
Vbeyond corporationBangalore
SRE (Site Reliability Engineer 2) We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tool...Show moreLast updated: 30+ days ago
Promoted
New!
Site Reliability Engineer
ExasoftBengaluru, IN
Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites.
Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 10 hours ago
Promoted
Site Reliability Engineer
Amicon Hub Serviceshosur, tamil nadu, in
Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation.
Collaborate with development teams to en...Show moreLast updated: 6 days ago
Promoted
New!
Site Reliability Engineer
BayOne Solutionshosur, tamil nadu, in
Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 7 hours ago
Promoted
New!
System Engineer
CUS Techhosur, tamil nadu, in
We are looking for a detail-oriented and proactive.The role involves ensuring the reliability, security, and performance of servers, networks, and applications while providing technical support and...Show moreLast updated: 7 hours ago
Promoted
Site Reliability Engineer
ViewSonicBengaluru, Karnataka, India
Bachelor's degree in Computer Science, Engineering, or a related field.Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.
Basic understanding of AWS solutions in...Show moreLast updated: 17 days ago
Promoted
Senior Site Reliability Engineer- ELK Expert
iVedha Inc.hosur, tamil nadu, in
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone.
Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
Core Minds Tech SOlutionsHosur
Job Description : - Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions&l...Show moreLast updated: 30+ days ago
Promoted
Senior Site Reliability Engineer II
ConfidentialBengaluru / Bangalore
The Site Reliability Engineering team focused on Efficiency and Performance is responsible for driving AWS cost intelligence, managing the ThousandEyes infrastructure, and ensuring optimal resource...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
TavantBengaluru, Karnataka, India
With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers.
It has been the frontrunner in driving digital innovation and tec...Show moreLast updated: 26 days ago
Promoted
Senior Site Reliability Engineer
WSO2hosur, tamil nadu, in
Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 7 days ago
Promoted
Site Reliability Engineer
XebiaBengaluru, Karnataka, India
Performance & Reliability Engineer ( Senior, Lead , Principal & Manager).Location : Pune, Chennai, Bangalore & Gurgaon.Role : Performance & Reliability Engineer.
Job Location : Gurgaon, Chennai, Pune, ...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
WhiteLotus Talent PartnersBengaluru, Karnataka, India
L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by.
In this role, you will focu...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
Uplershosur, tamil nadu, in
Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required.
OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 24 days ago
Promoted
Site Reliability Engineer - Chaos Management
Xebiahosur, tamil nadu, in
AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 7 days ago
Promoted
Inoptra - Site Reliability Engineer - DevOps
InoptraBangalore
Job Description : Site Reliability Engineer.For this position, were looking for talented & experienced engineers who have a passion fo...Show moreLast updated: 30+ days ago
Promoted
Principal Site Reliability Engineer
Rakuten IndiaBengaluru, Karnataka, India
Design, develop SLA, SLO, SLI of services within the Business Unit.Involve in whole process of Development, Production System Operation including system maintenance, monitoring, automation, backend...Show moreLast updated: 8 days ago
Promoted
Site Reliability Engineer
Central Business Solutions Inc.Bangalore Urban, Karnataka, India
Linux SRE [Linux SRE L3 with Infra + Operation Support].The Server Operations team is part of the Enterprise Computing organization within Client.
The wider team has presence in cities globally and ...Show moreLast updated: 5 days ago