Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions
Operate, monitor, and triage all aspects of our production and non-production environments
Collaborate with other engineers on code, infrastructure, design reviews, and process enhancements.
Evaluate and integrate new technologies to improve system reliability, security, and performance
Develop and implement automation to provision, configure, deploy, and monitor Apple services
Participate in an on-call rotation providing hands-on technical expertise during service-impacting events
Design, build, and maintain highly available and scalable infrastructure
Implement and improve monitoring, alerting, and incident response systems
Automate operations tasks and develop efficient workflows
Conduct system performance analysis and optimization
Collaborate with development teams to ensure smooth deployment and release processes
Implement and maintain security best practices and compliance standards
Troubleshoot and resolve system and application issues
Participate in capacity planning and scaling efforts
Stay up-to-date with the latest trends, technologies, and advancements in SRE practices
Contribute to capacity planning, scale testing, and disaster recovery exercises.
Approach operational problems with a software engineering mindset
BS degree in computer science or equivalent field with 5+ years of experience
5+ years in an Infrastructure Ops, Site Reliability Engineering, or DevOps-focused role.
Knowledge of Linux operating system principles, networking fundamentals, and systems management.
Demonstrable fluency in at least one of the following languages : Java, Python, or Go
Experience managing and scaling distributed systems in a public, private, or hybrid cloud environment
Develop and implement automation tools and apply best practices for system reliability.
You will be responsible for the availability & scalability of our services and manage the disaster recovery and other operational tasks.
Collaborate with the development team to improve application codebase for logging, metrics and traces for observability.
Collaborate with data science teams and other business units to design, build and maintain the infrastructure that runs machine learning and generative AI workloads.
Influence architectural decisions with focus on security, scalability and performance.
Find and fix problems in production, and work to avoid them from happening again
Preferred Qualifications :
Familiarity with micro-services architecture and container orchestration with Kubernetes.
Awareness of key security principles including encryption, keys (types and exchange protocols).
Understanding SRE principles includes monitoring, alerting, error budgets, fault analysis, and automation.
Strong sense of ownership, with a desire to communicate and collaborate with other engineers and teams.
Ability to identify and communicate technical and architectural problems, while working with partners and their team to iteratively find solutions.
(ref : hirist.tech)
Create a job alert for this search
Site Reliability Engineer • Hosur
Related jobs
Promoted
Site Reliability Engineer
SynamediaBengaluru, Karnataka, India
At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed.
We are backed by the Permira funds and Sky.This is the age of infinite ...Show moreLast updated: 10 days ago
Promoted
Site Reliability Engineer
ReyikaBengaluru, Karnataka, India
Senior Site Reliability Engineer / Reliability Architect.Pune,Bengalore,Chennai,Pune,Noida.Reliability Architect with over 9 years of experience in proactive monitoring, automation, and observabili...Show moreLast updated: 1 day ago
Promoted
Site Reliability Engineer
London Stock Exchange GroupBangalore, India
Engineer, Site Reliability Engineering.We are evolving our Reliability Engineering team to move beyond support and operations.
As a Senior Engineer in Site Reliability, you will be part of a diverse...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
JRD SystemsBengaluru, Karnataka, India
Site Reliability Engineer (Windows / Cloud / Automation) Job Summary : We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud e...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
SynechronBengaluru, Karnataka, India
We have immediate opportunity for Senior Site Reliability Engineer.Senior Site Reliability Engineer.At Synechron, we believe in the power of digital to transform businesses for the better.Our globa...Show moreLast updated: 30+ days ago
Promoted
New!
Site Reliability Engineer
Karixhosur, tamil nadu, in
We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Serv...Show moreLast updated: 16 hours ago
Promoted
Site Reliability Engineer
VAYUZ TechnologiesBengaluru, Republic Of India, IN
Execute and maintain SOPs for production operations, onboarding, and integration support.Handle incident response, troubleshoot system and data issues, and ensure timely resolution.Support partner ...Show moreLast updated: 2 days ago
Promoted
Senior Site Reliability Engineer
o9 Solutions, Inc.Bengaluru, Karnataka, India
Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises.
With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show moreLast updated: 7 days ago
Promoted
Site Reliability Engineer
GREYTIP SOFTWARE PRIVATE LIMITEDBengaluru, Karnataka, India
About the Role We are looking for a skilled Site Reliability Engineer II to join our SRE team.The ideal candidate will have hands-on experience in production monitoring, alert handling, and L1 pro...Show moreLast updated: 4 days ago
Promoted
Site Reliability Engineer
WhiteLotus Talent PartnersBengaluru, Karnataka, India
L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by.
In this role, you will focu...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer (SRE) – Infrastructure & Automation
InstaServicehosur, tamil nadu, in
InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly.
We’re growing fast across 23+ states and expanding...Show moreLast updated: 14 days ago
Promoted
Site Reliability Engineer
super.moneyBengaluru, Karnataka, India
Site Reliability Engineer (SRE) Level 3.A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and...Show moreLast updated: 17 days ago
Promoted
New!
Site Reliability Engineer
Awign ExpertBangalore, IN
Position : SRE Observability Engineer.Mandatory Skills : Observability, Grafana and Writing queries using Prometheus and Loki.
We are seeking a highly experienced and driven Senior Observability Engin...Show moreLast updated: 22 hours ago
Promoted
New!
Senior Site Reliability Engineer (SRE)
Voya Indiahosur, tamil nadu, in
We are seeking a strategic and technically adept leader to drive the scalability, resilience, and operational excellence of our enterprise systems.
This role will set the vision for site reliability...Show moreLast updated: 15 hours ago
Promoted
Site Reliability Engineer
Datum Technologies Grouphosur, tamil nadu, in
Job Title : Site Reliability Engineer (SRE) – AWS.AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog.We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experi...Show moreLast updated: 8 days ago
Promoted
Site Reliability Engineer
Media.netBengaluru, Karnataka, India
Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms.
HQ is based in New York, and the Global H...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
Landmark GroupBengaluru, India
Ensure reliability and high availability of Java and microservices-based applications through proactive monitoring and automation.
Define and track SLIs / SLOs to maintain service performance and stab...Show moreLast updated: 7 days ago
Promoted
Site Reliability Engineer
PhonePehosur, tamil nadu, in
SRE We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production ...Show moreLast updated: 16 days ago