Engage with our product teams to understand requirements, then design and implement resilient and scalable infrastructure solutions
Operate, monitor, and triage all aspects of our production and non-production environments
Collaborate with other engineers on code, infrastructure, design reviews, and process enhancements
Evaluate and integrate new technologies to improve system reliability, security, and performance
Develop and implement automation to provision, configure, deploy, and monitor Apple services
Participate in an on-call rotation providing hands-on technical expertise during service-impacting events
Design, build, and maintain highly available and scalable infrastructure
Implement and improve monitoring, alerting, and incident response systems
Automate operations tasks and develop efficient workflows
Conduct system performance analysis and optimization
Collaborate with development teams to ensure smooth deployment and release processes
Implement and maintain security best practices and compliance standards
Troubleshoot and resolve system and application issues
Participate in capacity planning and scaling efforts
Stay up to date with the latest trends, technologies, and advancements in SRE practices
Contribute to capacity planning, scale testing, and disaster recovery exercises
Approach operational problems with a software engineering mindset
BS degree in computer science or equivalent field with 5+ years of experience
5+ years in an Infrastructure Ops, Site Reliability Engineering, or DevOps-focused role.
Knowledge of Linux operating system principles, networking fundamentals, and systems management.
Demonstrable fluency in at least one of the following languages: Java, Python, or Go
Experience managing and scaling distributed systems in a public, private, or hybrid cloud environment
Develop and implement automation tools and apply best practices for system reliability.
Own the availability and scalability of our services, and manage disaster recovery and other operational tasks.
Collaborate with the development team to improve logging, metrics, and traces in the application codebase for better observability.
Collaborate with data science teams and other business units to design, build, and maintain the infrastructure that runs machine learning and generative AI workloads.
Influence architectural decisions with a focus on security, scalability, and performance.
Find and fix problems in production, and work to prevent them from recurring.
Preferred Qualifications:
Familiarity with microservices architecture and container orchestration with Kubernetes.
Awareness of key security principles, including encryption and cryptographic keys (types and exchange protocols).
Understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and automation.
Strong sense of ownership, with a desire to communicate and collaborate with other engineers and teams.
Ability to identify and communicate technical and architectural problems while working with partners and their teams to find solutions iteratively.
(ref: hirist.tech)
Site Reliability Engineer • Vadodara