Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions
Operate, monitor, and triage all aspects of our production and non-production environments
Collaborate with other engineers on code, infrastructure, design reviews, and process enhancements.
Evaluate and integrate new technologies to improve system reliability, security, and performance
Develop and implement automation to provision, configure, deploy, and monitor Apple services
Participate in an on-call rotation providing hands-on technical expertise during service-impacting events
Design, build, and maintain highly available and scalable infrastructure
Implement and improve monitoring, alerting, and incident response systems
Automate operations tasks and develop efficient workflows
Conduct system performance analysis and optimization
Collaborate with development teams to ensure smooth deployment and release processes
Implement and maintain security best practices and compliance standards
Troubleshoot and resolve system and application issues
Participate in capacity planning and scaling efforts
Stay up-to-date with the latest trends, technologies, and advancements in SRE practices
Contribute to capacity planning, scale testing, and disaster recovery exercises.
Approach operational problems with a software engineering mindset
BS degree in computer science or equivalent field with 5+ years of experience
5+ years in an Infrastructure Ops, Site Reliability Engineering, or DevOps-focused role.
Knowledge of Linux operating system principles, networking fundamentals, and systems management.
Demonstrable fluency in at least one of the following languages : Java, Python, or Go
Experience managing and scaling distributed systems in a public, private, or hybrid cloud environment
Develop and implement automation tools and apply best practices for system reliability.
You will be responsible for the availability & scalability of our services and manage the disaster recovery and other operational tasks.
Collaborate with the development team to improve application codebase for logging, metrics and traces for observability.
Collaborate with data science teams and other business units to design, build and maintain the infrastructure that runs machine learning and generative AI workloads.
Influence architectural decisions with focus on security, scalability and performance.
Find and fix problems in production, and work to avoid them from happening again
Preferred Qualifications :
Familiarity with micro-services architecture and container orchestration with Kubernetes.
Awareness of key security principles including encryption, keys (types and exchange protocols).
Understanding SRE principles includes monitoring, alerting, error budgets, fault analysis, and automation.
Strong sense of ownership, with a desire to communicate and collaborate with other engineers and teams.
Ability to identify and communicate technical and architectural problems, while working with partners and their team to iteratively find solutions.
(ref : hirist.tech)
Create a job alert for this search
Site Reliability Engineer • Baroda
Related jobs
Promoted
Site Reliability Engineer
ConcordNadiad, IN
Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 18 days ago
Promoted
Senior Site Reliability Engineer- ELK Expert
iVedha Inc.Anand, IN
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone.
Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
Promoted
New!
Site Reliability Engineer
BayOne Solutionsvadodara, gujarat, in
Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 9 hours ago
Promoted
Site Reliability Engineer
Amicon Hub Servicesanand, gujarat, in
Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation.
Collaborate with development teams to en...Show moreLast updated: 6 days ago
Promoted
Senior Site Reliability Engineer
WSO2anand, gujarat, in
Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 8 days ago
Promoted
L3 O365 Engineer
Nextbridge IT SolutionsNadiad, IN
We are seeking a highly skilled .This senior role is a critical escalation point for complex issues, driving the resolution of major incidents and ensuring the seamless operation, security, and pro...Show moreLast updated: 7 days ago
Promoted
Integrity Engineer - Fixed Equipment
Quest GlobalVadodara, Gujarat, India
Title : Integrity Lead Engineer.This position is a mechanical integrity technical position for fixed equipment in support of petrochemical facility.
This position provides technical support to the cl...Show moreLast updated: 30+ days ago
Promoted
New!
Site Reliability Engineer
ExasoftAnand, IN
Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites.
Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 12 hours ago
Promoted
Site Reliability Engineer
Core Minds Tech SOlutionsVadodara
Job Description : - Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions&l...Show moreLast updated: 30+ days ago
Reliability Engineer
Saaki Argus & Averil ConsultingVadodara, Gujarat, India
Quick Apply
One of the leading Engineering and R&D Software Services Companies.Experience of maintaining the Instruments, Valves, transmitters, Sensors, Control systems (DCS / PLC, SCADA), Analyzers and F&am...Show moreLast updated: 30+ days ago
Promoted
Deployment Engineer
AvocaNadiad, IN
Build, launch & optimize AI agents that power the next generation of home-service customer experiences.Avoca is the all-in-one AI lead-conversion platform.
Our technology boosts booking rates, slash...Show moreLast updated: 30+ days ago
Promoted
Implementation Engineer
Storefox.aiAnand, IN
By leveraging AI-driven solutions, we capture and analyze in-store interactions, providing actionable insights that empower retailers to enhance customer experiences and maximize profitability.You ...Show moreLast updated: 4 days ago
Promoted
Site Reliability Engineer - Chaos Management
Xebiaanand, gujarat, in
AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 8 days ago
Promoted
Senior MLOps Engineer
Mitchell Martin Inc.Nadiad, IN
Include, but are not limited to, the following : .Own productionizing models—from tracked experiments to governed releases—ensuring resilient services with clear SLOs, runbooks, and fast, safe rollba...Show moreLast updated: 20 days ago
Promoted
Lead Sustenance Engineer - Storage
DDNAnand, IN
This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades.
DataDirect Networks (DDN) is a globa...Show moreLast updated: 7 days ago
Promoted
New!
D&E Engineer
Eki.StructAnand, IN
The Company’s Equal Opportunities policy applies equally to the recruitment process and must be complied with at every stage of the recruitment process.
This means that prospective applicants should...Show moreLast updated: 12 hours ago
Promoted
Site Reliability Engineer
XebiaAnand, IN
AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 26 days ago
Promoted
Site Reliability Engineer
UplersAnand, IN
Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required.
OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 24 days ago
Promoted
DevOps / Platform Engineer
iVedha Inc.Vadodara, IN
Hiring a seasoned DevOps / Platform Engineer to drive automation, platform reliability, and robust.Design, deploy, and manage CI / CD pipelines and infrastructure automation, leveraging AI for.Implemen...Show moreLast updated: 30+ days ago
Promoted
Resident Engineer – Kubernetes & Portworx
CMK Resources, Inc.Anand, IN
CMK Resources Resident Engineer – Kubernetes & Portworx (3 openings).Help Shape the Future of Kubernetes Storage.Our client's largest and most strategic customer is moving VMware-based workloads to...Show moreLast updated: 7 days ago