About The Company :
Ara's Client is a B2B SaaS based company that helps online retailers and marketplaces make more business by using AI-powered advertising and marketing tools. The company was founded in 2016 and is based in San Francisco.
The Role :
We are seeking a highly skilled Staff DevOps Engineer to architect and maintain a highly available, global infrastructure capable of handling high QPS systems with 99.99% uptime. The role requires expertise in managing deployments across multiple regions, ensuring fault-tolerant systems, and driving scalability for mission-critical applications.
Key Responsibilities :
- Architect, manage, and scale Kubernetes clusters for high throughput and low latency across multiple global regions.
- Design and maintain Infrastructure as Code (IaC) to support a fault-tolerant, globally distributed architecture.
- Build and optimize CI / CD pipelines to ensure smooth, zero-downtime deployments.
- Ensure 99.99% availability for high QPS applications by implementing robust monitoring, incident management, and failover strategies.
- Manage multi-region deployments to enable low-latency, geo-redundant infrastructure.
- Collaborate with cross-functional teams to ensure security, scalability, and operational efficiency.
- Lead and mentor a high-performing DevOps team, fostering a culture of excellence and innovation.
Skills Required :
7-10 years of experience managing large-scale, high-availability systems.Experience in B2B SAAS company is a must.Proven expertise in Kubernetes administration, including multi-region deployments and scaling for high QPS.Deep experience with IaC tools like Terraform or CloudFormation.Hands-on with CI / CD pipelines for global, multi-region deployments.Strong understanding of cloud platforms (AWS, GCP, or Azure) and geo-redundant architecture.Proficient in Linux, scripting (Bash, Python), and troubleshooting large-scale distributed systems.Experience leading teams and solving complex, production-grade system challenges.Qualifications & Experience :
6–12 years of experience managing large-scale, high-availability systems.Any Graduate.Education
Masters in Technology (M.Tech / M.E), Bachelor Of Technology (B.Tech / B.E)
Skills Required
Devops, Kubernetes, Cicd, Cloud, Deployment, Scripting