Job Overview :
We are seeking an experienced Lead DevOps Engineer with deep expertise in Kubernetes infrastructure design and implementation. This role requires an individual who can architect, build, and manage enterprise-grade Kubernetes clusters from the ground up. The position offers an exciting opportunity to lead infrastructure modernization initiatives and work with cutting-edge cloud-native technologies.
Initial Setup Phase : The first 2-3 months will be based in Mumbai for project initiation and stakeholder alignment, followed by relocation to our Bangalore CoffeeBeans office for ongoing operations.
Key Responsibilities :
Infrastructure Design & Implementation :
- Design and architect enterprise-grade Kubernetes clusters across multi-cloud environments (AWS / Azure / GCP)
- Build production-ready Kubernetes infrastructure with high availability, scalability, and security best practices
- Implement Infrastructure as Code using Terraform, Helm charts, and GitOps methodologies
- Set up monitoring, logging, and observability solutions for Kubernetes workloads
- Design disaster recovery and backup strategies for containerized applications
Leadership & Team Management :
- Lead a team of 3-4 DevOps engineers and provide technical mentorship
- Drive best practices for containerization, orchestration, and cloud-native development
- Collaborate with development teams to optimize application deployment strategies
- Conduct technical reviews and ensure code quality standards across infrastructure components
- Facilitate knowledge transfer and create comprehensive documentation
Operational Excellence :
- Manage CI / CD pipelines integrated with Kubernetes deployments
- Implement security policies including RBAC, network policies, and container security scanning
- Optimize cluster performance and resource utilization
- Automate routine operations and reduce manual intervention
- Ensure 99.9% uptime for production Kubernetes workloads
Strategic Planning :
- Define infrastructure roadmap aligned with business objectives
- Evaluate and recommend new tools and technologies for container orchestration
- Capacity planning and cost optimization for cloud infrastructure
- Risk assessment and mitigation strategies for production environments
Must-Have Technical Skills :
Core Kubernetes Expertise :
- 6+ years of hands-on experience with Kubernetes in production environments
- Deep understanding of Kubernetes architecture, components (etcd, API server, scheduler, kubelet)
- Expertise in Kubernetes networking (CNI, Ingress controllers, Service mesh)
- Advanced knowledge of Kubernetes storage (CSI, Persistent Volumes, StorageClasses)
- Experience with Kubernetes operators and custom resource definitions (CRDs)
Infrastructure as Code :
- Terraform - Advanced proficiency for infrastructure provisioning
- Helm - Creating and managing complex Helm charts
- Ansible / Chef / Puppet - Configuration management experience
- GitOps workflows - ArgoCD, Flux, or similar tools
Cloud Platforms :
Multi-cloud experience with at least 2 major cloud providers :
- AWS : EKS, EC2, VPC, IAM, CloudFormation
- Azure : AKS, Virtual Networks, Azure Resource Manager
- GCP : GKE, Compute Engine, VPC, Deployment Manager
CI / CD & DevOps Tools :
- Jenkins, GitLab CI, Azure DevOps, or GitHub Actions
- Docker - Advanced containerization and optimization techniques
- Container registries - Docker Hub, ECR, ACR, GCR management
- Version control - Git workflows and branching strategies
Monitoring & Observability :
- Prometheus & Grafana - Metrics collection and visualization
- ELK Stack / EFK - Centralized logging solutions
- Jaeger / Zipkin - Distributed tracing implementation
- AlertManager - Intelligent alerting and incident management
Good-to-Have Skills :
Advanced Technologies :
- Service Mesh experience (Istio, Linkerd, Consul Connect)
- Serverless platforms (Knative, OpenFaaS, AWS Lambda)
- Database operations in Kubernetes (PostgreSQL, MongoDB operators)
- Machine Learning pipelines on Kubernetes (Kubeflow, MLflow)
Security & Compliance :
- Container security tools (Twistlock, Aqua Security, Falco)
- Policy management (Open Policy Agent, Gatekeeper)
- Compliance frameworks (SOC 2, PCI-DSS, GDPR)
- Certificate management (cert-manager, Let's Encrypt)
Programming & Scripting :
- Python / Go - For automation and tooling development
- Shell scripting (Bash / PowerShell) - Advanced automation
- YAML / JSON - Configuration management expertise
Required Qualifications :
Education :
- Bachelor's degree in Computer Science, Engineering, or related technical field
Relevant certifications preferred :
- Certified Kubernetes Administrator (CKA)
- Certified Kubernetes Application Developer (CKAD)
- Cloud provider certifications (AWS / Azure / GCP)
Experience :
- 6-7 years of DevOps / Infrastructure engineering experience
- 4+ years of hands-on Kubernetes experience in production
- 2+ years in a lead / senior role managing infrastructure teams
- Experience with large-scale distributed systems and microservices architecture
(ref : hirist.tech)