About Company
Founded in 2017, CoffeeBeans specializes in high-end consulting services in technology, product, and processes. We help our clients significantly improve their quality of delivery through impactful product launches and process simplification, and we help them build competencies that drive business outcomes across industries. The company uses new-age technologies to help its clients build superior products and realize better customer value. We also offer data-driven solutions and AI-based products for businesses operating in a wide range of product categories and service domains.
We are seeking an experienced Lead DevOps Engineer with deep expertise in Kubernetes infrastructure design and implementation. This role requires an individual who can architect, build, and manage enterprise-grade Kubernetes clusters from the ground up. The position offers an exciting opportunity to lead infrastructure modernization initiatives and work with cutting-edge cloud-native technologies.
Initial Setup Phase: The first 2-3 months will be based in Mumbai for project initiation and stakeholder alignment, followed by relocation to the CoffeeBeans office in Bangalore for ongoing operations.
Key Responsibilities
Infrastructure Design & Implementation
- Design and architect enterprise-grade Kubernetes clusters across multi-cloud environments (AWS/Azure/GCP)
- Build production-ready Kubernetes infrastructure with high availability, scalability, and security best practices
- Implement Infrastructure as Code using Terraform, Helm charts, and GitOps methodologies
- Set up monitoring, logging, and observability solutions for Kubernetes workloads
- Design disaster recovery and backup strategies for containerized applications
Leadership & Team Management
- Lead a team of 3-4 DevOps engineers and provide technical mentorship
- Drive best practices for containerization, orchestration, and cloud-native development
- Collaborate with development teams to optimize application deployment strategies
- Conduct technical reviews and ensure code quality standards across infrastructure components
- Facilitate knowledge transfer and create comprehensive documentation
Operational Excellence
- Manage CI/CD pipelines integrated with Kubernetes deployments
- Implement security policies including RBAC, network policies, and container security scanning
- Optimize cluster performance and resource utilization
- Automate routine operations and reduce manual intervention
- Ensure 99.9% uptime for production Kubernetes workloads
Strategic Planning
- Define an infrastructure roadmap aligned with business objectives
- Evaluate and recommend new tools and technologies for container orchestration
- Plan capacity and optimize costs for cloud infrastructure
- Assess risks and define mitigation strategies for production environments
Must-Have Technical Skills
Core Kubernetes Expertise
- 6+ years of hands-on experience with Kubernetes in production environments
- Deep understanding of Kubernetes architecture and components (etcd, API server, scheduler, kubelet)
- Expertise in Kubernetes networking (CNI, Ingress controllers, service mesh)
- Advanced knowledge of Kubernetes storage (CSI, Persistent Volumes, StorageClasses)
- Experience with Kubernetes operators and custom resource definitions (CRDs)
Infrastructure as Code
- Terraform: advanced proficiency for infrastructure provisioning
- Helm: creating and managing complex Helm charts
- Ansible/Chef/Puppet: configuration management experience
- GitOps workflows: ArgoCD, Flux, or similar tools
Cloud Platforms
Multi-cloud experience with at least 2 major cloud providers:
- AWS: EKS, EC2, VPC, IAM, CloudFormation
- Azure: AKS, Virtual Networks, Azure Resource Manager
- GCP: GKE, Compute Engine, VPC, Deployment Manager
CI/CD & DevOps Tools
- CI/CD platforms: Jenkins, GitLab CI, Azure DevOps, or GitHub Actions
- Docker: advanced containerization and optimization techniques
- Container registries: Docker Hub, ECR, ACR, GCR management
- Version control: Git workflows and branching strategies
Monitoring & Observability
- Prometheus & Grafana: metrics collection and visualization
- ELK Stack/EFK: centralized logging solutions
- Jaeger/Zipkin: distributed tracing implementation
- AlertManager: intelligent alerting and incident management
Good-to-Have Skills
Advanced Technologies
- Service mesh experience (Istio, Linkerd, Consul Connect)
- Serverless platforms (Knative, OpenFaaS, AWS Lambda)
- Database operations in Kubernetes (PostgreSQL, MongoDB operators)
- Machine learning pipelines on Kubernetes (Kubeflow, MLflow)
Security & Compliance
- Container security tools (Twistlock, Aqua Security, Falco)
- Policy management (Open Policy Agent, Gatekeeper)
- Compliance frameworks (SOC 2, PCI-DSS, GDPR)
- Certificate management (cert-manager, Let's Encrypt)
Programming & Scripting
- Python/Go: automation and tooling development
- Shell scripting (Bash/PowerShell): advanced automation
- YAML/JSON: configuration management expertise
Required Qualifications
Education
- Bachelor's degree in Computer Science, Engineering, or a related technical field
- Relevant certifications preferred:
  - Certified Kubernetes Administrator (CKA)
  - Certified Kubernetes Application Developer (CKAD)
  - Cloud provider certifications (AWS/Azure/GCP)
Experience
- 6-7 years of DevOps/infrastructure engineering experience
- 4+ years of hands-on Kubernetes experience in production
- 2+ years in a lead/senior role managing infrastructure teams
- Experience with large-scale distributed systems and microservices architecture