Job Description :
- Design, deploy, and manage Kubernetes clusters (EKS, self-managed, and hybrid).
- Ensure high availability, scalability, and resilience of containerized platforms.
- Implement and manage networking (CNI plugins, ingress controllers, service mesh) for cluster connectivity.
- Manage container security, image scanning, and policy enforcement ( Kyverno).
- Automate cluster provisioning, upgrades, and patching using tools like Terraform, Ansible, or Helm.
- Integrate observability tools (Prometheus, Grafana, Datadog) for monitoring and alerting.
- Troubleshoot node, pod, networking, and storage issues to ensure reliable application performance.
- Optimize resource usage (requests / limits, HPA, VPA) and cluster cost efficiency.
- Support CI / CD integration with Kubernetes (GitOps tools like ArgoCD)
- Collaborate with security, cloud foundation, and developer teams to provide a robust container platform.
Required Skills :
Deep knowledge of Kubernetes internals (control plane, scheduling, networking, storage).Experience with containerization (Docker) and container runtime interfaces .Strong skills in Infrastructure as Code (Terraform, Helm, Ansible).Proficiency with Linux system administration and troubleshooting.Hands-on with CNCF ecosystem tools (ArgoCD, Prometheus, Grafana).Familiarity with cloud-managed Kubernetes services (EKS)Good understanding of RBAC, IAM, TLS, and Kubernetes security hardening practices.Knowledge of logging and monitoring stacks (EFK / ELK, Loki, Dynatrace, Datadog).Strong troubleshooting skills (pods crashloop, node not ready, networking bottlenecks, storage issues).Experience in scaling multi-cluster and multi-tenant Kubernetes environments.Certifications : Certified Kubernetes Administrator (CKA) ( Must Have )
(ref : hirist.tech)