Key Responsibilities :
- Architect and Implement : Design and deploy scalable, high-availability Kubernetes clusters using OpenShift and Rancher Kubernetes Engine (RKE) .
- Automation Orchestration : Develop and manage infrastructure-as-code (IaC) solutions using Terraform, Helm, or Ansible.
- CI / CD Integration : Implement and optimize CI / CD pipelines using Jenkins, GitLab CI / CD, ArgoCD, or Tekton for automated deployment and testing.
- Security Compliance : Enforce security best practices for Kubernetes clusters, RBAC policies, service mesh configurations, and container image scanning.
- Monitoring Logging : Set up observability solutions using Prometheus, Grafana, ELK / EFK Stack, or OpenTelemetry for proactive monitoring and alerting.
- Multi-Cloud Hybrid Cloud Deployments : Design hybrid cloud and multi-cloud strategies using AWS, Azure, GCP, or on-prem solutions integrated with OpenShift and Rancher.
- SRE Performance Optimization : Implement SRE best practices for high availability, auto-scaling, and performance tuning of microservices architectures.
- Collaboration : Work closely with development, security, and operations teams to streamline DevOps processes and enable faster deployments.
- Disaster Recovery Backup : Implement disaster recovery strategies , backup automation, and cluster failover solutions.
Required Skills Experience :
Kubernetes Containerization : Deep understanding of Kubernetes orchestration, OpenShift, and Rancher Kubernetes Engine (RKE2 / RKE) .
Containerization Service Mesh : Experience with Docker, Istio, Linkerd, or Envoy.
Infrastructure as Code (IaC) : Hands-on expertise with Terraform, Helm, and Ansible.
CI / CD Pipelines : Strong knowledge of Jenkins, GitOps (ArgoCD, FluxCD), and Tekton.
Cloud Platforms : Experience with AWS, Azure, GCP, and on-premises Kubernetes clusters.
Monitoring Logging : Experience with Prometheus, Grafana, ELK / EFK Stack, OpenTelemetry.
Security Compliance : Kubernetes RBAC, Pod Security Policies, image scanning, and network policies.
Scripting Automation : Proficiency in Bash, Python, or Go for automation and scripting.
Networking Load Balancing : Expertise in Kubernetes networking, Ingress controllers (NGINX, Traefik), and service discovery.
Backup DR : Experience with Velero, Longhorn, or Kasten for Kubernetes backup and recovery.
Skills Required
Scripting, Kubernets, Azure, Automation, Python, Devops, Technical Architecture