Description :
The core responsibilities for the job include the following :
Kubernetes and Containerization :
- Architect, deploy, and manage Kubernetes (K8s) clusters on both bare-metal and cloud environments.
- Utilize Helm and similar tools for application packaging and deployment within K8s.
- Develop and maintain K8s deployment templates for consistent and repeatable service rollouts.
- Utilize installation tools and distributions like kubeadm, kubespray, k3s, Rancher, OpenShift, etc., for bare-metal K8s and OS :
- Design and deploy new infrastructure solutions for bare-metal environments based on evolving requirements.
- Perform Linux sysadmin tasks and troubleshooting on physical servers.
- Utilize scripts and IaC for automated provisioning and management.
- Have a good understanding of database servers.
Networking and Security :
Design, implement, and troubleshoot networking solutions, covering reachability, routing, DNS, load balancing, firewall management, etc.Deploy and maintain environments following security best practices, including system hardening, implementing least privileged access, and ensuring adherence to compliance standards.CI / CD and Automation :
Design, build, and maintain CI / CD pipelines to automate software build, testing, and deployment processes.Develop automation scripts (Python, Bash, Ansible, etc.) to streamline software packaging, deployment, and operational tasks.Reliability and Operations :
Implement and manage monitoring and alerting systems for proactive issue detection and resolution.Develop and execute backup and disaster recovery (DR) strategies to ensure data integrity and service availability.Conduct performance testing and capacity planning to optimize and scale infrastructure effectively.Collaborate with development teams to ensure new services are designed for reliability and seamless deployment.Requirements :
Bachelor's degree in Computer Science, Engineering, or a related field; or equivalent work experience.4- 6 years of experience in DevOps / Site Reliability Engineering / Systems Administration, or a similar role with a strong focus on bare-metal environments.Expert-level knowledge of Linux system administration, scripting, troubleshooting, and performance tuning.Deep understanding of networking concepts, protocols, and firewalls.Strong experience in designing and implementing disaster recovery and backup strategies.Proficiency in managing and optimizing database technologies.Extensive experience with Kubernetes (both cloud and bare-metal deployments), including deployment, scaling, and management.Proficiency in scripting and automation (e.g., Python, Bash, Ansible, or similar).Experience with CI / CD pipelines and tools.Strong problem-solving skills and the ability to thrive in a fast-paced environment.Excellent communication and collaboration abilities.(ref : hirist.tech)