Description :
We are looking for a proactive DevOps / SRE Engineer to own the stability, scalability, and deployment automation of our critical services.
You will foster a culture of seamless collaboration between development and operations, leveraging best practices in CI / CD, Infrastructure as Code (IaC), and observability.
Key Responsibilities :
- Automation (CI / CD) : Design, implement, and maintain robust, end-to-end CI / CD pipelines for fast and reliable software delivery.
- Infrastructure : Manage and provision cloud infrastructure and resources using Infrastructure as Code (IaC) tools.
- Container Orchestration : Manage and scale application deployments across our Kubernetes clusters, ensuring high availability and fault tolerance.
- Monitoring & Observability : Implement comprehensive monitoring, logging, and alerting systems (Prometheus, Grafana, ELK / Loki) to maintain system health and rapidly diagnose issues.
- Security (DevSecOps) : Implement security best practices into the CI / CD pipeline and cloud environment, focusing on vulnerability scanning and secrets management.
- Troubleshooting : Perform root cause analysis for production incidents and implement preventative measures to improve system reliability.
Essential Technical Skills :
Cloud Platforms : Expert experience with one or more major cloud providers (AWS, Azure, or GCP).Containerization : Mastery of Docker and production-level experience with Kubernetes administration.IaC : Proficiency in Terraform or CloudFormation for infrastructure management.CI / CD : Hands-on experience with CI / CD tools such as Jenkins, GitLab CI, or GitHub Actions.Operating Systems : Strong Linux system administration and Shell Scripting (Bash / Python) skills.Monitoring : Experience with logging, monitoring, and tracing tools (Prometheus, Grafana, Splunk, Jaeger).Configuration Management : Familiarity with tools like Ansible, Chef, or Puppet is a plus(ref : hirist.tech)