Description
We are seeking an experienced SRE DevOps Engineer to join our team in India. The ideal candidate will have a strong background in system reliability and automation, with a passion for improving system performance and ensuring high availability.
Responsibilities
- Design, implement, and maintain highly available systems and infrastructure.
- Monitor system performance and troubleshoot issues as they arise.
- Automate deployment processes and improve CI/CD pipelines.
- Collaborate with development teams to enhance application performance and reliability.
- Implement security best practices and manage access controls.
- Perform capacity planning and forecasting for system resources.
- Document system configurations and operational procedures.
Skills and Qualifications
- 7-15 years of experience in Site Reliability Engineering or DevOps roles.
- Strong knowledge of Linux/Unix administration.
- Experience with cloud platforms such as AWS, Azure, or Google Cloud.
- Proficiency in scripting languages like Python, Bash, or Ruby.
- Familiarity with containerization technologies such as Docker and orchestration tools like Kubernetes.
- Experience with configuration management tools such as Ansible, Puppet, or Chef.
- Knowledge of monitoring tools like Prometheus, Grafana, or Nagios.
- Understanding of networking concepts and protocols.
Skills Required
Kubernetes, Terraform, Docker, Prometheus, Grafana, Python, Linux Administration, Monitoring Tools, Networking, GCP