Job Role : Site Reliability Engineer.
Location : Pune.
Job Summary :
We are seeking a Senior DevOps & SRE Engineer to join our team and help us build, deploy, and maintain our infrastructure and applications.
The ideal candidate will have experience working in a fast-paced environment and a strong background in DevOps and Site Reliability Engineering (SRE).
You will be responsible for ensuring the reliability, scalability, and security of our applications and :
- Build and maintain our CI / CD pipeline and deployment automation tools.
- Design and implement monitoring and alerting systems to ensure the health of our applications and infrastructure.
- Work closely with development teams to ensure that code is deployed in a reliable and scalable manner.
- Participate in on-call rotations to provide 24 / 7 support for our production systems.
- Develop and maintain disaster recovery plans and processes.
- Continuously improve our infrastructure and processes to ensure scalability, reliability, and security.
- Mentor and provide technical leadership to junior team members.
- Keep up-to-date with industry best practices and emerging technologies in DevOps and SRE.
Requirements :
Bachelors degree in Computer Science, Engineering, or a related field.5+ years of experience in DevOps or SRE.Strong programming skills in at least one of the following languages : Python, Go, Ruby, or Java.Experience with infrastructure as code tools such as Terraform or CloudFormation.Experience with containerization technologies such as Docker and Kubernetes.Strong understanding of networking concepts such as TCP / IP, DNS, and load balancing.Experience with monitoring and logging tools such as Prometheus, Grafana, and ELK stack.Excellent problem-solving skills and the ability to troubleshoot complex issues in a fast-paced environment.Strong communication and collaboration skills with both technical and non-technical stakeholders.Preferred Qualifications :
Experience with cloud providers such as AWS or Azure.Experience with building and maintaining large-scale distributed systems.Experience with database technologies such as MySQL, PostgreSQL, or MongoDB.Experience with automation tools such as Ansible or Chef.Experience with Agile development methodologies such as Scrum or Kanban.(ref : hirist.tech)