MHP India is seeking a Senior Consultant, DevOps & SRE to join our team. In this role, you will be responsible for designing and building mission-critical infrastructure on public and private cloud platforms, as well as developing the tools and processes necessary to ensure the highest levels of availability and reliability for our clients. You'll work with systems that handle thousands of daily transactions, managing large-scale environments with thousands of virtual machines, hundreds of thousands of requests per second, and petabytes of data.
You will be expected to define and implement the technical vision for our cloud-based platforms, working directly with our Software Engineering teams to ensure our systems are always up. A deep technical understanding of Linux, networking, and distributed architectures is required, with a strong focus on debugging technologies like Tomcat, HAProxy, Nginx, Apache, and Java. You must also possess strong programming skills in Python and Go.
Key Responsibilities
- Design and build highly available, scalable, and reliable cloud-based platforms for our clients.
- Implement and manage infrastructure as code (IaC) to ensure consistency and repeatability.
- Develop and maintain monitoring, alerting, and logging systems to proactively identify and resolve issues.
- Collaborate directly with software engineering teams to embed reliability and operational excellence into the development lifecycle.
- Perform deep-dive debugging and root cause analysis for complex production issues across the stack.
- Mentor and guide junior team members on best practices for DevOps and SRE.
- Contribute to the technical vision and roadmap for our SRE practice.
- Automate repetitive tasks to improve efficiency and reduce human error.
Required Skills & Qualifications
3-8+ years of experience as a DevOps Engineer, Site Reliability Engineer, or similar role.Expertise in Linux systems administration, networking, distributed monitoring systems like Prometheus GrafanaProficiency in a public cloud platform (e.g., AWS, Azure, GCP) including IaC tools like Terraform, Ansible, etc. .Strong hands-on experience with debugging tools and technologies for applications like Tomcat, HAProxy, Nginx, Apache, and Java.Solid programming skills in Python and / or Go.Proficiency with containerization and orchestration tools (e.g., Docker, Kubernetes).Experience with CI / CD pipelines and related tools (e.g., Jenkins, GitLab CI, ArgoCD),.Excellent problem-solving skills and the ability to operate under pressure.Strong communication and collaboration abilities.