Location : Pune
Experience : 7 - 9 Years
Notice Period : Immediate to 15 Days
Overview :
We are looking for an experienced IT Operations (Monitoring & Observability) Consultant to design, implement, and optimize end-to-end observability solutions. The ideal candidate will have a strong background in monitoring frameworks, ITSM integrations, and AIOps tools to drive system reliability, performance, and proactive incident management.
Key Responsibilities :
- Design and deploy comprehensive monitoring and observability architectures for infrastructure, applications, and networks.
- Implement tools such as Prometheus, Grafana, OpsRamp, Dynatrace, and New Relic for system performance monitoring.
- Integrate monitoring systems with ITSM platforms (e.g., ServiceNow, BMC Remedy).
- Develop dashboards, alerts, and reports to enable real-time performance insights.
- Architect solutions for hybrid and multi-cloud environments.
- Automate alerting, remediation, and reporting to streamline operations.
- Apply AIOps and ML for anomaly detection and predictive insights.
- Collaborate with DevOps, infrastructure, and application teams to embed monitoring into CI/CD pipelines.
- Document architectures, procedures, and operational playbooks.
Required Skills :
- Hands-on experience with observability tools : Prometheus, Grafana, ELK Stack, Fluentd, Dynatrace, New Relic, OpsRamp.
- Strong scripting knowledge in Python and Ansible.
- Familiarity with tracing tools (e.g., Jaeger, Zipkin) and REST API integrations.
- Working knowledge of AIOps concepts and predictive monitoring.
- Solid understanding of ITIL processes and service management frameworks.
- Familiarity with security monitoring and compliance considerations.
- Excellent analytical, troubleshooting, and documentation skills.
(ref : hirist.tech)