Tools & Integration Automation Datadog SME
Location: Pune / Hyderabad / Bangalore
Experience: 12-15 years
We are looking for a seasoned Observability & Infrastructure Automation Consultant (Datadog SME) with strong implementation and maintenance expertise in enterprise-scale environments. The ideal candidate will specialize in designing, deploying, and sustaining observability platforms and automation frameworks to ensure high availability, performance, and operational efficiency.
- Demonstrable experience managing complex APM projects.
- Strong communication skills, with the ability to interact with our business, product, and development teams and to articulate the business benefit of a technology solution.
- Ability to think strategically, as well as tactically, and to exercise sound judgment in problem-solving and priority / goal setting.
- Must be a proficient presenter and communicator.
- Experience with APM, infrastructure metrics, distributed tracing, and log aggregation using Datadog. Configure and manage Datadog for infrastructure monitoring, application monitoring (APM), and log management.
- Implement dashboards, alerts, and monitors to provide comprehensive visibility into system performance, availability, and reliability.
- Integrate Datadog with cloud platforms (AWS, Azure, GCP) and on-premises infrastructure.
- Define and deploy monitoring strategies using Datadog’s metrics, traces, logs, and events.
- Set up automated alerts that notify teams of anomalies and potential issues.
- Work with teams to optimize monitoring strategies and reduce false positives.
- Experience in the implementation, solution development, and administration of automation or process orchestration tools such as Ayehu, Arago, StackStorm, or Ansible.
- Working knowledge of scripting languages such as PowerShell, shell, and Python.
- Experience in monitoring as code.
- Ensure seamless integration of observability, automation & ITSM delivery workflows.
- Deep knowledge of event management, AIOps, and observability configurations.
- Knowledge of ITIL foundations.
- Strong experience integrating observability, ITSM, and automation tools.
- Strong experience developing integration scripts and programs using SOAP and REST web services.
- Outstanding problem-solving skills; a fast learner who is open to trying different tools, technologies, and concepts.
- Self-motivated individual, able to work independently and in coordination with a team.
- Perform regular health checks, upgrades, and troubleshooting for observability and automation tools.
- Drive automation-first strategies to reduce manual interventions and improve reliability.
- Partner with DevOps, SRE, and infrastructure teams to align observability and automation goals.
- Conduct training for customer stakeholders, and prepare and maintain relevant training materials.
- Document processes, standards, and best practices for long-term maintainability.
- Familiarity with containerization and orchestration (Docker, Kubernetes) is a plus.
- Define observability standards, guidelines, and best practices for enterprise environments.