Job Title : Monitoring & Observability Engineer Datadog Specialist
Experience : 4+ Years
Location : [Specify Location or Remote]
Job Type : Full-Time
Job Summary :
We are looking for a talented Observability Engineer with hands-on experience in Datadog to enhance our infrastructure and application monitoring capabilities. The ideal candidate will have a strong understanding of performance monitoring, alerting, and observability in cloud-native Responsibilities :
- Design, implement, and maintain observability solutions using Datadog for applications, infrastructure, and cloud services.
- Set up dashboards, monitors, and alerts to proactively detect and resolve system issues.
- Collaborate with DevOps, SRE, and application teams to define SLOs, SLIs, and KPIs for performance monitoring.
- Integrate Datadog with services such as AWS, Kubernetes, CI / CD pipelines, and logging tools.
- Conduct performance tuning and root cause analysis of production incidents.
- Automate observability processes using infrastructure-as-code and scripting (e.g., Terraform, Python).
- Stay up-to-date with the latest features and best practices in Datadog and observability Skills :
- 4+ years of experience in monitoring / observability, with 2+ years hands-on experience in Datadog
- Strong experience with Datadog APM, infrastructure monitoring, custom metrics, and dashboards
- Familiarity with cloud platforms like AWS, GCP, or Azure
- Experience monitoring Kubernetes, containers, and microservices
- Good knowledge of log management, tracing, and alert tuning
- Proficient with scripting (Python, Shell) and IaC tools (Terraform preferred)
- Solid understanding of DevOps / SRE practices and incident Skills :
- Datadog certifications (e.g., Datadog Certified Observability Engineer)
- Experience integrating Datadog with CI / CD tools, ticketing systems, and chatops
- Familiarity with other monitoring tools (e.g., Prometheus, Grafana, New Relic, Splunk)
- Knowledge of performance testing tools (e.g., JMeter, k6)
(ref : hirist.tech)