Talent.com
This job offer is not available in your country.
Observability Engineer – SRE

Observability Engineer – SRE

GSPANNGurugram, Haryana, India
5 hours ago
Job description

Description GSPANN is hiring an Observability Engineer with expertise in Site Reliability Engineering (SRE) The role focuses on leveraging SRE principles, automation, and AI-driven observability to enhance reliability and scalability across cloud and ERP environments.

Role and Responsibilities

  • Leverage Application Performance Management (APM) tools such as Dynatrace and Prometheus to monitor, analyze, and enhance system performance.
  • Write and maintain scripts using Python or Java to automate monitoring tasks and streamline alerting mechanisms.
  • Deploy and manage Splunk to handle log analysis, system monitoring, and troubleshooting of production issues.
  • Analyze user behavior and application performance using Real User Monitoring (RUM) tools such as Quantum Metrics to drive user experience improvements.
  • Ensure the reliability and efficiency of Enterprise Resource Planning (ERP) applications through proactive monitoring and support.
  • Incorporate Site Reliability Engineering (SRE) principles to improve system uptime, scalability, and fault tolerance.
  • Respond to incidents swiftly, resolving them to minimize business disruptions and ensure service continuity.
  • Optimize system performance proactively, using data-driven monitoring insights and continuous analysis.
  • Collaborate with development and operations teams to integrate observability tools seamlessly and align monitoring with deployment workflows.

Skills and Experience

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Bring 12-15 years of experience in observability engineering or a similar technical role.
  • Hold certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or equivalent.
  • Have experience working on cloud platforms like Amazon Web Services (AWS) and Microsoft Azure.
  • Understand and apply performance optimization frameworks and related best practices in production environments.
  • Demonstrate proficiency with APM tools (e.g., Dynatrace, Prometheus), scripting languages (Python, Java), and Splunk.
  • Possess hands-on experience with RUM tools like Quantum Metrics and the monitoring of ERP applications.
  • Show a strong grasp of SRE principles and practices applied in real-world systems.
  • Exhibit excellent problem-solving abilities and communication skills.
  • Adapt easily to fast-paced and dynamic environments.
  • Create a job alert for this search

    Observability Engineer • Gurugram, Haryana, India