Observability Engineer – SRE

GSPANN, Gurugram, India
20 hours ago
Job description

GSPANN is hiring an Observability Engineer with expertise in Site Reliability Engineering (SRE). The role focuses on applying SRE principles, automation, and AI-driven observability to improve reliability and scalability across cloud and ERP environments.

Role and Responsibilities

  • Leverage Application Performance Management (APM) tools such as Dynatrace and Prometheus to monitor, analyze, and enhance system performance.
  • Write and maintain scripts using Python or Java to automate monitoring tasks and streamline alerting mechanisms.
  • Deploy and manage Splunk to handle log analysis, system monitoring, and troubleshooting of production issues.
  • Analyze user behavior and application performance using Real User Monitoring (RUM) tools such as Quantum Metric to drive user experience improvements.
  • Ensure the reliability and efficiency of Enterprise Resource Planning (ERP) applications through proactive monitoring and support.
  • Incorporate Site Reliability Engineering (SRE) principles to improve system uptime, scalability, and fault tolerance.
  • Respond to incidents swiftly, resolving them to minimize business disruptions and ensure service continuity.
  • Optimize system performance proactively, using data-driven monitoring insights and continuous analysis.
  • Collaborate with development and operations teams to integrate observability tools seamlessly and align monitoring with deployment workflows.

Skills and Experience

  • Hold a bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Bring 12-15 years of experience in observability engineering or a similar technical role.
  • Hold certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or equivalent.
  • Have experience working on cloud platforms like Amazon Web Services (AWS) and Microsoft Azure.
  • Understand and apply performance optimization frameworks and related best practices in production environments.
  • Demonstrate proficiency with APM tools (e.g., Dynatrace, Prometheus), scripting languages (Python, Java), and Splunk.
  • Possess hands-on experience with RUM tools like Quantum Metric and with monitoring ERP applications.
  • Show a strong grasp of SRE principles and practices applied in real-world systems.
  • Exhibit excellent problem-solving abilities and communication skills.
  • Adapt easily to fast-paced and dynamic environments.