Monitoring and Observability Operations Engineer Job Description
Responsibilities
Support and resolve incidents and requests for monitoring and observability.
Troubleshoot and resolve incidents related to system performance and availability.
Collaborate with the Engineering team to escalate and address complex issues.
Assist in keeping infrastructure and application performance monitoring (APM) agents up to date.
Assist in analyzing and addressing vulnerabilities.
Collaborate with internal groups to meet their monitoring and observability requirements.
Configure monitoring for server and application inventory.
Adhere to corporate change management policies.
Perform routine maintenance tasks.
Identify and assist in automating routine tasks.
Participate in team meetings.
Contribute to the creation of standards and adhere to them.
Perform any other duties as assigned.
Requirements
Bachelor's degree in Computer Science, Information Technology, or a related field.
Knowledge of monitoring and observability solutions, with relevant hands-on experience in monitoring tools (New Relic, SolarWinds, and Zabbix).
Strong experience with Windows and Unix-based systems and command-line interfaces.
Understanding of network protocols, databases, and cloud technologies.
Programming or scripting experience: Shell, PowerShell, .NET, Java, Python, etc.
Familiarity with Git or other version control systems.
Understanding of IT service management (ITSM).
Excellent troubleshooting and problem-solving skills.
Ability to work effectively in a team environment and collaborate with cross-functional teams.
Strong written and verbal communication skills.
Relevant certifications (e.g., New Relic Certified Performance Pro, SolarWinds Certified Professional) are a plus.
Linux • Pune, India