Perform trouble shooting, analysis, research and resolution using advanced query and programming skills, Conducts root cause analysis.Look for areas of improvement in monitoring, application stability.Should be open to learn FCIinhouse Product and other technologies and excel them.Communicate with line of business and management the overall status and health of the application.Be the first point of contact for production issues, support requests and alerts.Ensure issues and outages are properly documented.Proactively monitor application and infrastructure alerts and be able to react quickly.Documents major maintenance events and other significant product related issuesKey part of constant improvement process. Recommending and implementing solutions to mitigate repeat issues and early detection.Directs and coordinates operation, maintenance, and repair of equipment and systems in field installations and internal teams.Communicate / escalate issues to appropriate functional areas with supporting evidence from application logs, pcap, service trend data etc.Writes and submits Engineering Change Requests (ECRs) to engineering to correct product performance deficiencies or reliability problems.Implement and manage service monitoring tools including agent-based application monitoring, log analysis / trending and health metrics.Documentation of application flows, monitoring techniques and resolution playbooks.Skills Required
Log Analysis, Root Cause Analysis, Documentation, Incident Management, Troubleshooting, Monitoring