MoEngage is looking for a driven NOC Engineer to join our team in Hyderabad, Telangana, India. In this role, you'll be instrumental in maintaining the stability and performance of our large-scale distributed systems, including one of the largest Elasticsearch clusters and extensive MongoDB installations. If you thrive on 24x7 monitoring, troubleshooting, and ensuring high availability, you'll play a critical role in our operations.
What You'll Do
- Work with one of the largest Elasticsearch cluster deployments.
- Work on a large-scale MongoDB installation.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system reliability.
- Work closely with team members to ensure best practices and strategic goals are incorporated into development work.
- Collaborate with other engineering teams to identify and anticipate changing requirements and opportunities to improve the development environment. Define and iterate on team processes and collaboration, with a focus on overall team velocity, working with stakeholders such as product and design.
- Implement best practices, challenge the status quo, and stay updated on industry and technical trends, changes, and developments so that the team is always striving for best-in-class work.
- Manage capacity, build security into every layer, and reduce cost.
- Implement secure networking, key management, user management, access management, process management, and image management.
Must-Have for the Role
- Installation, configuration, and networking on Linux.
- Adding resources (disk space / CPU / RAM) on virtual machines.
- 24x7 monitoring of services and applications.
- Monitoring storage space, system partitions, server health, and SLAs.
- Troubleshooting host unreachable / node-down issues.
- Troubleshooting downtime and SLA breaches.
- Working on all kinds of user-generated and auto-generated tickets.
- Knowledge of ticketing tools (Zendesk, Jira).
- Raising emergency changes / normal changes for any changes on the server.
- Runbook management.
Skills
- Shell scripting.
- Linux / Ubuntu / Debian.
- System administration.
- Server management.
- Plus: AWS, Python, networking, security.
- Debugging skills in REST APIs.
- Basic SQL queries.
- Awareness of secure development processes and practices.
- Awareness of information security concepts and best practices.
Skills Required
Shell Scripting, System Administration, Server Management, Debugging, AWS, Python, Networking