Job Role
Very good knowledge of Log analysing & monitoring tools like Prometheus, Loki, Dynatrace, Grafana & SolarWinds. Understanding of Infrastructure environments Cloud, VMware, storage, networks, databases, etc.
What you'll be responsible for?
- Design and development of security policies, standards, and procedures in accordance with organization goals.
- Responsible for proactive monitoring of alerts (Network, Infra, Applications) and taking corrective actions.
- Responsible for Incident Management life cycle & Service requests fulfilment
- Responsible for Incident logging, accurately tracks and documents all incidents.
- Adherence to the process compliance
- Adherence to the SLAs defined for the platform, Service uptime.
- Coordination with cross-group peers both proactively and reactively produces quality documentation and share with the appropriate team members.
- Responsible to develop SOP documents.
- Ability to deep dive into identifying the root cause of various service-impacting events and optimizing.
- Act as a First Point of Contact for incidents, escalations, and business-impacting technology issues
- To ensure the maximum possible service availability and performance of the platforms
- Responsible for continuous improvement of the process science.
Qualification and other skills
Experience of 5- 8 years in NOCExperience in Alert / Incident Management and a good understanding of SLAsTroubleshooting, Problem-solving & Strong presentation skillsAnalytical and communication skillsWhat you'd have?
Strong knowledge of Linux, Network & database queryingKnowledge of asset managementVery good knowledge of Log analysing & monitoring tools like Prometheus, Loki, Dynatrace, Grafana & SolarWindsUnderstanding of Infrastructure environments Cloud, VMware, storage, networks, databases, etc.Strong Linux, Networking, Log analysing, and database querying skills.Must have experience with monitoring tools like Prometheus, Loki, Grafana, and Dynatrace & building monitoring dashboards.Experience in alerts mitigation & optimization - Knowledge of the ITIL frameworkHands-on exp with observability tools will be an added advantage.Must have expertise in maintaining / updating asset management.Certifications : ITIL foundation, AZ-900, Shell Scripting, Python, Hardware & networking.Why join us?
Impactful Work : Play a pivotal role in safeguarding Tanla's assets, data, and reputation in the industry.Tremendous Growth Opportunities : Be part of a rapidly growing company in the telecom and CPaaS space, with opportunities for professional development.Innovative Environment : Work alongside a world-class team in a challenging and fun environment, where innovation is celebrated. Tanla is an equal opportunity employer.We champion diversity and are committed to creating an inclusive environment for all employees
https : / / www.tanla.com