Key Responsibilities
System Monitoring & Health Management
- Proactively monitor system health and implement corrective actions before issues arise.
- Conduct regular system monitoring and health checks.
- Ensure all critical system backups are completed successfully.
- Take proactive measures for system performance and uptime optimization.
- Deliver and maintain uptime availability targets.
Incident, Problem & Change Management
Manage incidents and problems in alignment with established SLAs.Perform root cause analysis on unplanned outages and implement preventive measures.Responsible for upgrade tasks including service levels, kernel, and firmware.Coordinate and manage outsourced IT partners for service delivery.System Optimization & Performance
Optimize system and application performance through continuous monitoring.Establish, monitor, and report landscape performance KPIs.Initiate and manage improvement projects where KPIs are not met.Collaborate with cross-functional teams to ensure consistent performance across the landscape.Conduct technical analysis and support infrastructure implementations and upgrades.Governance & Compliance
Maintain documentation and technical SOPs; institutionalize best practices.Ensure adherence to landscape methodology, design, security, and general IT controls.Review compliance status and implement corrective actions for non-compliance.Ensure operations are aligned with the company's code of conduct and social responsibility guidelines.Detect and report ethical breaches, corruption, or code of conduct violations immediately.Housekeeping & Maintenance
Schedule and monitor housekeeping activities (e.g., archiving as per retention policies).Ensure regular execution and updates of housekeeping jobs.Ensure effective backup, restoration, and system refresh processes.Resilience & Availability
Ensure proactive application maintenance to prevent disruptions.Design high-availability plans covering backups, archiving, housekeeping, and health checks.Minimize business disruption through fast resolution and root cause analysis.Technical Skills & Tools
Proficiency in Microsoft Windows, Office 365, Linux, SCCM, VMware, Active Directory, Antivirus, Endpoint Security.Cloud computing expertise : AWS / Azure.Remote Desktop Support, Password Management.Network fundamentals and troubleshooting skills.Work Environment
Experience in a multi-site, manufacturing organization.Prior experience in at least one Microsoft 365 implementation.Familiarity with IT service frameworks such as ISO / IEC 20000, ITIL V5, COBIT5.Skills Required
VMware, Cloud Computing, Linux, Active Directory, Firmware, Information Technology, Antivirus