The Cloud Reliability, Principal provides expertise in the developing, implementation and monitoring of cloud applications. Will work with other areas of the development and support cycles to provide recommendations and will mentor team members.
Duties Responsibilities
- Application log analysis and monitor live production, act on app log action items and live monitoring alerts.
- Provides support as SME to groups outside the IT organization.
- Collaborates with development, operations, and infrastructure teams to develop sound automation solutions.
- Orchestrates the automation of web application deployments.
- Looks for opportunities to optimize and enable consistent automated deployments.
- Manages Change Control process to ensure control, coordination and minimal customer impact.
- Provides and implements recommendations on automation solutions.
- Monitors application deployment.
- Manages Production Infrastructure - Disaster Recovery, Backup Management, etc.
- Recommends and implements best practices.
- Keeps up with external trends to incorporate into products and processes.
- Acts an evangelist for Epicor at inside and outside events.
Knowledge, Skills Abilities
Advanced problem solving and analytical skills.Mentorship skills.Ability to conduct solution design sessions.Advanced troubleshooting skills.Creative problem-solving skills.In depth knowledge of and experience with development tools, application frameworks, and testing tools.In depth knowledge of CI / CD tools (Jenkins) required.Expert knowledge of SQL.Advanced knowledge of AWS required.Advanced knowledge of Linux required.DevOps experience is a plus.Qualifications
9+ years applicable experience and demonstrated success / knowledge.3+ years of specialized / industry experience.Bachelor s degree (or equivalent experience).AWS Certified SysOps Administrator - Associate certification required.Skills Required
Devops, Sql, Linux, Aws