Designation : DevOps Engineer / Lead DevOps Engineer
Experience : 8-15 years
Location : Remote
Summary
Work with talented DevOps and Cloud operations engineers and architects to deliver Sycamore SaaS product offerings to our Bio-Pharma customers using exciting, cutting-edge technologies. Develop, execute, maintain, and improve procedures, automation scripts, and infrastructure implementations to support Sycamore SaaS Operations.
Roles and Responsibility
Specific roles and responsibilities include :
- Provide technical expertise and leadership when needed to SaaS Operations Production Operations teams.
- Help Implement the Cloud Operations team's goals and deliverables as determined by Sycamore Leadership
- Ensure smooth operations of Sycamore SaaS products
- Take Complete ownership of Customer Implementations, including SLA and SLO.
- Automate, enhance and maintain critical processes in Cloud Operations, such as Change Control, Monitoring & Alerting
- Drive critical processes in SaaS Operations such as Change Control, Problem & Incident Management, and Reporting, as well as key tools for Monitoring & Alerting
- Drive Disaster Recovery and failover procedures, training, testing, and team readiness
- Coordinate focus groups across all teams on process improvements and technical improvements that lead to better stability and reliability
- Contribute to process improvements and technical improvements that lead to increased stability and reliability
- Support continuous improvements in SaaS Operations by
- developing platform services and tooling for modern cloud operations, including metrics monitoring, CI / CD pipelines, etc.
- improving automation of provisioning, deployment, monitoring, alerting, and escalation
- Support Secure operations by
- implementing best-in-class recommendations for secure operations
- Carry out ongoing Production Ops activities with precision and quality
- Define, build, and deliver a high-quality SaaS Platform for Work with third-party vendors and partners to help develop a complete solution set on the SaaS platform
- Representing Cloud Operations in InfoSec meetings and developing and driving secure procedures
- Help obtain and maintain various certifications
- Being a good team player & a leader when needed for a high-performance Cloud / SaaS delivery team by
- Reviewing personal / team performance, quality reviews,
- Manage operations and operational issues.
- Establish a culture of high performance, ownership, delivery focus, and continuous improvement.
Excellence in Operations
Implement and carry out procedures and policies to ensure high-quality SaaS operations with appropriate levels of management controls.Act as an internal contact for platform services issues for a customerWork with cross-functional departments : Sales, Professional Service, Customer Support, Engineering, and QADesired Experience
Has experience in implementing, managing, maintaining, and decommissioning complex cloud-based Information system components in a secure and controlled manner.Must be experienced in coordinating cross-functional teams such as support, escalation, and engineering software teams to address product issues successfully.Strong understanding of how to build, scale, and manage complex multi-product / service environmentsRecord of building lean, automated, scalable support structures versus labor-intensive environments.Strong innovation mindset, analytical skills, excellent oral and written communication skills, and experience effectively communicating project / program mission and objectives.Must exhibit a practical customer service attitude and lead a team in resolving difficult customer situations.Skills Required
Very Strong Linux Knowledge & Troubleshooting SkillsScripting using – Bash, Python, PowerShell, etcKubernetes, helm ChartsTerraform, AnsibleWindows Terminal Services, AD, LDAPHands-on experience in cloud technology – AWS, Azure – AWS preferredChange, Problem & Incident ManagementImplementation awareness of Vulnerability / Penetration Testing, SecurityStrong Networking SkillsTools and frameworks used for monitoring, performance management, loggingCI / CD pipelineSRE – Including Datadog.DatadogCertification
RHELAWSKubernetes