Job Title : Senior OSP Managed Services Engineer
Experience : 5-6 Years of Hands-on Experience
Location : Delhi NCR
Job Type : Full-Time
Company Overview
Sutantra Systems Pvt Ltd is a leading provider of innovative cloud solutions and managed services. We help businesses transform their operations by leveraging cutting-edge open-source technologies. Our team is composed of passionate experts dedicated to delivering reliability, performance, and excellence in every client engagement. We are looking for a seasoned OpenStack Platform professional to join our dynamic team and drive the success of our mission-critical cloud environments.
Job Summary
We are seeking a highly skilled and certified Senior OpenStack Platform (OSP) Engineer with extensive experience in managed services. The ideal candidate will possess deep, hands-on technical expertise in designing, implementing, automating, and, most importantly, managing and supporting production-grade OSP environments. You will be responsible for ensuring the highest levels of availability, performance, and security for our clients' OpenStack clouds, acting as a subject matter expert and escalation point for complex issues.
Key Responsibilities
1. Platform Management & Operations :
- Provide expert-level administration, operational support, and lifecycle management for large-scale, production OpenStack Platform (OSP) environments.
- Perform day-to-day operations including node health checks, service monitoring, capacity planning, and performance tuning.
- Manage and troubleshoot the core OpenStack services (Nova, Neutron, Cinder, Swift, Keystone, Glance, Horizon, Heat, Ceilometer).
- Implement and maintain high-availability and disaster recovery configurations for the control plane and workloads.
2. Managed Services & Client Support :
- Serve as a primary technical point of contact for managed services clients, ensuring their SLAs are met or exceeded.
- Proactively monitor client environments using tools like Prometheus, Grafana, ELK Stack, or Red Hat CloudForms.
- Handle incident management, including troubleshooting complex issues, performing root cause analysis (RCA), and implementing preventive measures.
- Execute planned maintenance activities, upgrades, and patches with minimal client impact.
3. Networking & Storage :
- Administer and troubleshoot advanced Neutron networking configurations, including SDN (Open vSwitch, OVN), VLANs, VXLANs, security groups, and floating IPs.
- Manage storage backends (Ceph / RADOS, Cinder, Swift) for block, object, and file storage, ensuring data integrity and performance.
- Diagnose and resolve persistent network connectivity and storage performance issues.
4. Automation & Infrastructure as Code (IaC) :
- Automate repetitive operational tasks using Ansible (a must-have), Python, Bash, or other scripting languages.
- Develop, maintain, and version control IaC templates using Heat, Terraform, or similar tools.
- Utilize Git for source control management of all automation scripts and configurations.
5. Security & Compliance :
- Harden the OpenStack environment in compliance with industry standards (e.g., CIS Benchmarks).
- Manage security patches and vulnerabilities using Satellite or equivalent tools.
- Implement and maintain identity and access management (Keystone) policies and multi-tenancy isolation.
6. Collaboration & Documentation :
- Create and maintain detailed technical documentation, including architecture diagrams, operational runbooks, and knowledge base articles.
- Collaborate effectively with cross-functional teams, including systems administrators, network engineers, and developers.
- Mentor junior team members and share knowledge through formal and informal training sessions.
Mandatory Skills and Qualifications (Must-Have)
- Experience : 5-6 years of proven, hands-on experience in deploying, managing, and troubleshooting OpenStack Platform (OSP) in production environments.
- Managed Services Focus : Demonstrable experience in a managed services or MSP (Managed Service Provider) environment, with a strong client-facing support ethos.
- Networking : In-depth knowledge of Linux networking and advanced OpenStack Neutron networking.
- Storage : Hands-on experience with storage technologies, preferably Ceph Storage.
- Automation : Expert-level proficiency in Ansible for automation and configuration management. Strong scripting skills in Python and / or Bash.
- Operating Systems : Expert-level knowledge of Linux (RHEL 7 / 8 / 9).
- Troubleshooting : Excellent problem-solving skills and a methodical approach to diagnosing and resolving complex technical issues.
- Core OpenStack Services : Deep understanding of the architecture and operation of core projects (Nova, Neutron, Cinder, Keystone, Glance).
Good-to-Have Skills & Qualifications
- Certifications :
  - Red Hat Certified Engineer (RHCE)
  - Red Hat Certified Specialist in OpenStack (EX310)
  - Red Hat Certified Architect (RHCA) – any concentration
  - Red Hat Certified Specialist in Advanced Automation : Ansible Best Practices (DO447)
- Additional Technologies :
  - Experience with containerization technologies (Docker, Podman, Kubernetes / OpenShift).
  - Knowledge of infrastructure monitoring and logging tools (Prometheus, Grafana, Nagios, Zabbix, ELK Stack).
- Soft Skills : Excellent verbal and written communication skills, with the ability to explain complex technical concepts to non-technical stakeholders.
What We Offer
- A competitive salary and performance-based bonuses.
- Opportunities for professional development and continuous learning, including support for additional certifications.
- A collaborative and innovative work environment with cutting-edge technology.
- Flexible work hours and remote work options.