Company Description
Sutantra Systems Pvt Ltd is a leading provider of innovative cloud solutions and managed services. We help businesses transform their operations by leveraging cutting-edge open-source technologies. Our team is composed of passionate experts dedicated to delivering reliability, performance, and excellence in every client engagement. We are looking for a seasoned OpenStack Platform professional to join our dynamic team and drive the success of our mission-critical cloud environments.
Role Description
We are seeking a highly skilled and certified Senior OpenStack Platform (OSP) Engineer with extensive experience in managed services. The ideal candidate will possess deep, hands-on technical expertise in designing, implementing, automating, and, most importantly, managing and supporting production-grade OSP environments. You will be responsible for ensuring the highest levels of availability, performance, and security for our clients' OpenStack clouds, acting as a subject matter expert and escalation point for complex issues.
Key Responsibilities
1. Platform Management & Operations :
- Provide expert-level administration, operational support, and lifecycle management for large-scale, production OpenStack Platform (OSP) environments.
- Perform day-to-day operations including node health checks, service monitoring, capacity planning, and performance tuning.
- Manage and troubleshoot the core OpenStack services (Nova, Neutron, Cinder, Swift, Keystone, Glance, Horizon, Heat, Ceilometer).
- Implement and maintain high-availability and disaster recovery configurations for the control plane and workloads.
2. Managed Services & Client Support :
- Serve as a primary technical point of contact for managed services clients, ensuring their SLAs are met and exceeded.
- Proactively monitor client environments using tools like Prometheus, Grafana, ELK Stack, or Red Hat CloudForms.
- Handle incident management, including troubleshooting complex issues, performing root cause analysis (RCA), and implementing preventive measures.
- Execute planned maintenance activities, upgrades, and patches with minimal client impact.
3. Networking & Storage :
- Administer and troubleshoot advanced Neutron networking configurations, including SDN (Open vSwitch, OVN), VLANs, VXLANs, security groups, and floating IPs.
- Manage storage backends (Ceph / RADOS, Cinder, Swift) for block, object, and file storage, ensuring data integrity and performance.
- Diagnose and resolve persistent network connectivity and storage performance issues.
4. Automation & Infrastructure as Code (IaC) :
- Automate repetitive operational tasks using Ansible (a must-have), Python, Bash, or other scripting languages.
- Develop, maintain, and version control IaC templates using Heat, Terraform, or similar tools.
- Utilize Git for source control management of all automation scripts and configurations.
5. Security & Compliance :
- Harden the OpenStack environment in compliance with industry standards (e.g., CIS Benchmarks).
- Manage security patches and vulnerabilities using Satellite or equivalent tools.
- Implement and maintain identity and access management (Keystone) policies and multi-tenancy isolation.
6. Collaboration & Documentation :
- Create and maintain detailed technical documentation, including architecture diagrams, operational runbooks, and knowledge base articles.
- Collaborate effectively with cross-functional teams, including systems administrators, network engineers, and developers.
- Mentor junior team members and share knowledge through formal and informal training sessions.
Mandatory Skills and Qualifications (Must-Have)
- Experience : 5-6 years of proven, hands-on experience in deploying, managing, and troubleshooting OpenStack Platform (OSP) in production environments.
- Managed Services Focus : Demonstrable experience in a managed services or MSP (Managed Service Provider) environment, with a strong client-facing support ethos.
- Networking : In-depth knowledge of Linux networking and advanced OpenStack Neutron networking.
- Storage : Hands-on experience with storage technologies, preferably Ceph Storage.
- Automation : Expert-level proficiency in Ansible for automation and configuration management. Strong scripting skills in Python and / or Bash.
- Operating Systems : Expert-level knowledge of Linux (RHEL 7 / 8 / 9).
- Troubleshooting : Excellent problem-solving skills and a methodical approach to diagnosing and resolving complex technical issues.
- Core OpenStack Services : Deep understanding of the architecture and operation of core projects (Nova, Neutron, Cinder, Keystone, Glance).
Good-to-Have Skills & Qualifications
Certifications :
- Red Hat Certified Engineer (RHCE)
- Red Hat Certified Specialist in OpenStack (EX310)
- Red Hat Certified Architect (RHCA), any concentration
- Red Hat Certified Specialist in Advanced Automation : Ansible Best Practices (DO447)
Additional Technologies :
- Experience with containerization technologies (Docker, Podman, Kubernetes / OpenShift).
- Knowledge of infrastructure monitoring and logging tools (Prometheus, Grafana, Nagios, Zabbix, ELK Stack).
Soft Skills : Excellent verbal and written communication skills, with the ability to explain complex technical concepts to non-technical stakeholders.
What We Offer
- A competitive salary and performance-based bonuses.
- Opportunities for professional development and continuous learning, including support for additional certifications.
- A collaborative and innovative work environment with cutting-edge technology.
- Flexible work hours and remote work options.