No longer accepting applications
Hybrid Cloud Platform Lead

Tata Communications Transformation Services (TCTS)
Pune, Republic Of India, IN
6 days ago
Job description

Title : Senior Manager - Cloud SME "Red Hat OpenShift" (Private Cloud)

Location : Pune

Experience : 12+ years

Role Summary :

We are seeking a highly skilled Hybrid Cloud Platform Engineer to design, implement, and manage our consolidated platform built on Red Hat OpenShift Container Platform (RHOCP). This unique role requires expertise in running both containerized workloads and traditional Virtual Machines (VMs) using OpenShift Virtualization. The ideal candidate will be a deep technical expert in cloud orchestration, VM lifecycle management, and deploying comprehensive observability and analytics solutions across the hybrid environment to ensure performance, reliability, and cost efficiency.

The Senior NOC / SOC Operations Engineer will manage and operate Telco Cloud platforms built on Red Hat OpenShift and Private Cloud Virtualization environments supporting VNF / CNF workloads. This role requires strong hands-on experience in VM lifecycle management, orchestration, cloud management, and AI / ML-driven analytics. The engineer will ensure 24x7 availability, proactive monitoring, fault management, and lifecycle support of cloud-native and virtualized network functions in a Telco-grade production setup.

Key Responsibilities :

Cloud & Virtualization Operations

  • Manage daily operations of Red Hat OpenShift (Kubernetes-based container platform) and RHOSP-based virtualization environments.
  • Perform VM lifecycle operations (provisioning, scaling, migration, snapshot, decommissioning).
  • Monitor and troubleshoot compute, storage, and network resources within Red Hat Private Cloud.
  • Maintain and optimize hypervisors (KVM / QEMU), ensuring performance and availability SLAs.
  • Manage tenant configurations, quotas, and multi-tenant isolation within RHOSP.

Orchestration & Automation

  • Operate and maintain Red Hat CloudForms / Ansible Automation Platform for orchestration workflows.
  • Support Day-0 to Day-2 operations through policy-driven automation templates.
  • Integrate orchestration with VNFM / NFVO components for VNF / CNF deployments and scaling.
  • Ensure alignment of orchestration workflows with ITSM change management processes.

VNF / CNF & Telco Cloud Operations

  • Perform lifecycle management of VNFs and CNFs (onboarding, instantiation, scaling, termination).
  • Troubleshoot network function issues in coordination with Network Engineering and NFV Orchestration teams.
  • Validate service chains, SDN underlay / overlay connectivity, and application availability.
  • Coordinate with OEM vendors for updates, patches, and RCA of incidents.

AI / ML-based Analytics & Observability

  • Utilize AI-ML analytics platforms for predictive fault detection, anomaly identification, and performance optimization.
  • Support implementation of closed-loop automation through analytics-driven triggers.
  • Participate in continuous improvement initiatives for automated RCA and alert correlation.

Monitoring, Incident & Change Management

  • Monitor infrastructure KPIs – CPU / memory utilization, pod / container health, network throughput, and storage latency.
  • Respond to alerts from monitoring tools (Zabbix, Prometheus, Grafana, or OpenShift Console).
  • Manage incidents, problems, and change activities following ITIL guidelines.
  • Maintain configuration documentation, CMDB updates, and operational dashboards.

Key Skills and Certifications :

  • Platform / Containerization - Red Hat OpenShift Container Platform (RHOCP), Kubernetes, Operators, CRI-O, Pod Networking (SDN / CNI).
  • Virtualization - OpenShift Virtualization (KubeVirt), VM Lifecycle Management (provisioning, migration, snapshots), KVM, virt-launcher pods.
  • Orchestration / Automation - Ansible (Playbooks, Roles, Automation Platform), GitOps (ArgoCD or OpenShift GitOps), Infrastructure-as-Code (IaC), Tekton or Jenkins.
  • Observability / Analytics - Prometheus (Metrics), Grafana (Visualization), Loki / Vector / Fluentd (Logging), Jaeger / OpenTelemetry (Tracing), Data analysis for capacity planning.
  • Networking / Storage - SDN, CNI, Load Balancing, Ingress / Egress, Red Hat OpenShift Data Foundation (ODF) or Ceph / NFS / iSCSI, Persistent Volume Claims (PVCs).

Cloud Operations Requirements w.r.t. RHOCP :

1. Cluster Management and Reliability

  • High Availability (HA) : Implement and maintain HA for the control plane (3 or 5 Master nodes) and worker nodes across availability zones / domains.
  • Automated Lifecycle : Use OpenShift Operators and the Cluster Version Operator (CVO) for automated, non-disruptive upgrades, patches, and security fixes for the platform and its add-ons.
  • Security : Proactive management of Red Hat Enterprise Linux CoreOS (RHCOS) for control plane nodes, using immutability to enhance security and simplify patches.
  • Disaster Recovery (DR) : Implement robust backup and restore strategies for cluster configuration, etcd data, and critical workloads using tools like OpenShift APIs for Data Protection (OADP) or similar solutions.

2. Virtualization Operations

  • Unified Management : Manage the VM lifecycle (provisioning, scaling, decommissioning) using Kubernetes API objects (like VirtualMachine, VirtualMachineInstance) and the OpenShift Console, treating VMs as native cluster resources alongside containers.
  • Workload Migration : Utilize the Migration Toolkit for Virtualization (MTV) to streamline the move of existing VMs from external virtualization platforms (like VMware or RHEV) to OpenShift Virtualization.
  • Compute Consistency : Ensure consistent application of network and storage policies for both container pods and KubeVirt VMs.

3. Observability and Analytics

  • Unified Monitoring : Consolidate metrics, logs, and traces from both container pods and virtual machines into a single platform (e.g., OpenShift's integrated Prometheus / Grafana stack).
  • Proactive Alerting : Configure alerts based on predefined SLOs / SLIs for the health of the OpenShift cluster, underlying infrastructure, and key VM / application performance indicators (CPU, Memory, Disk IO).
  • Capacity Planning : Regularly analyze historical usage data from OpenShift (including VM utilization) to predict future resource needs and optimize cost efficiency across the hybrid cloud footprint.
  • Troubleshooting : Establish runbooks and procedures for utilizing the observability data to quickly isolate the root cause of issues, whether they stem from the container layer, the virtualization layer, or the underlying cloud / hardware.