Senior Cloud DevOps Manager – GCP | AI / ML | GenAI
Job Summary :
We are seeking a visionary and technically proficient Senior Cloud DevOps Manager to lead our cloud infrastructure and DevOps strategy. This role is pivotal in scaling our AI / ML and GenAI initiatives on Google Cloud Platform (GCP), ensuring robust, secure, and automated infrastructure using modern DevOps practices and container orchestration with Kubernetes.
Key Responsibilities :
- Lead and mentor a high-performing team of DevOps and cloud engineers.
- Architect and manage scalable, secure, and cost-efficient infrastructure on GCP.
- Design and maintain CI / CD pipelines for traditional applications and AI / ML / GenAI workloads.
- Collaborate with data science and ML engineering teams to implement MLOps and GenAI pipelines.
- Oversee containerization strategies using Docker and orchestration with Kubernetes (GKE).
- Implement Infrastructure as Code (IaC) using Terraform or Google Deployment Manager.
- Ensure system reliability, observability, and performance monitoring.
- Drive cloud cost optimization, governance, and compliance.
- Evaluate and integrate emerging DevOps and AI / ML tools and technologies.
Technical Skills :
Cloud Expertise : Deep experience with Google Cloud Platform (GCP) services (Compute Engine, GKE, Cloud Functions, Vertex AI, BigQuery, etc.).DevOps Tools : Jenkins, GitLab CI / CD, ArgoCD, or similar.IaC : Terraform, Google Deployment Manager, Ansible.Containers & Orchestration : Docker, Kubernetes (GKE preferred), Helm.Monitoring & Logging : Google Cloud Operations Suite (formerly Stackdriver), Prometheus.Scripting & Automation : Python, Bash, NodeJs.AI / ML & GenAI : Experience with Vertex AI, Kubeflow, MLflow, LangChain, Hugging Face, or OpenAI APIs.Security & Compliance : IAM, VPC, service accounts, secrets management.Non-Technical Skills :
Leadership : Proven ability to lead cross-functional teams and manage complex projects.Strategic Thinking : Align DevOps and cloud strategies with business and AI / ML goals.Collaboration : Strong interpersonal skills to work with engineering, data science, and product teams.Communication : Clear and effective communication with both technical and non-technical stakeholders.Problem Solving : Analytical mindset with a proactive approach to troubleshooting and innovation.Agility : Comfortable working in fast-paced, evolving environments using Agile / Scrum methodologies.Preferred Qualifications :
Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.12+ years of experience in DevOps / cloud roles, with 3+ years in a leadership capacity.GCP certifications (e.g., Professional Cloud DevOps Engineer, Cloud Architect).Experience deploying and managing AI / ML and GenAI models in production.Familiarity with data governance, model lifecycle management, and responsible AI practices.