Talent.com
Senior MLOps / AIOps Platform Engineer - MLflow, GCP, Vertex AI, IBM Watsonx, Terraform

Senior MLOps / AIOps Platform Engineer - MLflow, GCP, Vertex AI, IBM Watsonx, Terraform

ConfidentialChennai, India
5 days ago
Job description

Before you apply to a job, select your language preference from the options available at the top right of this page.

Explore your next opportunity at a Fortune Global 500 organization. Envision innovative possibilities, experience our rewarding culture, and work with talented teams that help you become better every day. We know what it takes to lead UPS into tomorrow—people with a unique combination of skill + passion. If you have the qualities and drive to lead yourself or teams, there are roles ready to cultivate your skills and take you to the next level.

Job Description

Job Summary

We are seeking a Senior MLOps / AIOps Platform Engineer with deep DevSecOps expertise and hands-on experience managing enterprise-grade AI / ML platforms. This critical role focuses on building, configuring, and operationalizing secure, scalable, and reusable infrastructure and pipelines that support AI and ML initiatives across the enterprise. The ideal candidate will have a strong background in Infrastructure as Code (IaC), pipeline automation, and platform engineering, with specific experience configuring and maintaining IBM watsonx and Google Cloud Vertex AI environments.

Key Responsibilities

Platform Engineering & Operations

  • Lead the provisioning, configuration, and ongoing support of IBM watsonx and Google Cloud Vertex AI platforms.
  • Ensure platforms are production-ready, secure, cost-efficient, and performant across training, inference, and orchestration workflows.
  • Manage lifecycle tasks such as patching, upgrades, integrations, and service reliability.
  • Partner with security, compliance, and product teams to align platforms with enterprise and regulatory standards.

Enterprise MLOps / AIOps Enablement

  • Define and implement standardized MLOps / AIOps practices across business units for consistency and scalability.
  • Build and maintain reusable workflows for model development, deployment, retraining, and monitoring.
  • Provide onboarding, enablement, and support to AI / ML teams adopting enterprise platforms and tools.
  • Support development / deployment of GenAI applications and maintain them at an Enterprise scale.
  • DevSecOps Integration

  • Embed security and compliance guardrails across the ML lifecycle, including CI / CD pipelines and IaC templates.
  • Implement policy-as-code, access controls, vulnerability scanning, and automated compliance checks.
  • Ensure all deployments meet enterprise and regulatory requirements (HIPAA, SOX, FedRAMP, etc.).
  • Infrastructure as Code & Automation

  • Design and maintain IaC templates (Terraform, Pulumi, Ansible, CloudFormation) for reproducible ML infrastructure.
  • Build and optimize CI / CD pipelines for AI / ML assets including data pipelines, training workflows, deployment artifacts, and monitoring systems.
  • Enforce best practices around automation, reusability, and observability of infrastructure and workflows.
  • Monitoring, Logging & Observability

  • Implement comprehensive observability for AI / ML workloads using Prometheus, Grafana, Stackdriver, or Datadog.
  • Monitor both infrastructure health (CPU, memory, cost) and ML-specific metrics (model drift, data integrity, anomaly detection).
  • Define KPIs and usage metrics to measure platform performance, adoption, and operational health.
  • Qualifications

    Education

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
  • Experience

  • 5+ years in MLOps, DevOps, Platform Engineering, or Infrastructure Engineering.
  • 2+ years applying DevSecOps practices (secure CI / CD, vulnerability management, policy enforcement).
  • Hands-on experience configuring and managing enterprise AI / ML platforms (IBM watsonx, Google Vertex AI).
  • Demonstrated success in building and scaling ML infrastructure, automation pipelines, and platform support models.
  • Technical Skills

  • Proficiency with IaC tools (Terraform, Pulumi, Ansible, CloudFormation).
  • Strong scripting skills in Python and Bash.
  • Deep understanding of containerization and orchestration (Docker, Kubernetes).
  • Experience with model lifecycle tools (MLflow, TFX, Vertex Pipelines, or equivalents).
  • Familiarity with secrets management, policy-as-code, access control, and monitoring tools.
  • Working knowledge of data engineering concepts and their integration into ML pipelines.
  • Preferred

  • Cloud certifications (e.g., GCP Professional ML Engineer, AWS DevOps Engineer, IBM Cloud AI Engineer).
  • Experience supporting platforms in regulated industries (HIPAA, FedRAMP, SOX, PCI-DSS).
  • Contributions to open-source projects in MLOps, automation, or DevSecOps.
  • Familiarity with responsible AI practices including governance, fairness, interpretability, and explainability.
  • Hands-on experience with enterprise feature stores, model monitoring frameworks, and fairness toolkits.
  • Employee Type

    Permanent

    UPS is committed to providing a workplace free of discrimination, harassment, and retaliation.

    Skills Required

    Cloudformation, Prometheus, Bash, Pulumi, Grafana, Datadog, Terraform, Docker, Ansible, Python, Kubernetes

    Create a job alert for this search

    Senior Platform Engineer • Chennai, India

    Related jobs
    • Promoted
    • New!
    MLOps Engineer

    MLOps Engineer

    Yotta Data Services Private Limitedchennai, India
    We’re looking for a strategic Senior MLOps Engineer to lead the end-to-end design, implementation, and scaling of our AI infrastructure. You’ll partner with researchers, product teams, and DevOps to...Show moreLast updated: 20 hours ago
    • Promoted
    Senior MLOps Engineer

    Senior MLOps Engineer

    Mitchell Martin Inc.Chennai, IN
    Include, but are not limited to, the following : .Own productionizing models—from tracked experiments to governed releases—ensuring resilient services with clear SLOs, runbooks, and fast, safe rollba...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Cloud IAM Engineer (AWS / Okta)

    Senior Cloud IAM Engineer (AWS / Okta)

    Vertex AgilityChennai, IN
    Senior Cloud IAM Engineer (AWS / Okta) – Remote.Vertex Agility | Agile On-Demand Solutions.Are you passionate about identity management and cloud security? Vertex Agility is looking for a Senior Cl...Show moreLast updated: 2 days ago
    • Promoted
    MLOPs

    MLOPs

    MerilChennai, Tamil Nadu, India
    We are seeking an experienced MLOps Engineer with a focus on healthcare AI to join our innovative team.This role involves designing, implementing, and maintaining scalable MLOps pipelines for deplo...Show moreLast updated: 10 days ago
    • Promoted
    Senior / Lead Engineer - DevOps (AWS / Azure / GCP)

    Senior / Lead Engineer - DevOps (AWS / Azure / GCP)

    QBurstchennai, tamil nadu, in
    We are seeking an experienced and versatile DevOps Engineer.The ideal candidate will have hands-on experience with CI / CD pipelines, Kubernetes, Linux systems, monitoring / logging tools, and Infrastr...Show moreLast updated: 15 days ago
    • Promoted
    GenAI Platform Engineer - NLP / LLM

    GenAI Platform Engineer - NLP / LLM

    NasugroupChennai
    Description : We are looking for a skilled Gen AI Platform Engineer to join our team.The ideal candidate will have 10 years of experience in managing LLM-based syste...Show moreLast updated: 30+ days ago
    • Promoted
    MLOps Engineer - Google Cloud Platform

    MLOps Engineer - Google Cloud Platform

    Confidential CompanyChennai
    About the role : We are looking for a MLOps Engineer who will work on a broad range of cutting-edge data analytics and machine learning problems across a variety of industries....Show moreLast updated: 13 days ago
    • Promoted
    DevOps / Platform Engineer

    DevOps / Platform Engineer

    iVedha Inc.Chennai, IN
    Hiring a seasoned DevOps / Platform Engineer to drive automation, platform reliability, and robust.Design, deploy, and manage CI / CD pipelines and infrastructure automation, leveraging AI for.Implemen...Show moreLast updated: 30+ days ago
    • Promoted
    Senior MLOps / DevOps Engineer

    Senior MLOps / DevOps Engineer

    YubiChennai, Tamil Nadu, India
    Yubi, formerly known as CredAvenue, is re-defining global debt markets by freeing the flow of finance between borrowers, lenders, and investors. We are the world's possibility platform for the disco...Show moreLast updated: 1 day ago
    • Promoted
    AiOps Engineer

    AiOps Engineer

    L&T Technology Serviceschennai, tamil nadu, in
    Only immediate to 15 days joiner.Develop and Deploy AI Solutions : .Design, build, and deploy end-to-end Machine Learning and Generative AI pipelines on. Google Cloud Platform, using Vertex AI service...Show moreLast updated: 20 days ago
    • Promoted
    Deployment Engineer

    Deployment Engineer

    Tata Consultancy ServicesChennai, Tamil Nadu, India
    Experience in deploying complex, multi-component systems, preferably in cloud environments (Azure, AWS).Deep understanding of AI models, especially LLMs, and the infrastructure required to support ...Show moreLast updated: 11 days ago
    • Promoted
    AI ML Engineer

    AI ML Engineer

    GiggsoChennai, Tamil Nadu, India
    Founded in 2018, Giggso is a responsible AI platform for enterprise operations with security and automations.Giggso provides a single integrated platform for AI Agent orchestration, AI governance, ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    SRE & DevOps Engineer (ML / AI Platform)

    SRE & DevOps Engineer (ML / AI Platform)

    Prospance Incchennai, India
    SRE & DevOps Engineer (ML / AI Platform).Contract Position | Global E-Commerce Leader | Hybrid.SRE & DevOps Engineer to join their AI Platform Team. This is your chance to shape the future of machine ...Show moreLast updated: 20 hours ago
    • Promoted
    Sr. ML / Ops Developer

    Sr. ML / Ops Developer

    GarudaUAVchennai, tamil nadu, in
    To build and maintain robust ML pipelines and scalable deployment architectures for satellite, drone, LiDAR and temporal-based AI models, supporting data versioning, training workflows, and CI / CD f...Show moreLast updated: 1 day ago
    • Promoted
    MLOps Lead Engineer

    MLOps Lead Engineer

    RecroChennai, IN
    Experience with Azure services such as Azure AI services, Azure Search, Azure ML, Databricks, Azure Kubernetes Service, and AWS services like AWS SageMaker, AWS Bedrock and AWS Lambda.Exposure to G...Show moreLast updated: 22 days ago
    • Promoted
    Capgemini - MLOps Engineer

    Capgemini - MLOps Engineer

    Capgemini Technology Services India LimitedChennai
    Your Role : - Design, implement, and maintain end-to-end ML pipelines for model training, evaluation, and deployment &...Show moreLast updated: 30+ days ago
    • Promoted
    Senior AI-ML Engineer

    Senior AI-ML Engineer

    Myridius x Aethereuschennai, tamil nadu, in
    Aethereus is now part of Myridius, formerly known as RCG Global Services.This strategic integration combines Aethereus’ expertise in cutting-edge solutions with Myridius’ legacy of delivering trans...Show moreLast updated: 14 days ago
    • Promoted
    Senior AI / ML Engineer

    Senior AI / ML Engineer

    RingCentralchennai, tamil nadu, in
    We are seeking an experienced AI Engineer with a strong background in Natural Language Understanding (NLU) who is passionate about pushing the boundaries of Conversational AI.In this role, you will...Show moreLast updated: 1 day ago