Talent.com
ML Ops Engineer 4 - GCP [T500-20226]

ML Ops Engineer 4 - GCP [T500-20226]

Costco ITthiruvananthapuram, kerala, in
11 hours ago
Job description

About Costco Wholesale

Costco Wholesale is a multi-billion-dollar global retailer with warehouse club operations in eleven countries. They provide a wide selection of quality merchandise, plus the convenience of specialty departments and exclusive member services, all designed to make shopping a pleasurable experience for their members.

About Costco Wholesale India

At Costco Wholesale India, we foster a collaborative space, working to support Costco Wholesale in developing innovative solutions that improve members’ experiences and make employees’ jobs easier. Our employees play a key role in driving and delivering innovation to establish IT as a core competitive advantage for Costco Wholesale.

Position Title : ML Ops Engineer 4

Job Description :

Roles & Responsibilities :

  • Define the long-term vision and strategy for MLOps initiatives : Set the direction for the organization’s MLOps, model deployment, and monitoring practices.
  • Lead and manage a team of MLOps engineers : Provide technical guidance, mentorship, and career development for team members.
  • Identify and explore cutting-edge research areas and technologies : Stay abreast of the latest advancements in MLOps, model serving, and AI operations.
  • Drive innovation and the development of novel MLOps solutions : Lead efforts, prototype new approaches, and oversee implementation of advanced MLOps platforms.
  • Design and manage scalable ML infrastructure and pipelines on GCP; oversee model deployment (A / B testing, rollouts / rollbacks, auto-scaling), and establish monitoring / observability (performance, drift, KPIs).
  • Ensure ML operations meet governance, security, compliance, and disaster recovery standards across the organization.
  • Collaborate with executive leadership on strategic decision-making : Align MLOps initiatives with business objectives and organizational priorities.
  • Establish and enforce MLOps standards and best practices : Ensure quality, reproducibility, and security of ML systems across the organization.
  • Represent the organization in external MLOps communities : Speak at conferences, publish thought leadership, and build partnerships with academia and industry.

Technical Skills :

  • 12+ - years of experience
  • Mastery of relevant technical skills : Deep expertise in MLOps, model deployment, monitoring, and governance.
  • Significant experience in designing and implementing complex MLOps systems at scale : Lead the architecture and deployment of large-scale MLOps platforms on GCP.
  • Hands-on experience architecting large-scale ML platforms on GCP (Vertex AI, GKE, Dataflow, Big Query, Pub / Sub, Cloud Composer), implementing experiment tracking (MLflow, Weights & Biases, TensorBoard), feature stores (Vertex AI), data pipelines and workflow orchestration, and ensuring cloud security, compliance, disaster recovery, and cost optimization.
  • Strong leadership and team management skills : Build, mentor, and lead high-performing MLOps teams.
  • Excellent strategic thinking and problem-solving abilities : Translate business challenges into scalable, reliable MLOps solutions.
  • Exceptional communication and influencing skills : Advocate for MLOps initiatives, and influence executive decisions and represent the organization externally through conferences, publications, and industry engagement.
  • Must Have Skills :

  • Deep expertise in MLOps, model deployment, monitoring, and governance
  • Experience building scalable MLOps platforms on GCP
  • Proficiency with CI / CD for ML, containerization (e.g. Docker, Kubernetes), IaC (Terraform), and orchestration
  • Leadership in MLOps strategy, standards, and cross-team collaboration
  • Hands-on expertise with GCP ML and data services (Vertex AI, Dataflow, BigQuery, Pub / Sub, Cloud Composer, GKE).
  • Experience implementing model observability (performance monitoring, drift detection, dashboards, and alerts).
  • Proficiency with experiment tracking (MLflow, W&B) and feature store management.
  • Knowledge of cloud security, compliance, and cost optimization strategies.
  • Create a job alert for this search

    Ml Engineer • thiruvananthapuram, kerala, in

    Related jobs
    • Promoted
    Ml Ops

    Ml Ops

    EXLKollam, Republic Of India, IN
    Deploy, monitor, and scale ML models on.GCP (Vertex AI, GKE, Cloud Functions).GitHub Actions / Jenkins / cloud-native tools. Containerize and orchestrate workloads with.MLflow, Feast, Prometheus / Gra...Show moreLast updated: 6 days ago
    • Promoted
    • New!
    Senior Mlops Engineer

    Senior Mlops Engineer

    Atomic NorthThiruvananthapuram, Republic Of India, IN
    Years (with at least 2+ years in MLOps or ML deployment roles).Bengaluru, Bhopal, Gurgaon, Hyderabad, Jaipur, Mumbai, Pune, Chennai. The ideal candidate will have strong expertise in cloud infrastru...Show moreLast updated: less than 1 hour ago
    • Promoted
    ML Engineer || Contract Job || 8-15 Years Experience

    ML Engineer || Contract Job || 8-15 Years Experience

    People Prime Worldwidekollam, kerala, in
    Our Client is a global IT services company headquartered in Southborough, Massachusetts, USA.Founded in 1996, with a revenue of $1. B, with 35,000+ associates worldwide, specializes in digital engin...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Ml Ops Engineer

    Ml Ops Engineer

    People Prime WorldwideThiruvananthapuram, Republic Of India, IN
    Our client is a trusted global innovator of IT and business services.They help clients transform through consulting, industry solutions, business process services, digital & IT modernization and ma...Show moreLast updated: less than 1 hour ago
    • Promoted
    • New!
    ML Ops Engineer

    ML Ops Engineer

    People Prime WorldwideKollam, IN
    Our client is a trusted global innovator of IT and business services.They help clients transform through consulting, industry solutions, business process services, digital & IT modernization and ma...Show moreLast updated: 15 hours ago
    • Promoted
    MLOps Engineer

    MLOps Engineer

    Capgeminithiruvananthapuram, kerala, in
    Experience in developing MLOps framework cutting ML lifecycle : model development, training, evaluation, deployment, monitoring including Model Governance. Expert in Azure Databricks, Azure ML, Unity...Show moreLast updated: 4 days ago
    • Promoted
    Senior Ml Engineer

    Senior Ml Engineer

    Piramal FinanceThiruvananthapuram, Republic Of India, IN
    Build and operate end-to-end ML / AI pipelines (data → training → deployment → monitoring).Automate CI / CD for ML / AI with Jenkins, integrate MLflow for tracking and registry.Deploy scalable batch and ...Show moreLast updated: 5 days ago
    • Promoted
    • New!
    Ml Ops Engineer 4 - Gcp T500-20226

    Ml Ops Engineer 4 - Gcp T500-20226

    Costco ITThiruvananthapuram, Republic Of India, IN
    Costco Wholesale is a multi-billion-dollar global retailer with warehouse club operations in eleven countries.They provide a wide selection of quality merchandise, plus the convenience of specialty...Show moreLast updated: less than 1 hour ago
    • Promoted
    ML Ops

    ML Ops

    EXLKollam, IN
    Deploy, monitor, and scale ML models on.GCP (Vertex AI, GKE, Cloud Functions).GitHub Actions / Jenkins / cloud-native tools. Containerize and orchestrate workloads with.MLflow, Feast, Prometheus / Gra...Show moreLast updated: 30+ days ago
    • Promoted
    TechOps Engineer

    TechOps Engineer

    Aquanowthiruvananthapuram, kerala, in
    Aquanow is a trading and technology company powering the next generation of financial services.We’re at the forefront of the rapidly evolving digital asset space, empowering businesses to navigate ...Show moreLast updated: 24 days ago
    • Promoted
    Mlops Engineer

    Mlops Engineer

    CapgeminiThiruvananthapuram, Republic Of India, IN
    Experience in developing MLOps framework cutting ML lifecycle : model development, training, evaluation, deployment, monitoring including Model Governance. Expert in Azure Databricks, Azure ML, Unity...Show moreLast updated: 4 days ago
    • Promoted
    DevOps / Platform Engineer

    DevOps / Platform Engineer

    iVedha Inc.thiruvananthapuram, India
    Hiring a seasoned DevOps / Platform Engineer to drive automation, platform reliability, and robust.Design, deploy, and manage CI / CD pipelines and infrastructure automation, leveraging AI for.Implemen...Show moreLast updated: 21 days ago
    (ML OPNS Lead)Machine Learning Operations Lead Engineer

    (ML OPNS Lead)Machine Learning Operations Lead Engineer

    Epergne SolutionsThiruvananthapuram, Kerala, India
    Quick Apply
    Collaborate with AI / ML, Data Science, and DevOps teams to.Generative AI and Agentic AI ecosystems.Azure OpenAI, Bedrock, Anthropic Claude, and OpenAI API. Implement infrastructure-as-code (IaC) prac...Show moreLast updated: 28 days ago
    • Promoted
    Sr. ML / Ops Developer

    Sr. ML / Ops Developer

    GarudaUAVthiruvananthapuram, kerala, in
    To build and maintain robust ML pipelines and scalable deployment architectures for satellite, drone, LiDAR and temporal-based AI models, supporting data versioning, training workflows, and CI / CD f...Show moreLast updated: 6 days ago
    • Promoted
    Senior ML Engineer

    Senior ML Engineer

    Piramal Financekollam, kerala, in
    Build and operate end-to-end ML / AI pipelines (data → training → deployment → monitoring).Automate CI / CD for ML / AI with Jenkins, integrate MLflow for tracking and registry.Deploy scalable batch and ...Show moreLast updated: 5 days ago
    • Promoted
    Sr. Ml / Ops Developer

    Sr. Ml / Ops Developer

    GarudaUAVKollam, Republic Of India, IN
    To build and maintain robust ML pipelines and scalable deployment architectures for satellite, drone, LiDAR and temporal-based AI models, supporting data versioning, training workflows, and CI / CD f...Show moreLast updated: 6 days ago
    ML Ops Lead

    ML Ops Lead

    Epergne SolutionsTrivandrum, Kerala, India
    Quick Apply
    Roles & Responsibilities : -.Design, deploy, and manage scalable, secure, and automated ML / AI platforms across Azure and AWS. Lead MLOps lifecycle model deployment, monitoring, retraining, CI / C...Show moreLast updated: 20 days ago
    • Promoted
    OpenStack Operations Engineer

    OpenStack Operations Engineer

    KniTTThiruvananthapuram, Republic Of India, IN
    This role is ideal for candidates passionate about.Linux systems, and DevOps automation.Detecting, analyzing, and responding to security threats and anomalies. Monitor OpenStack infrastructure compo...Show moreLast updated: 4 days ago