ML Ops Engineer 4 - GCP [T500-20226]

Costco IT
Hyderabad, Telangana, India
9 hours ago
Job description

About Costco Wholesale

Costco Wholesale is a multi-billion-dollar global retailer with warehouse club operations in eleven countries. They provide a wide selection of quality merchandise, plus the convenience of specialty departments and exclusive member services, all designed to make shopping a pleasurable experience for their members.

About Costco Wholesale India

At Costco Wholesale India, we foster a collaborative space, working to support Costco Wholesale in developing innovative solutions that improve members’ experiences and make employees’ jobs easier. Our employees play a key role in driving and delivering innovation to establish IT as a core competitive advantage for Costco Wholesale.

Position Title : ML Ops Engineer 4

Job Description :

Roles & Responsibilities :

  • Define the long-term vision and strategy for MLOps initiatives : Set the direction for the organization’s MLOps, model deployment, and monitoring practices.
  • Lead and manage a team of MLOps engineers : Provide technical guidance, mentorship, and career development for team members.
  • Identify and explore cutting-edge research areas and technologies : Stay abreast of the latest advancements in MLOps, model serving, and AI operations.
  • Drive innovation and the development of novel MLOps solutions : Lead efforts, prototype new approaches, and oversee implementation of advanced MLOps platforms.
  • Design and manage scalable ML infrastructure and pipelines on GCP; oversee model deployment (A/B testing, rollouts/rollbacks, auto-scaling), and establish monitoring/observability (performance, drift, KPIs).
  • Ensure ML operations meet governance, security, compliance, and disaster recovery standards across the organization.
  • Collaborate with executive leadership on strategic decision-making : Align MLOps initiatives with business objectives and organizational priorities.
  • Establish and enforce MLOps standards and best practices : Ensure quality, reproducibility, and security of ML systems across the organization.
  • Represent the organization in external MLOps communities : Speak at conferences, publish thought leadership, and build partnerships with academia and industry.

Technical Skills :

  • 12+ years of experience
  • Mastery of relevant technical skills : Deep expertise in MLOps, model deployment, monitoring, and governance.
  • Significant experience in designing and implementing complex MLOps systems at scale : Lead the architecture and deployment of large-scale MLOps platforms on GCP.
  • Hands-on experience architecting large-scale ML platforms on GCP (Vertex AI, GKE, Dataflow, BigQuery, Pub/Sub, Cloud Composer), implementing experiment tracking (MLflow, Weights & Biases, TensorBoard), feature stores (Vertex AI), data pipelines and workflow orchestration, and ensuring cloud security, compliance, disaster recovery, and cost optimization.
  • Strong leadership and team management skills : Build, mentor, and lead high-performing MLOps teams.
  • Excellent strategic thinking and problem-solving abilities : Translate business challenges into scalable, reliable MLOps solutions.
  • Exceptional communication and influencing skills : Advocate for MLOps initiatives, influence executive decisions, and represent the organization externally through conferences, publications, and industry engagement.
Must Have Skills :

  • Deep expertise in MLOps, model deployment, monitoring, and governance
  • Experience building scalable MLOps platforms on GCP
  • Proficiency with CI/CD for ML, containerization (e.g., Docker, Kubernetes), IaC (Terraform), and orchestration
  • Leadership in MLOps strategy, standards, and cross-team collaboration
  • Hands-on expertise with GCP ML and data services (Vertex AI, Dataflow, BigQuery, Pub/Sub, Cloud Composer, GKE).
  • Experience implementing model observability (performance monitoring, drift detection, dashboards, and alerts).
  • Proficiency with experiment tracking (MLflow, W&B) and feature store management.
  • Knowledge of cloud security, compliance, and cost optimization strategies.