Talent.com
AI Infrastructure Engineer
AI Infrastructure EngineerCostco IT • Hyderabad, Republic Of India, IN
AI Infrastructure Engineer

AI Infrastructure Engineer

Costco IT • Hyderabad, Republic Of India, IN
30+ days ago
Job description

About Costco Wholesale

Costco Wholesale is a multi-billion-dollar global retailer with warehouse club operations in eleven countries. They provide a wide selection of quality merchandise, plus the convenience of specialty departments and exclusive member services, all designed to make shopping a pleasurable experience for their members.

About Costco Wholesale India

At Costco Wholesale India, we foster a collaborative space, working to support Costco Wholesale in developing innovative solutions that improve members’ experiences and make employees’ jobs easier. Our employees play a key role in driving and delivering innovation to establish IT as a core competitive advantage for Costco Wholesale.

Position Title : ML Ops Engineer 4

Job Description :

Roles & Responsibilities :

  • Define the long-term vision and strategy for MLOps initiatives : Set the direction for the organization’s MLOps, model deployment, and monitoring practices.
  • Lead and manage a team of MLOps engineers : Provide technical guidance, mentorship, and career development for team members.
  • Identify and explore cutting-edge research areas and technologies : Stay abreast of the latest advancements in MLOps, model serving, and AI operations.
  • Drive innovation and the development of novel MLOps solutions : Lead efforts, prototype new approaches, and oversee implementation of advanced MLOps platforms.
  • Design and manage scalable ML infrastructure and pipelines on GCP;

oversee model deployment (A / B testing, rollouts / rollbacks, auto-scaling), and establish monitoring / observability (performance, drift, KPIs).

  • Ensure ML operations meet governance, security, compliance, and disaster recovery standards across the organization.
  • Collaborate with executive leadership on strategic decision-making : Align MLOps initiatives with business objectives and organizational priorities.
  • Establish and enforce MLOps standards and best practices : Ensure quality, reproducibility, and security of ML systems across the organization.
  • Represent the organization in external MLOps communities : Speak at conferences, publish thought leadership, and build partnerships with academia and industry.
  • Technical Skills :

  • 12+ - years of experience
  • Mastery of relevant technical skills : Deep expertise in MLOps, model deployment, monitoring, and governance.
  • Significant experience in designing and implementing complex MLOps systems at scale : Lead the architecture and deployment of large-scale MLOps platforms on GCP.
  • Hands-on experience architecting large-scale ML platforms on GCP (Vertex AI, GKE, Dataflow, Big Query, Pub / Sub, Cloud Composer), implementing experiment tracking (MLflow, Weights & Biases, TensorBoard), feature stores (Vertex AI), data pipelines and workflow orchestration, and ensuring cloud security, compliance, disaster recovery, and cost optimization.
  • Strong leadership and team management skills : Build, mentor, and lead high-performing MLOps teams.
  • Excellent strategic thinking and problem-solving abilities : Translate business challenges into scalable, reliable MLOps solutions.
  • Exceptional communication and influencing skills : Advocate for MLOps initiatives, and influence executive decisions and represent the organization externally through conferences, publications, and industry engagement.
  • Must Have Skills :

  • Deep expertise in MLOps, model deployment, monitoring, and governance
  • Experience building scalable MLOps platforms on GCP
  • Proficiency with CI / CD for ML, containerization (e.G. Docker, Kubernetes), IaC (Terraform), and orchestration
  • Leadership in MLOps strategy, standards, and cross-team collaboration
  • Hands-on expertise with GCP ML and data services (Vertex AI, Dataflow, BigQuery, Pub / Sub, Cloud Composer, GKE).
  • Experience implementing model observability (performance monitoring, drift detection, dashboards, and alerts).
  • Proficiency with experiment tracking (MLflow, W&B) and feature store management.
  • Knowledge of cloud security, compliance, and cost optimization strategies.
  • Create a job alert for this search

    Infrastructure Engineer • Hyderabad, Republic Of India, IN

    Related jobs
    Cloud Engineer- AI / ML

    Cloud Engineer- AI / ML

    Intuition IT – Intuitive Technology Recruitment • Hyderabad, IN
    Design, deploy, and manage scalable ML and GenAI workloads using AWS services, including SageMaker Studio and Bedrock.Implement and maintain infrastructure using AWS Lambda, EKS, ECS on Fargate, an...Show more
    Last updated: 23 hours ago • Promoted
    Cloud Infrastructure Engineer

    Cloud Infrastructure Engineer

    The Goodyear Tire & Rubber Company • Hyderabad, Republic Of India, IN
    Proven experience building and scaling.Terraform, GitHub Actions, CI / CD pipelines.Salesforce platform integration.Experience Cloud, Data Cloud, APIs). Experience guiding multiple squads from.Experie...Show more
    Last updated: 30+ days ago • Promoted
    AI Infrastructure Architect

    AI Infrastructure Architect

    Anonymous • Hyderabad, Republic Of India, IN
    Our mission is to build not another AI application, but the substrate on which intelligence itself.We seek a Chief Architect to define the structural spine of this platform.Architect a platform whe...Show more
    Last updated: 30+ days ago • Promoted
    CP4D Infrastructure Engineer

    CP4D Infrastructure Engineer

    Tixy Tech • Hyderabad, Republic Of India, IN
    Location : Hyderabad, Bangalore, Chennai, Coimbatore, Pune, Kochi.We are looking for a CP4D Platform Engineer responsible for deploying, managing, and supporting IBM Cloud Pak for Data (CP4D) enviro...Show more
    Last updated: 11 hours ago • Promoted • New!
    AI Engineer

    AI Engineer

    Aura Recruitment Solutions • hyderabad, telangana, in
    Pay starts from 150,000 INR per Month.We’re hiring on behalf of our client, a fast-growing, AI-first company building cutting-edge AI-native applications that transform complex, real-world data int...Show more
    Last updated: 3 days ago • Promoted
    Infrastructure Engineer - Tier3

    Infrastructure Engineer - Tier3

    NEXPLAY SECURE • secunderabad, telangana, in
    The Infrastructure Engineer (Tier III, remote) serves as the senior technical authority within Nexplay Secure's Managed Services division. This role leads the deployment and ongoing support of criti...Show more
    Last updated: 30+ days ago • Promoted
    Infrastructure Engineer

    Infrastructure Engineer

    SHI International Corp. • Hyderabad, Telangana, India
    HPE, DELL, Nutanix and IBM Hardware, apple / mac / Storage.We are seeking a highly skilled, near expert-level Infrastructure Engineer to join our Managed Service Provider (MSP) support team.The ideal ...Show more
    Last updated: 8 hours ago • Promoted • New!
    AI Infrastructure Engineer

    AI Infrastructure Engineer

    INSPYR Solutions • Hyderabad, Republic Of India, IN
    MLOps Engineer II ( Mid-Senior-Level).Remote (Night Shift – 10 PM to 7 AM CST).Proficient MLOps engineer capable of independently managing production model deployments, pipelines, and infrastructur...Show more
    Last updated: 30+ days ago • Promoted
    Platform Infrastructure Engineer

    Platform Infrastructure Engineer

    SHI International Corp. • Hyderabad, Republic Of India, IN
    HPE, DELL, Nutanix and IBM Hardware, apple / mac / Storage.We are seeking a highly skilled, near expert-level Infrastructure Engineer to join our Managed Service Provider (MSP) support team.The ideal ...Show more
    Last updated: 11 hours ago • Promoted • New!
    Infrastructure Engineer

    Infrastructure Engineer

    People Prime Worldwide • Hyderabad, Telangana, India
    Important Note (Please Read Before Applying).You have less than 8 years or more than 15 years of hands-on Infrastructure Engineering / Cloud Architecture experience. You do not have strong AWS exper...Show more
    Last updated: 30+ days ago • Promoted
    AI Infrastructure Engineer

    AI Infrastructure Engineer

    WhiteLotus Talent Partners • Hyderabad, Republic Of India, IN
    Job Title- AI Platform Engineering.Location- Bangalore, Hyderabad.We have an immediate requirement for a strong AI Platform Engineering professional in Bangalore / Hyd preferably, and we’re actively ...Show more
    Last updated: 1 day ago • Promoted
    Cloud Infrastructure Engineer

    Cloud Infrastructure Engineer

    ADP • Hyderabad, Republic Of India, IN
    Bachelor’s degree in engineering or computer science or an equivalent combination of education and experience.Lead the design and implementation of scalable, secure, and resilient cloud infrastruct...Show more
    Last updated: 10 days ago • Promoted
    Cloud Infrastructure Architect

    Cloud Infrastructure Architect

    ValueLabs • Hyderabad, Republic Of India, IN
    We are looking at expanding our Cloud COE group to support multiple cloud environments and looking for one Azure architect (hands on) and one AWS architect (hands on). Here are the stacks we use, an...Show more
    Last updated: 9 days ago • Promoted
    Infrastructure Solutions Architect

    Infrastructure Solutions Architect

    BayOne Solutions • secunderabad, telangana, in
    Systems or Solutions Architect.IaaS), and cloud-scale system design.The ideal candidate combines strong fundamentals in.Kubernetes, observability, and automation. You’ll design scalable systems that...Show more
    Last updated: 10 days ago • Promoted
    Cloud Infrastructure Engineer

    Cloud Infrastructure Engineer

    Foodsmart • Hyderabad, Republic Of India, IN
    Foodsmart is the leading telenutrition and foodcare solution, backed by a robust network of Registered Dietitians.Our platform is designed to foster healthier food choices, drive lasting behavior c...Show more
    Last updated: 30+ days ago • Promoted
    AI Infrastructure Engineer

    AI Infrastructure Engineer

    ValueMomentum • Hyderabad, Republic Of India, IN
    Evaluate and source appropriate cloud infrastructure solutions for machine learning needs, ensuring cost-effectiveness and scalability based on project requirements. Automate and manage the deployme...Show more
    Last updated: 30+ days ago • Promoted
    AI Infrastructure Engineer

    AI Infrastructure Engineer

    Stealth AI Startup • Hyderabad, Republic Of India, IN
    Job Role : Backend Engineer – AI Applications.AI Infrastructure and Applications.Generative AI, Agentic Frameworks, Cloud Platforms, and AI Infrastructure. If you are passionate about building robus...Show more
    Last updated: 30+ days ago • Promoted
    Lead Cloud Infrastructure Engineer

    Lead Cloud Infrastructure Engineer

    Cognida.ai • Hyderabad, Republic Of India, IN
    Drive revenue growth, increase profitability and improve operational efficiencies.Forever curious, always on the front lines of technological advancements. Applying our latest learnings, and tools t...Show more
    Last updated: 30+ days ago • Promoted