Talent.com
MLOps Engineer - Billion Dollar US Enterprise Software - Hiring in India!

CareerXperts Consulting, Salem, India
2 days ago
Job description

Role Focus: Production ML Systems | GPU Orchestration | Inference at Scale

What You'll Actually Do (Not Buzzwords)

Infrastructure That Doesn't Break

  • Design and maintain the backbone for training, fine-tuning, and deploying ML models that actually work in production
  • Orchestrate GPU workloads on Kubernetes (EKS) with node autoscaling, intelligent bin-packing, and cost-aware scheduling (spot instances, preemptibles—you know the drill)
  • Build CI/CD pipelines that handle ML code, data versioning, and model artifacts like a well-oiled machine (GitHub Actions, Argo Workflows, Terraform)

Production ML, Not Science Projects

  • Partner with Data Scientists and ML Engineers to turn Jupyter notebooks into production-grade systems
  • Deploy and scale inference backends (vLLM, Hugging Face, NVIDIA Triton) that serve real traffic
  • Optimize GPU utilization because every idle A100 hour is money burning
  • Build observability that actually tells you why things broke (Prometheus, Grafana, OpenTelemetry)

Ship Fast, Sleep Well

  • Create tooling for seamless model deployment, instant rollback, and A/B testing
  • Lead incident response when production AI systems decide to have opinions
  • Work with security and compliance teams to implement best practices without slowing down innovation

What We're Really Looking For

Must-Haves (No Negotiation)

  • 5+ years in MLOps, infrastructure, or platform engineering: you've been in the trenches
  • Production ML experience: At least one project that's serving real users, not a Kaggle competition
  • Kubernetes expertise with GPUs: You understand taints, tolerations, affinity rules, and why GPU scheduling is its own special hell
  • Cloud-native architecture (AWS preferred): You think in VPCs, IAM roles, and cost optimization
  • Training pipeline experience: Set up or scaled training/fine-tuning for ML models in production (PyTorch Lightning, Hugging Face Accelerate, DeepSpeed)
  • IaC fluency: Terraform, Helm, Kustomize are second nature
  • Python engineering skills: You can debug a distributed training failure and fix it
  • Inference scaling: You've deployed and scaled inference workloads and lived to tell the tale

The "We're Very Interested" Signals

  • You mention scaling inference and we can see the fire in your eyes
  • You've used MLflow, W&B, or SageMaker Experiments and have opinions on which is best
  • You understand CI/CD for ML and why it's different from regular software
  • You've built monitoring systems that caught issues before users did

Nice to Have (But Seriously Nice)

  • GPU scheduling wizardry in Kubernetes
  • Model drift monitoring and versioning tools
  • Low-latency inference optimization (quantization, FP8, TensorRT—the good stuff)
  • Experience in compliance or regulated industries where "just ship it" isn't an option

What Makes This Role Different

Ownership. You're not a ticket-taker or a consultant passing through. You'll own infrastructure that powers real AI products, make architectural decisions that matter, and have the autonomy to build things the right way.

Impact. Your work directly affects model training speed, inference latency, GPU costs, and system reliability. You'll see the results of your optimizations in dollars saved and milliseconds gained.

Quality over speed. We value security, operational excellence, and sustainable systems. No "move fast and break things" chaos here. We move deliberately and build things that last.

The Reality Check

This role is not for you if:

  • You prefer working on proofs-of-concept over production systems
  • You think "it works on my machine" is an acceptable answer
  • You haven't shipped ML systems to production
  • You're looking for pure research or pure DevOps (this is the intersection)

This role is for you if:

  • You get excited about making GPUs go brrr efficiently
  • You've been on call for ML systems and learned hard lessons
  • You believe infrastructure is a product, not an afterthought
  • You want to build the foundation for AI that actually works

Write to to get connected!
