Talent.com
This job offer is not available in your country.
MLOps Lead

MLOps Lead

TalentOnLease Pvt Ltd.Bangalore
13 hours ago
Job description

Job Summary :

We are seeking an experienced MLOps Engineer to join our team and take ownership of building and managing robust Machine Learning Operations pipelines. The role involves working on cloud platforms, automating ML workflows, monitoring deployed models, ensuring reproducibility, and implementing version control for both code and models. You will collaborate closely with data scientists, ML engineers, and DevOps teams to deliver scalable and reliable ML solutions in production.

Key Responsibilities :

  • Design, implement, and maintain end-to-end ML pipelines for training, testing, and deploying machine learning models in production.
  • Work with cloud platforms (AWS, GCP, Azure) to provision, manage, and optimize ML infrastructure.
  • Implement workflow orchestration using Kubeflow, Airflow, or MLFlow to automate model training, evaluation, and deployment.
  • Set up and manage model monitoring systems for performance tracking, drift detection, and health checks using tools like Prometheus, Grafana, Seldon, or Evidently AI.
  • Establish and enforce practices for model and code version control using Git and related tools to ensure reproducibility and traceability.
  • Collaborate with Data Science teams to productionize models, ensuring efficiency, scalability, and low-latency performance.
  • Work with DevOps teams to implement CI / CD pipelines for ML workflows.
  • Optimize compute, storage, and cost efficiency of ML workloads in cloud environments.
  • Troubleshoot technical issues related to deployment, scaling, and monitoring of ML models.
  • Maintain comprehensive documentation for ML pipelines, deployment processes, and monitoring dashboards.

Technical Skills and Qualifications :

  • Strong hands-on experience in AWS, GCP, or Azure cloud services for ML deployment and operations.
  • Proficiency with Kubeflow, MLFlow, or Airflow for workflow orchestration.
  • Good knowledge of ML model monitoring tools (Prometheus, Grafana, Seldon, Evidently AI).
  • Experience with Git for managing code repositories, branching, and versioning of ML models.
  • Solid understanding of Python, Docker, and Kubernetes for containerization and orchestration.
  • Good knowledge of CI / CD tools and pipelines for ML (Jenkins, GitHub Actions, or Azure DevOps).
  • Familiarity with data engineering workflows and integrating feature engineering pipelines.
  • Strong problem-solving, analytical thinking, and debugging skills.
  • Good-to-Have Skills :

  • Experience with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Pulumi to manage cloud resources.
  • Exposure to feature stores (e.g., Feast) and data versioning tools (e.g., DVC).
  • Knowledge of Generative AI concepts and LLM deployment, including fine-tuning and serving large models.
  • Experience with A / B testing frameworks for ML model evaluation in production.
  • Understanding of security, compliance, and governance in ML deployment.
  • Education :

  • Bachelor's or Master's degree in Computer Science, Data Science, Artificial Intelligence, or a related field.
  • (ref : hirist.tech)

    Create a job alert for this search

    Lead • Bangalore