Lead Software Engineer -AI Ops / ML OPS / Python & Kubernates
IT (Information Technology) Permanent contract Bangalore, India Hybrid Reference 25000CV3 Start date 2025 / 07 / 31 Publication date 2025 / 07 / 17
Responsibilities
Key Responsibilities :
As a AI Ops Expert , Responsible and full ownership for the deliverables with greater defined quality standards with defined timeline and budget
- Manage AI model lifecycle, versioning, and monitoring in production environments. Build resilient MLOps pipelines and ensure compliance with governance standards.
- Design, implement, and manage AIops solutions to automate and optimize AI / ML workflows.
- Build and scale containerized solutions using Kubernetes.
- Collaborate with data scientists, engineers, and other stakeholders to ensure seamless integration of AI / ML models into production.
- Monitor and maintain the health and performance of AI / ML systems.
- Develop and maintain CI / CD pipelines for AI / ML models.
- Implement best practices for model versioning, testing, and deployment.
- Troubleshoot and resolve issues related to AI / ML infrastructure and workflows.
- Stay up-to-date with the latest AI OPs, MLOps, and Kubernetes tools and technologies.
Profile required
Requirements and skills
Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.8+ year of relevant experienceProven experience in AIops, MLOps, or related fields.Strong proficiency in Python and experience with Fast API.Strong hands-on expertise on Docker and Kubernetes (Or AKS)Hands-on experience with MS Azure and its AI / ML services, including Azure ML Flow.Proficiency in using DevContainer for development.Knowledge of CI / CD tools such as Jenkins, Argo CD, Helm, GitHub Actions, or Azure DevOps.Experience with containerization and orchestration tools like Docker and Kubernetes.Experience with Infrastructure as code (Terraform or equivalent)Strong problem-solving skills and the ability to work in a fast-paced environment.Excellent communication and collaboration skills.Preferred Skills :
Experience with machine learning frameworks such as TensorFlow, PyTorch, or scikit-learn.Familiarity with data engineering tools like Apache Kafka, Apache Spark, or similar.Knowledge of monitoring and logging tools such as Prometheus, Grafana, or ELK stack.Understanding of data versioning tools like DVC or MLflow.Experience with infrastructure as code (IaC) tools like Terraform or Ansible.Proficiency in Azure-specific tools and services, such as :Azure Machine Learning (Azure ML)Azure DevOpsAzure Kubernetes Service (AKS)Azure FunctionsAzure Logic AppsAzure Data FactoryAzure Monitor and Application InsightsWhy join us
We are committed to creating a diverse environment and are proud to be an equal opportunity employer. All qualified applicants receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status”.