Talent.com
AI Model Optimization & Deployment Lead

AI Model Optimization & Deployment Lead

Mulya TechnologiesBengaluru, Republic Of India, IN
4 hours ago
Job description

Principal Machine Learning Engineer - Multimodal AI & Inference

Bangalore

Founded in 2023,by Industry veterans HQ in California,US

  • We are revolutionizing sustainable AI compute through intuitive software with composable silicon

Overview :

You will design, optimize, and deploy large multimodal models (language, vision, audio, video) to run efficiently on a compact, high-performance AI appliance capable of supporting 100B+ parameter models at real-time speeds. Your mission is to deliver state-of-the-art multimodal inference locally through advanced model optimization, quantization, and system-level integration.

Key Responsibilities :

1. Model Integration & Porting

  • Optimize large-scale foundation models (e.G., Llama, gpt-oss, Whisper, HiDream, Qwen, Wan etc) for on-device inference.
  • Adapt pre-trained models for multimodal tasks (text, image, audio, video, or cross-modal reasoning).
  • Ensure seamless interoperability between modalities — e.G., enabling the system to “see, hear, and talk” naturally.
  • 2. Model Optimization for Edge Hardware

  • Quantize and compress large models (4-bit or mixed precision) while maintaining high accuracy and low latency.
  • Implement and benchmark inference runtimes using frameworks like Llama.Cpp, Ollama, vLLM, ONNX etc.
  • Collaborate with hardware engineers to co-design model architectures optimized for the appliance’s compute fabric.
  • 3. Inference Pipeline Development

  • Build and maintain scalable, high-throughput inference pipelines capable of handling concurrent multimodal requests (text, audio, image, video).
  • Implement token streaming, caching, and scheduling strategies for real-time responses.
  • Develop APIs for low-latency local inference accessible via a web interface.
  • 4. Evaluation & Benchmarking

  • Profile and benchmark performance (throughput, latency, energy efficiency) of deployed models.
  • Run regression tests to validate numerical accuracy after quantization or pruning.
  • Define KPIs for multimodal model performance under real-world usage.
  • 5. Research & Prototyping

  • Investigate emerging multimodal architectures and lightweight model variants for local deployment.
  • Prototype hybrid models that combine LLMs, diffusion models, and ASR / TTS pipelines for advanced multimodal applications.
  • Stay current on state-of-the-art inference frameworks, compression techniques, and multimodal learning trends.
  • Required Qualifications :

  • Strong background in deep learning and model deployment, with hands-on experience in PyTorch and / or TensorFlow.
  • Expertise in model optimization — quantization, pruning, distillation, or mixed-precision inference.
  • Practical knowledge of inference engines (vLLM, llama.Cpp, ONNX Runtime or similar).
  • Experience deploying large models locally or on edge devices with limited memory / compute constraints.
  • Familiarity with multimodal model architectures — e.G., CLIP, Flamingo, LLaVA, or AudioGPT-style systems.
  • Strong software engineering skills (Python, C++, CUDA) and experience integrating models into production systems.
  • Understanding of GPU / accelerator utilization, memory bandwidth optimization, and distributed inference.
  • Preferred Qualifications :

    experience-10+ years

  • Experience with model-parallel or tensor-parallel inference at scale.
  • Contributions to open-source inference frameworks or model serving systems.
  • Familiarity with hardware-aware training or co-optimization of neural networks and hardware.
  • Background in speech, vision, or multimodal ML research.
  • Track record of deploying models that run entirely offline or on embedded / edge systems.
  • Contact : Uday

    Mulya Technologies

    muday_bhaskar@yahoo.com

    "Mining The Knowledge Community"

    Create a job alert for this search

    Deployment Lead • Bengaluru, Republic Of India, IN

    Related jobs
    • Promoted
    AI / ML Developer

    AI / ML Developer

    Viionn Labsbangalore district, karnataka, in
    Derive and design use cases from structured and unstructured data.Provide LLM expertise to solve AI problems using state-of-the-art language models and off-the-shelf LLM services such as OpenAI mod...Show moreLast updated: 18 days ago
    • Promoted
    AIML Engineer

    AIML Engineer

    Tata Consultancy Serviceshosur, tamil nadu, in
    Competencies (Technical / Behavioral Competency).AI / ML, Azure ML Studio, AI / ML On Databricks, Python & CICD Devops.Supervised and unsupervised ML and Predictive Analytics using Python • Feature gener...Show moreLast updated: 18 days ago
    • Promoted
    Lead AI Engineer

    Lead AI Engineer

    BlendBangalore, IN
    We are looking for an AI Engineer with hands-on experience designing and deploying scalable AI solutions.In this role, you will be part of a cross-functional team working on cutting-edge projects i...Show moreLast updated: 14 days ago
    • Promoted
    Lead – Foundational AI

    Lead – Foundational AI

    Piramal FinanceBengaluru, Karnataka, India
    Lead design, training, and deployment of foundational and generative AI models.Fine-tune LLMs (GPT, Claude, open-source) for enterprise use. Guide a team of AI engineers to deliver high-impact solut...Show moreLast updated: 9 days ago
    • Promoted
    Technical Lead

    Technical Lead

    MovateBengaluru, Karnataka, India
    Lead design, architecture, and development of AI-driven applications using.GenAI and Agentic AI frameworks.Mentor and guide a team of developers, ensuring high-quality code and best practices.Colla...Show moreLast updated: 18 days ago
    • Promoted
    • New!
    Technical Adoption Lead – AI Builder

    Technical Adoption Lead – AI Builder

    BigRioGreater Bengaluru Area, India
    Technical Adoption Lead – AI-Driven Enablement & Knowledge Transfer.Onsite - Bangalore (relocation available).We are seeking a Technical Adoption Lead to act as the bridge between the core rewrite ...Show moreLast updated: 8 hours ago
    • Promoted
    Lead- Model Developer (Wholesale Risk)

    Lead- Model Developer (Wholesale Risk)

    MashreqBengaluru, Karnataka, India
    Wholesale Risk Model Development.Technical & Delivery Responsibilities.Support the data preparation and model development, validation, and lifecycle management of Wholesale Rating Models.Support mo...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Hiring AI / ML SW, FW and Si Design, DV Engineers (Bangalore, Bay Area CA)

    Hiring AI / ML SW, FW and Si Design, DV Engineers (Bangalore, Bay Area CA)

    Tsavorite Scalable IntelligenceGreater Bengaluru Area, India
    About Tsavorite Scalable Intelligence Inc.Tsavorite Scalable Intelligence is developing the semiconductor industry’s first Omni Processing Unit™ (OPU). CPU, GPU, Memory and Connectivity in a single ...Show moreLast updated: 8 hours ago
    • Promoted
    Lead Supervised AI

    Lead Supervised AI

    Piramal FinanceBengaluru, Karnataka, India
    ML / DL, with hands-on work in computer vision, document AI, audio / video ML, or similar fields.Strong background in deep learning (CNNs, RNNs, Transformers, ResNet, etc. Experience with tools like PyT...Show moreLast updated: 9 days ago
    • Promoted
    Lead Engineer - AI / ML

    Lead Engineer - AI / ML

    Mindfire SolutionsBangalore, IN
    As a Lead AI / ML Engineer, you spearhead the design, development, and implementation of advanced AI and machine learning models. Your role involves guiding a team of engineers ensuring the successful...Show moreLast updated: 30+ days ago
    • Promoted
    ML / Gen AI Engineer

    ML / Gen AI Engineer

    Intuition IT – Intuitive Technology Recruitmenthosur, tamil nadu, in
    Design, deploy, and manage scalable ML and GenAI workloads using AWS services including SageMaker Studio and Bedrock.Implement and maintain infrastructure using AWS Lambda, EKS, ECS on Fargate, and...Show moreLast updated: 1 day ago
    • Promoted
    Associate Architect - Machine Learning (Azure Cloud)

    Associate Architect - Machine Learning (Azure Cloud)

    QuantiphiBengaluru, Karnataka, India
    Role : Associate Architect - Machine Learning (Azure Cloud).Architect, develop, and deploy.Lead end-to-end ML lifecycle including data preparation, feature engineering, model development, validatio...Show moreLast updated: 3 days ago
    • Promoted
    AIML Lead

    AIML Lead

    ACL DigitalBengaluru, Karnataka, India
    Notice Period : Immediate joiner's only.Should have experience in NLP, Gen AI, Any Cloud(AWS or Azure), Computer Vision, Python,Deep Learning,RAG,Data Science. Must have experience in Leading team.Show moreLast updated: 17 days ago
    • Promoted
    Lead AI Engineer

    Lead AI Engineer

    Genisys GroupBengaluru, Karnataka, India
    As an AI / ML Engineer at Genisys Group, you will be instrumental in developing and.AI solutions, with a strong focus on Large Language Models. LLMs) and Retrieval-Augmented Generation (RAG) technique...Show moreLast updated: 18 days ago
    • Promoted
    AI & ML Engineer

    AI & ML Engineer

    Apna Technologies & Solutions (ApnaTech)Hosur, Tamil Nadu, India
    We are seeking a highly skilled.In this role, you will be responsible for designing, developing, and optimizing machine learning models and algorithms to solve complex problems.You will collaborate...Show moreLast updated: 11 days ago
    • Promoted
    AIML Architect

    AIML Architect

    ValueLabshosur, tamil nadu, in
    We at ValueLabs have an Opening for AI / ML Architect role.At least 7+ years of relevant AI / ML experience or previous ML experience with strong engineering competencies and at least 2+ years in Gener...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Chief AI / ML Engineer ( Noida )

    Chief AI / ML Engineer ( Noida )

    Mulya TechnologiesGreater Bengaluru Area, India
    Top10 Semiconductor Organization in the World.We the advanced R&D hub that focuses on developing cutting-edge technologies to prepare the future. Our vision, "Shape the Future with Innovation and In...Show moreLast updated: 8 hours ago
    • Promoted
    Lead AI / ML Engineer

    Lead AI / ML Engineer

    Optumhosur, tamil nadu, in
    Lead AI / ML Engineer – Clinical AI systems.Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will ...Show moreLast updated: 14 days ago