Talent.com
AI Systems Engineer

Confidential • Gurugram, Gurgaon / Gurugram, India
5 days ago
Job description

About Us

Shunya Labs is building the Voice AI Infrastructure Layer for Enterprises, powering speech intelligence, conversational agents, and domain-specific voice applications across industries. Born from deep work in mental-health AI and built for global enterprise scale, our stack combines state-of-the-art ASR/TTS models with an open-weights philosophy, driving accuracy, privacy, and scalability.

About the Role

We're seeking an AI Systems Engineer who thrives at the intersection of AI model optimization, infrastructure engineering, and applied research.

You will evaluate, host, and optimize a wide range of AI models, spanning ASR, LLMs, and multimodal systems, and build the orchestration layer that powers scalable, low-latency deployments.

This is a role for someone who's comfortable navigating ambiguity, researching emerging AI methods, and translating client requirements into robust, production-ready solutions.

You'll work across the full stack, from GPU inference tuning to React-based control dashboards, building a resilient and scalable AI delivery platform.

Key Responsibilities

AI Model Evaluation & Optimization

  • Evaluate, benchmark, and optimize AI models (speech, text, vision, multimodal) for latency, throughput, and accuracy.
  • Implement advanced inference optimizations using ONNX Runtime, TensorRT, quantization, and GPU batching.
  • Continuously research and experiment with the latest AI runtimes, serving frameworks, and model architectures.
  • Develop efficient caching and model loading strategies for multi-tenant serving.

AI Infrastructure & Orchestration

  • Design and develop a central orchestration layer to manage multi-model inference, load balancing, and intelligent routing.
  • Build scalable, fault-tolerant deployments using AWS ECS/EKS, Lambda, and Terraform.
  • Use Kubernetes autoscaling and GPU node optimization to minimize latency under dynamic load.
  • Implement observability and monitoring (Prometheus, Grafana, CloudWatch) across the model-serving ecosystem.

DevOps, CI/CD & Automation

  • Build and maintain CI/CD pipelines for model integration, updates, and deployment (GitHub Actions, CodePipeline, etc.).
  • Manage Dockerized environments, version control, and GPU-enabled build pipelines.
  • Ensure reproducibility and resilience through infrastructure-as-code and automated testing.

Frontend & Developer Tools

  • Create React/Next.js-based dashboards for performance visualization, latency tracking, and configuration control.
  • Build intuitive internal tools for model comparison, experiment management, and deployment control.
  • Use Cursor, VS Code, and other AI-powered development tools to accelerate iteration.

Client Interaction & Solutioning

  • Work closely with clients and internal stakeholders to gather functional and performance requirements.
  • Translate abstract business needs into deployable AI systems with measurable KPIs.
  • Prototype quickly, iterate with feedback, and deliver robust production systems.

Research & Continuous Innovation

  • Stay on top of the latest AI research and model releases (OpenAI, Anthropic, Hugging Face, Meta, etc.).
  • Evaluate emerging frameworks for model serving, fine-tuning, and retrieval (LangChain, LlamaIndex, GraphRAG, etc.).
  • Proactively identify and implement performance or cost improvements in the model serving stack.
  • Share learnings and contribute to the internal AI knowledge base.

Ambiguous Problem Solving

  • Work effectively in undefined problem spaces, identifying optimal paths forward through experimentation.
  • Break down high-level goals into actionable technical strategies.
  • Balance trade-offs between accuracy, latency, and cost while innovating under uncertainty.

Required Skills

  • Strong proficiency in Python, TypeScript/JavaScript, Bash, and modern software development practices.
  • Deep understanding of Docker, Kubernetes, Terraform, and AWS (ECS, Lambda, S3, CloudFront).
  • Experience with inference optimization (ONNX, TensorRT, quantization, batching).
  • Proven ability to design and scale real-time inference pipelines.
  • Experience building and maintaining CI/CD pipelines and monitoring systems.
  • Hands-on experience with React/Next.js or similar frameworks for dashboard/UI development.
  • Strong grasp of API design, load balancing, and GPU resource management.

Nice to Have

  • Experience with LangChain, LlamaIndex, GraphRAG, or vector and graph databases (FAISS, Neo4j).
  • Familiarity with speech processing models (Whisper, Silero, NeMo, etc.).
  • Prior work with serverless inference or edge AI architectures.
  • Knowledge of data pipelines, model versioning, and MLOps best practices.

Soft Skills

  • Excellent problem-solving in ambiguous, evolving environments.
  • Strong ability to research, self-learn, and prototype emerging AI technologies.
  • Confident communicator who can translate technical findings to business impact.
  • Ownership mindset with a collaborative, solution-oriented approach.

Skills Required

S3, API Design, Bash, CloudFront, React, TypeScript, JavaScript, Docker, Terraform, AWS ECS, Load Balancing, Kubernetes, Python
