About Us
Shunya Labs is building the Voice AI Infrastructure Layer for Enterprises, powering speech intelligence, conversational agents, and domain-specific voice applications across industries. Born from deep work in mental-health AI and built for global enterprise scale, our stack combines state-of-the-art ASR/TTS models with an open-weights philosophy, driving accuracy, privacy, and scalability.
About the Role
We're seeking an AI Systems Engineer who thrives at the intersection of AI model optimization, infrastructure engineering, and applied research.
You will evaluate, host, and optimize a wide range of AI models—spanning ASR, LLMs, and multimodal systems—and build the orchestration layer that powers scalable, low-latency deployments.
This is a role for someone who's comfortable navigating ambiguity, researching emerging AI methods, and translating client requirements into robust, production-ready solutions.
You'll work across the full stack—from GPU inference tuning to React-based control dashboards—building a resilient and scalable AI delivery platform.
Key Responsibilities
AI Model Evaluation & Optimization
- Evaluate, benchmark, and optimize AI models (speech, text, vision, multimodal) for latency, throughput, and accuracy.
- Implement advanced inference optimizations using ONNX Runtime, TensorRT, quantization, and GPU batching.
- Continuously research and experiment with the latest AI runtimes, serving frameworks, and model architectures.
- Develop efficient caching and model loading strategies for multi-tenant serving.
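To illustrate the kind of caching and model-loading strategy the bullets above describe, here is a minimal sketch of an in-process LRU model cache for multi-tenant serving. This is an illustrative example only, not part of the Shunya Labs stack: the `loader` callable and `max_models` limit are hypothetical stand-ins for a real session factory (e.g. an ONNX Runtime session constructor) and a GPU-memory budget.

```python
from collections import OrderedDict

class ModelCache:
    """LRU cache that evicts the least-recently-used model when full.

    `loader` is any callable that builds a model from its id; in a real
    serving stack it might create an ONNX Runtime or TensorRT session.
    """

    def __init__(self, loader, max_models=4):
        self.loader = loader
        self.max_models = max_models
        self._cache = OrderedDict()  # model_id -> loaded model, oldest first

    def get(self, model_id):
        if model_id in self._cache:
            self._cache.move_to_end(model_id)  # mark as most recently used
            return self._cache[model_id]
        model = self.loader(model_id)          # cache miss: load the model
        self._cache[model_id] = model
        if len(self._cache) > self.max_models:
            self._cache.popitem(last=False)    # evict least recently used
        return model
```

With `max_models=2`, requesting models `a`, `b`, `a`, `c` evicts `b` (the least recently used), while `a` and `c` stay resident; the same pattern caps GPU memory while keeping hot tenants' models warm.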
AI Infrastructure & Orchestration
- Design and develop a central orchestration layer to manage multi-model inference, load balancing, and intelligent routing.
- Build scalable, fault-tolerant deployments using AWS ECS/EKS, Lambda, and Terraform.
- Use Kubernetes autoscaling and GPU node optimization to minimize latency under dynamic load.
- Implement observability and monitoring (Prometheus, Grafana, CloudWatch) across the model-serving ecosystem.
DevOps, CI/CD & Automation
- Build and maintain CI/CD pipelines for model integration, updates, and deployment (GitHub Actions, CodePipeline, etc.).
- Manage Dockerized environments, version control, and GPU-enabled build pipelines.
- Ensure reproducibility and resilience through infrastructure-as-code and automated testing.
Frontend & Developer Tools
- Create React/Next.js-based dashboards for performance visualization, latency tracking, and configuration control.
- Build intuitive internal tools for model comparison, experiment management, and deployment control.
- Utilize Cursor, VS Code, and other AI-powered development tools to accelerate iteration.
Client Interaction & Solutioning
- Work closely with clients and internal stakeholders to gather functional and performance requirements.
- Translate abstract business needs into deployable AI systems with measurable KPIs.
- Prototype quickly, iterate with feedback, and deliver robust production systems.
Research & Continuous Innovation
- Stay on top of the latest AI research and model releases (OpenAI, Anthropic, Hugging Face, Meta, etc.).
- Evaluate emerging frameworks for model serving, fine-tuning, and retrieval (LangChain, LlamaIndex, GraphRAG, etc.).
- Proactively identify and implement performance or cost improvements in the model-serving stack.
- Share learnings and contribute to the internal AI knowledge base.
Ambiguous Problem Solving
- Work effectively in undefined problem spaces, identifying optimal paths forward through experimentation.
- Break down high-level goals into actionable technical strategies.
- Balance trade-offs between accuracy, latency, and cost while innovating under uncertainty.
Required Skills
- Strong proficiency in Python, TypeScript/JavaScript, Bash, and modern software development practices.
- Deep understanding of Docker, Kubernetes, Terraform, and AWS (ECS, Lambda, S3, CloudFront).
- Experience with inference optimization (ONNX, TensorRT, quantization, batching).
- Proven ability to design and scale real-time inference pipelines.
- Experience building and maintaining CI/CD pipelines and monitoring systems.
- Hands-on experience with React/Next.js or similar frameworks for dashboard/UI development.
- Strong grasp of API design, load balancing, and GPU resource management.
Nice to Have
- Experience with LangChain, LlamaIndex, GraphRAG, or vector databases (FAISS, Neo4j).
- Familiarity with speech processing models (Whisper, Silero, NeMo, etc.).
- Prior work with serverless inference or edge AI architectures.
- Knowledge of data pipelines, model versioning, and MLOps best practices.
Soft Skills
- Excellent problem-solving in ambiguous, evolving environments.
- Strong ability to research, self-learn, and prototype emerging AI technologies.
- Confident communicator who can translate technical findings to business impact.
- Ownership mindset with a collaborative, solution-oriented approach.
Skills Required
S3, API Design, Bash, CloudFront, React, TypeScript, JavaScript, Docker, Terraform, AWS ECS, Load Balancing, Kubernetes, Python