Talent.com
AI Inference Kernel Engineer (CUDA)

AI Inference Kernel Engineer (CUDA)

Phinitymumbai, maharashtra, in
8 days ago
Job description

We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms. Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack. Of course, to automate algorithm and hardware discovery, we need to break the data barrier. CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on.

Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery. We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs. Our customers include one of the largest frontier model labs.

We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model. This is a hybrid Systems Engineer / AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry. Please do not apply unless you have optimized kernels before.

Skill requirements :

Languages : CUDA, C++, Python,

Frameworks : JAX / XLA, PyTorch, TensorFlow (at the C++ level), Pallas

Libraries : cuBLAS, cuDNN, CUTLASS, CUB, Thrust

Compiler Tools : NVCC, PTX assembly, MLIR / XLA understanding

Hardware Knowledge : SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers)

Apply if you have :
  • Achieved >
  • 10x speedups on production ML workloads

    • Written kernels that outperform vendor libraries
    • Optimized attention, GEMM, or convolution at the assembly level
    • Built custom fusions that beat XLA / Triton compiler output
    • Published papers or open-source kernels used in production
    Create a job alert for this search

    Ai Engineer • mumbai, maharashtra, in

    Related jobs
    • Promoted
    Agentic Ai Engineer

    Agentic Ai Engineer

    Nityo InfotechThāne, Republic Of India, IN
    AI Agent Development & LLM Integration.Build AI agents using frameworks like LangGraph, Autogen, Crew, or PydanticAI.Design and optimize prompt engineering workflows for LLMs (e.Develop modular, re...Show moreLast updated: 15 days ago
    • Promoted
    • New!
    Sr Ai Engineer

    Sr Ai Engineer

    Litmus7Dombivli, Republic Of India, IN
    As part of this initiative, resource should research and experiment with the latest AI and cloud innovations (such as AWS Agents, Databricks AI, and other Model Context Protocol (MCP integrations),...Show moreLast updated: 19 hours ago
    AI Engineer

    AI Engineer

    AryaXAIMumbai, IN-MH, IN
    AryaXAI stands at the forefront of AI innovation, revolutionizing AI for mission-critical businesses by building explainable, safe, and aligned systems that scale responsibly.Our mission is to crea...Show moreLast updated: 30+ days ago
    • Promoted
    AI Inference Kernel Engineer (CUDA)

    AI Inference Kernel Engineer (CUDA)

    PhinityMumbai, IN
    We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new c...Show moreLast updated: 9 days ago
    • Promoted
    Sr AI Engineer

    Sr AI Engineer

    Litmus7navi mumbai, maharashtra, in
    As part of this initiative, resource should research and experiment with the latest AI and cloud innovations (such as AWS Agents, Databricks AI, and other Model Context Protocol (MCP integrations),...Show moreLast updated: 1 day ago
    • Promoted
    Computer Vision Engineer

    Computer Vision Engineer

    RecroMumbai, IN
    AI Engineer - Manufacturing Analysis Platform.We're seeking a passionate AI Engineer to lead the development of our core AI analysis engine. You'll be architecting and implementing machine learning ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior AI Engineer (Oracle Fusion & OCI)

    Senior AI Engineer (Oracle Fusion & OCI)

    SmarTek21dombivli, maharashtra, in
    We are looking for a highly skilled AI Engineer to build and scale enterprise-grade AI solutions using advanced LLMs, agent frameworks, and automation platforms. In this hands-on role, you will desi...Show moreLast updated: 1 day ago
    • Promoted
    Lead AI Engineer

    Lead AI Engineer

    Genisys Groupnavi mumbai, maharashtra, in
    As an AI / ML Engineer at Genisys Group, you will be instrumental in developing and.AI solutions, with a strong focus on Large Language Models. LLMs) and Retrieval-Augmented Generation (RAG) technique...Show moreLast updated: 1 day ago
    • Promoted
    Ai Inference Kernel Engineer

    Ai Inference Kernel Engineer

    PhinityDombivli, Republic Of India, IN
    We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new c...Show moreLast updated: 8 days ago
    • Promoted
    AI Engineer

    AI Engineer

    ConfidentialMumbai, India
    AI Engineer — Image‑to‑Video (Mid‑Level).Location : Mumbai (on‑site / hybrid).Contract : 6 months, extendable.Build, fine‑tune, and ship image‑to‑video generation pipelines (prompt‑to‑video, storyboard...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Ai Engineer

    Senior Ai Engineer

    First American (India)Thāne, Republic Of India, IN
    We are hiring for great (Senior) AI Engineer who are open to learning the tools we use and building the AI Platform at First American. What we are looking for Requirements are : .Candidate must have i...Show moreLast updated: 15 days ago
    • Promoted
    AI Engineer

    AI Engineer

    TalentBridgeMumbai, IN
    Job Type : 6-Month Contract, after 6 months it will convert to fulltime.We are looking for an experienced AIML Engineer with 4–8 years of expertise in AI / ML solutions, specifically in building intel...Show moreLast updated: 20 days ago
    • Promoted
    Engineer-AI

    Engineer-AI

    Sakonmumbai city, maharashtra, in
    Role : AI Engineer – Agentic Systems & LLM Applications.We’re looking for a well-rounded, forward-thinking AI Engineer who can design, build, and deploy intelligent systems powered by LLMs, retrieva...Show moreLast updated: 1 day ago
    • Promoted
    Senior AI Engineer

    Senior AI Engineer

    First American (India)navi mumbai, maharashtra, in
    We are hiring for great (Senior) AI Engineer who are open to learning the tools we use and building the AI Platform at First American. What we are looking for Requirements are : .Candidate must have i...Show moreLast updated: 21 days ago
    • Promoted
    AiOps Engineer

    AiOps Engineer

    L&T Technology Servicesdombivli, maharashtra, in
    Only immediate to 15 days joiner.Develop and Deploy AI Solutions : .Design, build, and deploy end-to-end Machine Learning and Generative AI pipelines on. Google Cloud Platform, using Vertex AI service...Show moreLast updated: 19 days ago
    • Promoted
    • New!
    Engineer-Ai

    Engineer-Ai

    SakonThāne, Republic Of India, IN
    Role : AI Engineer – Agentic Systems & LLM Applications.We’re looking for a well-rounded, forward-thinking AI Engineer who can design, build, and deploy intelligent systems powered by LLMs, retrieva...Show moreLast updated: 19 hours ago
    • Promoted
    • New!
    AI / ML Engineer with Snowflake Cortex

    AI / ML Engineer with Snowflake Cortex

    ConfidentialMumbai, India
    If interested, please send an email to [HIDDEN TEXT] along with details : CTC, ECTC, Notice period and a brief summary of your Snowflake Cortex work (use cases, tools, outcomes) • •.We are looking f...Show moreLast updated: 17 hours ago
    AI Engineer

    AI Engineer

    ScaleneWorksMumbai, Maharashtra, India
    Quick Apply
    The AI Engineer is responsible for implementing Generative AI solutions for DXC business problems, improving efficiency, personalized user experiences, and the ability to automate complex tasks dri...Show moreLast updated: 30+ days ago