Talent.com
AI Inference Kernel Engineer (CUDA)

AI Inference Kernel Engineer (CUDA)

PhinityThane, IN
1 day ago
Job description

We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms. Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack. Of course, to automate algorithm and hardware discovery, we need to break the data barrier. CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on.

Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery. We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs. Our customers include one of the largest frontier model labs.

We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model. This is a hybrid Systems Engineer / AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry. Please do not apply unless you have optimized kernels before.

Skill requirements :

Languages : CUDA, C++, Python,

Frameworks : JAX / XLA, PyTorch, TensorFlow (at the C++ level), Pallas

Libraries : cuBLAS, cuDNN, CUTLASS, CUB, Thrust

Compiler Tools : NVCC, PTX assembly, MLIR / XLA understanding

Hardware Knowledge : SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers)

Apply if you have :
  • Achieved >
  • 10x speedups on production ML workloads

    • Written kernels that outperform vendor libraries
    • Optimized attention, GEMM, or convolution at the assembly level
    • Built custom fusions that beat XLA / Triton compiler output
    • Published papers or open-source kernels used in production
    Create a job alert for this search

    Ai Engineer • Thane, IN

    Related jobs
    • Promoted
    Computer Vision Engineer

    Computer Vision Engineer

    Green HR SolutionsThane, IN
    Hiring for a USA based multinational Software Company.We are seeking a talented Computer Vision Engineer to join our team and develop innovative solutions using cutting-edge AI and image processing...Show moreLast updated: 23 days ago
    • Promoted
    AI Inference Kernel Engineer (CUDA)

    AI Inference Kernel Engineer (CUDA)

    PhinityMumbai, IN
    We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new c...Show moreLast updated: 1 day ago
    • Promoted
    Senior AI Engineer

    Senior AI Engineer

    Nous InfosystemsKalyan-Dombivli, IN
    Nous Infosystems is a CMMI® Level 5 and ISO 9001 : 2000 certified global information Technology Company with expertise in providing quality software solutions and IT-enabled support services to a wid...Show moreLast updated: 14 days ago
    • Promoted
    Principal Engineer / Team Lead (CUDA,C++, GPU) with Airamatrix(AI Product Company) - Work From Office - 5 Days a Week

    Principal Engineer / Team Lead (CUDA,C++, GPU) with Airamatrix(AI Product Company) - Work From Office - 5 Days a Week

    AIRA MatrixThane, Maharashtra, India
    You will provide leadership in designing and implementing ground-breaking GPU computers that run demanding deep learning, high-performance computing, and computationally intensive workloads.We seek...Show moreLast updated: 30+ days ago
    • Promoted
    AI Agent Developer

    AI Agent Developer

    Sikich IndiaThane, IN
    Sikich is seeking a talented and driven developers with 3-5 years of experience to help us design, build, and deploy intelligent agents using Microsoft’s ecosystem. This role involves working with M...Show moreLast updated: 28 days ago
    • Promoted
    Principal Engineer - HPC / CUDA / GPU

    Principal Engineer - HPC / CUDA / GPU

    Hitya GlobalThane
    Key Responsibilities : - You will provide leadership in designing and implementing groundbreaking GPU computers that run demanding deep learning, high-perfo...Show moreLast updated: 30+ days ago
    • Promoted
    AI Cloud Engineer

    AI Cloud Engineer

    SKS EnterprisesMumbai
    Role Overview : We are looking for a skilled and forward-thinking AI Cloud Engineer to join our AI & Cloud Engineering team. This role is ideal for someone who thr...Show moreLast updated: 30+ days ago
    • Promoted
    Full Stack AI engineer

    Full Stack AI engineer

    AnswerThis (YC F25)Mumbai, IN
    Remote (Applications open worldwide).Semantic Search, Vector Databases, Prompt Engineering, GenAI Frameworks, React Agents, Graph Agents, Document Parsing, Python, Scalable APIs.AnswerThis is an AI...Show moreLast updated: 29 days ago
    • Promoted
    Artificial Intelligence Engineer

    Artificial Intelligence Engineer

    Productist.AIThane, Maharashtra, India
    About the Role We are seeking a highly specialized and results-driven AI Automation Engineer to join our fast-moving team. You will be the architect of our internal and client-facing workflows, l...Show moreLast updated: 2 days ago
    • Promoted
    AI Engineer

    AI Engineer

    TalentBridgeKalyan-Dombivli, IN
    Job Type : 6-Month Contract, after 6 months it will convert to fulltime.We are looking for an experienced AIML Engineer with 4–8 years of expertise in AI / ML solutions, specifically in building intel...Show moreLast updated: 12 days ago
    • Promoted
    AI Engineer

    AI Engineer

    Magna HireMumbai
    Job Description : We're building the future of AI-driven security automation and are looking for a hands-on AI Engineer who can design, deploy, and scal...Show moreLast updated: 26 days ago
    • Promoted
    Gen Ai - Engineer

    Gen Ai - Engineer

    Diligente TechnologiesThane, IN
    Hands-on experience with Generative AI (GenAI) and agent-based AI frameworks.Proficiency in backend programming languages such as Node. Strong knowledge of both SQL and NoSQL databases.Experience wi...Show moreLast updated: 11 days ago
    • Promoted
    Full Stack AI Engineer

    Full Stack AI Engineer

    Targeticon Digital Services Pvt. Ltd.Mumbai
    What You'll Do : - Build and deploy AI-powered applications end-to-end using modern AI stacks, including integrating Large Language Models (LLMs) into real-world applicati...Show moreLast updated: 30+ days ago
    • Promoted
    Artificial Intelligence Engineer

    Artificial Intelligence Engineer

    Debales AIKalyan-Dombivli, IN
    Debales AI builds autonomous AI Agents that seamlessly integrate into existing systems — no new dashboards, no added workflow overhead. With 100+ integrations and 80+ specialized AI Agents, we strea...Show moreLast updated: 2 days ago
    • Promoted
    Artificial Intelligence Engineer

    Artificial Intelligence Engineer

    Capabiliq INCNavi Mumbai, Maharashtra, India
    We’re Hiring : AI Engineer Join one of the most exciting opportunities in the AI world! Our client — a US-based, AI-funded startup (Series B) — is building something truly revolutionary in the fie...Show moreLast updated: 1 day ago
    AI Engineer

    AI Engineer

    ScaleneWorksMumbai, Maharashtra, India
    Quick Apply
    The AI Engineer is responsible for implementing Generative AI solutions for DXC business problems, improving efficiency, personalized user experiences, and the ability to automate complex tasks dri...Show moreLast updated: 30+ days ago
    • Promoted
    Senior AI Engineer

    Senior AI Engineer

    First American (India)Kalyan-Dombivli, IN
    We are hiring for great (Senior) AI Engineer who are open to learning the tools we use and building the AI Platform at First American. What we are looking for Requirements are : .Candidate must have i...Show moreLast updated: 13 days ago
    • Promoted
    AuxoAI - Artificial Intelligence Engineer

    AuxoAI - Artificial Intelligence Engineer

    AuxoAIMumbai
    AuxoAI is seeking a skilled and experienced AI Engineers to join our dynamic team.The ideal candidate will have 4+ years of prior experience in software engineering. This role involves collaborating...Show moreLast updated: 30+ days ago