Talent.com
AI Inference Kernel Engineer (CUDA)

AI Inference Kernel Engineer (CUDA)

PhinityBengaluru, IN
4 days ago
Job description

We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms. Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack. Of course, to automate algorithm and hardware discovery, we need to break the data barrier. CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on.

Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery. We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs. Our customers include one of the largest frontier model labs.

We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model. This is a hybrid Systems Engineer / AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry. Please do not apply unless you have optimized kernels before.

Skill requirements :

Languages : CUDA, C++, Python,

Frameworks : JAX / XLA, PyTorch, TensorFlow (at the C++ level), Pallas

Libraries : cuBLAS, cuDNN, CUTLASS, CUB, Thrust

Compiler Tools : NVCC, PTX assembly, MLIR / XLA understanding

Hardware Knowledge : SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers)

Apply if you have :
  • Achieved >
  • 10x speedups on production ML workloads

    • Written kernels that outperform vendor libraries
    • Optimized attention, GEMM, or convolution at the assembly level
    • Built custom fusions that beat XLA / Triton compiler output
    • Published papers or open-source kernels used in production
    Create a job alert for this search

    Ai Engineer • Bengaluru, IN

    Related jobs
    • Promoted
    Senior AI Engineer - Computer Vision

    Senior AI Engineer - Computer Vision

    lookupBengaluru, Karnataka, India
    To make video as accessible to machines as text and voice are today.Video is everywhere, but it's unsearchable—a black box of insight that no one can open or atleast open affordably.We're building ...Show moreLast updated: 17 days ago
    • Promoted
    GenAI Engineer

    GenAI Engineer

    Persistent SystemsBengaluru, Karnataka, India
    We are seeking a Generative AI Engineer to design and build intelligent systems using state-of-the-art generative models. You will work on developing applications powered by large language models (L...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Full-Stack Software Engineer - AI Applications

    Principal Full-Stack Software Engineer - AI Applications

    Mulya TechnologiesGreater Bengaluru Area, India
    Principal Full-Stack Software Engineer - AI Applications.Founded in 2023,by Industry veterans HQ in California,US.We are revolutionizing sustainable AI compute through intuitive software with compo...Show moreLast updated: 16 days ago
    • Promoted
    Principal Software Engineer – AI-Native, Startup Mindset

    Principal Software Engineer – AI-Native, Startup Mindset

    BigRioGreater Bengaluru Area, India
    Principal Software Engineer – AI-Native, Startup Mindset.India- Bangalore (relocation available).We are seeking an exceptional engineer who is ready to change how software is built, thrives in fast...Show moreLast updated: 14 days ago
    • Promoted
    Generative AI Engineer

    Generative AI Engineer

    TEKsystems Global Services in IndiaBangalore Urban, Karnataka, India
    We are looking for candidates with AI / ML experience on Azure (Azure ML, Azure OpenAI, Document Intelligence), AWS (SageMaker, Bedrock, Agents, Q), or GCP (Vertex AI, Google AI Platform) Kubernetes,...Show moreLast updated: 30+ days ago
    • Promoted
    Snowflake Cortex Developer

    Snowflake Cortex Developer

    Tata Consultancy ServicesGreater Bengaluru Area, India
    Job Title : Snowflake Cortex Developer.Required Skillset : Snowflake Cortex, SQL, Snowflake.Location : Delhi / Bangalore / Hyderabad / Pune / Mumbai. Strong data engineer ( snowflake + SQL) with good under...Show moreLast updated: 9 days ago
    • Promoted
    Conversational AI- Engineer (3-5 yrs)

    Conversational AI- Engineer (3-5 yrs)

    VerintBengaluru, Karnataka, India
    In this role, you will leverage advanced technical and linguistic skills to become an expert in our AI Orchestration platform : IVA Studio. You will build upon a deep expertise in Conversational AI a...Show moreLast updated: 17 days ago
    • Promoted
    Sr. Backend Engineer - AI&CV

    Sr. Backend Engineer - AI&CV

    GalaxEyeBengaluru, Karnataka, India
    GalaxEye is building groundbreaking AI-first products that transform satellite data into actionable intelligence.We’re creating the world’s most advanced multi-sensor geospatial intelligence platfo...Show moreLast updated: 4 days ago
    • Promoted
    Computer Vision Engineer

    Computer Vision Engineer

    Green HR Solutionshosur, tamil nadu, in
    Hiring for a USA based multinational Software Company.We are seeking a talented Computer Vision Engineer to join our team and develop innovative solutions using cutting-edge AI and image processing...Show moreLast updated: 25 days ago
    • Promoted
    GEN AI Developer

    GEN AI Developer

    Best Infosystems Ltd.Greater Bengaluru Area, India
    GEN AI Developer_Full-time_Bangalore / Pune / Navi Mumbai / Noida / Hyderabad / Chennai.Bangalore / Pune / Navi Mumbai / Noida / Hyderabad / Chennai. Gen Ai, Azure Open Ai, Python and which are mentions in cheat sheet....Show moreLast updated: 30+ days ago
    • Promoted
    Senior Serdes Architect

    Senior Serdes Architect

    Mulya TechnologiesGreater Bengaluru Area, India
    Senior SerDes Architect and Lead.About Omni Design Technologies.Omni Design Technologies is a leading provider of high-performance, ultra-low power IP cores, from 28nm down through advanced FinFET ...Show moreLast updated: 25 days ago
    • Promoted
    Full Stack Engineer (MERN)

    Full Stack Engineer (MERN)

    IntelenseGreater Bengaluru Area, India
    Founded in December 2018, Intelense develops cutting-edge AI software that transforms unstructured heterogeneous data into actionable insights. Initially uncovering valuable trends in public spaces ...Show moreLast updated: 17 days ago
    • Promoted
    GEN AI Sr. Developer

    GEN AI Sr. Developer

    Best Infosystems Ltd.Greater Bengaluru Area, India
    Developer_Full-time_Bangalore / Pune / Navi Mumbai / Noida / Hyderabad / Chennai.Bangalore / Pune / Navi Mumbai / Noida / Hyderabad / Chennai. Gen Ai, Azure Open Ai, Python and which are mentions in cheat sheet.Senior ...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    AS Technology CorporationBangalore Urban, Karnataka, India
    We are seeking an experienced AI / ML Architect to lead the design and implementation of intelligent solutions for client use cases. The role involves building PoCs and scalable platforms leveraging C...Show moreLast updated: 27 days ago
    • Promoted
    Software Engineer, AI Agents

    Software Engineer, AI Agents

    Asha Health (YC F24)Greater Bengaluru Area, India
    Asha Health is a New York based seed stage startup that helps medical practices launch their own AI clinics.We're backed by Y Combinator, General Catalyst, 186 Ventures, Reach Capital and many more...Show moreLast updated: 16 days ago
    • Promoted
    AI Engineer

    AI Engineer

    TalentBridgeBengaluru, IN
    Job Type : 6-Month Contract, after 6 months it will convert to fulltime.We are looking for an experienced AIML Engineer with 4–8 years of expertise in AI / ML solutions, specifically in building intel...Show moreLast updated: 15 days ago
    • Promoted
    Tech Lead / Senior Developer – Vision AI

    Tech Lead / Senior Developer – Vision AI

    Tata ElectronicsGreater Bengaluru Area, India
    Tata Electronics (a wholly owned subsidiary of Tata Sons Pvt.India’s first AI-enabled state-of-the-art Semiconductor Foundry. This facility will produce chips for applications such as power manageme...Show moreLast updated: 23 days ago
    • Promoted
    Kofax Developer

    Kofax Developer

    SYNECHRON TECHNOLOGYBangalore Rural, Karnataka, India
    We have immediate opportunity for Tungstun kofax.At Synechron, we believe in the power of digital to transform businesses for the better. Our global consulting firm combines creativity and innovativ...Show moreLast updated: 30+ days ago