Talent.com
LLM Systems Performance Engineer (CUDA)

Phinity, Hosur, Tamil Nadu, IN
11 hours ago
Job description

We look forward to the day when AI can discover the next quantum AI accelerator or make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence and discover new computational paradigms. Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack. To automate algorithm and hardware discovery, however, we need to break the data barrier: CUDA is a low-resource language, and kernel optimization depends heavily on context and hardware details that models are simply not trained on.

Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which in turn will fuel algorithmic discovery. We are building environments in which agents learn to write kernels from a spec, optimize them for specific hardware, and eventually discover new hardware breakthroughs. Our customers include one of the largest frontier model labs.

We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads and bake their industry experience into a model. This is a hybrid Systems Engineer / AI research role in which you will read and debug model reasoning traces and design the CUDA problems that best teach unreleased models to automate your industry work. Please do not apply unless you have optimized kernels before.

Skill requirements :

Languages : CUDA, C++, Python

Frameworks : JAX / XLA, PyTorch, TensorFlow (at the C++ level), Pallas

Libraries : cuBLAS, cuDNN, CUTLASS, CUB, Thrust

Compiler Tools : NVCC, PTX assembly, MLIR / XLA understanding

Hardware Knowledge : SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers)

Apply if you have :
  • Achieved >10x speedups on production ML workloads
  • Written kernels that outperform vendor libraries
  • Optimized attention, GEMM, or convolution at the assembly level
  • Built custom fusions that beat XLA / Triton compiler output
  • Published papers or open-source kernels used in production