Talent.com
GPU Cloud Validation and Performance Engineer

GPU Cloud Validation and Performance Engineer

LTIMindtreeRepublic Of India, IN
21 days ago
Job description

Job Description :

Senior Infrastructure Test & Validation Engineer (Zero-Touch GPU Cloud – GitOps Validation & Certification)

We are seeking a Senior Infrastructure Test & Validation Engineer with 10+ years of experience to lead the Zero-Touch Validation, Upgrade, and Certification automation of our on-prem GPU cloud platform. This role focuses on ensuring the stability, performance, and conformance of the entire stack—from hardware to Kubernetes—using automated, GitOps-based validation pipelines. The ideal candidate has a strong infrastructure background with deep hands-on skills in Sonobuoy , LitmusChaos , k6 , and pytest , and is passionate about automated test orchestration, platform resilience, and continuous conformance.

Key Responsibilities

  • Design and implement automated, GitOps-compliant pipelines for validation and certification of the GPU cloud stack across hardware, OS, Kubernetes, and platform layers.
  • Integrate Sonobuoy for Kubernetes conformance and certification testing.
  • Design and orchestrate chaos engineering workflows using LitmusChaos to validate system resilience across failure scenarios.
  • Implement performance testing suites using k6 and system-level benchmarks, integrated into CI / CD pipelines.
  • Develop and maintain end-to-end test frameworks using pytest and / or Go , focusing on cluster lifecycle events, upgrade paths, and GPU workloads.
  • Ensure test coverage and validation across multiple dimensions : conformance, performance, fault injection, and post-upgrade validation.
  • Build and maintain dashboards and reporting for automated test results, including traceability, drift detection, and compliance tracking.
  • Collaborate with infrastructure, SRE, and platform teams to embed testing and validation early in the deployment lifecycle.
  • Own quality assurance gates for all automation-driven deployments.

Required Skills & Experience

  • 10+ years of hands-on experience in infrastructure engineering, systems validation, or SRE roles.
  • Primary key skills required are pytest, Go, k6 scripting, automation frameworks integration (Sonobuoy, LitmusChaos), CI integration
  • Strong experience with :
  • Sonobuoy for Kubernetes conformance and diagnostics
  • LitmusChaos for fault injection and resilience validation
  • k6 for performance / load testing in distributed environments
  • pytest or Go-based test frameworks for automation and validation scripting
  • Deep understanding of Kubernetes architecture, upgrade patterns, and operational risks.
  • Experience validating infrastructure components (GPU drivers, kernel modules, CNI, CRI, etc.) across lifecycle events.
  • Proficient in GitOps workflows and integrating tests into declarative, Git-backed pipelines (e.G., with Argo CD, Flux).
  • Hands-on experience with CI / CD systems (e.G., GitHub Actions, GitLab CI, Jenkins) to automate test orchestration.
  • Solid scripting and automation experience (Python, Bash, or Go).
  • Familiarity with GPU-based infrastructure and its performance characteristics is a strong plus.
  • Strong debugging, root cause analysis, and incident investigation skills.
  • Create a job alert for this search

    Validation Engineer • Republic Of India, IN

    Related jobs
    • Promoted
    • New!
    Autonomous Driving Validation Engineer

    Autonomous Driving Validation Engineer

    L&T Technology ServicesRepublic Of India, IN
    Hands-on exp in validation of ADAS ( L2, L2+) functions such as ACC, LKA, AEB, Planner Driving behaviour etc.Create and execute test case for ADAS functions in simulated and on real world environme...Show moreLast updated: 18 hours ago
    • Promoted
    CPU Feature Validation Engineer

    CPU Feature Validation Engineer

    TenstorrentRepublic Of India, IN
    Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions mu...Show moreLast updated: 21 days ago
    • Promoted
    AI System Validation Engineer

    AI System Validation Engineer

    RecroRepublic Of India, IN
    Up to ₹19 LPA (pro-rated for 4 months).The role involves designing and executing test cases, validating model outputs, and ensuring the overall quality, reliability, and fairness of AI systems.This...Show moreLast updated: 11 days ago
    • Promoted
    • New!
    ADAS Validation Engineer

    ADAS Validation Engineer

    L&T Technology ServicesRepublic Of India, IN
    Hands-on exp in validation of ADAS ( L2, L2+) functions such as ACC, LKA, AEB, Planner Driving behaviour etc.Create and execute test case for ADAS functions in simulated and on real world environme...Show moreLast updated: 18 hours ago
    • Promoted
    Storage Validation Engineer

    Storage Validation Engineer

    DDNRepublic Of India, IN
    This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a globa...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Verification Engineer

    Senior Verification Engineer

    Tata Consultancy ServicesRepublic Of India, IN
    Must have very good System Verilog / UVM experience.Must have expertise in PCI gen6 and CXL3.Have experience in IP / SoC Verification. Expertise in AMBA / AXI bus protocols and ARM CPU.Experience in devel...Show moreLast updated: 30+ days ago
    • Promoted
    GPU Verification Engineer

    GPU Verification Engineer

    ConfidentialIndia
    WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs,.Grounded in a culture of...Show moreLast updated: 4 days ago
    • Promoted
    AMS Verification Engineer / Lead

    AMS Verification Engineer / Lead

    eInfochips (An Arrow Company)Nagpur, IN
    Minimum 6 years relevant experience is required.Bangalore, Hyderabad, Noida, Chennai, Ahmedabad, Pune.Min 6 Years of overall experience in ASIC Verification. Should have worked on AMS Verification f...Show moreLast updated: 30+ days ago
    • Promoted
    WTG Installation Validation Engineer

    WTG Installation Validation Engineer

    POWERCON®Republic Of India, IN
    The POWERCON® Group – an Indian MNC, is a Total Solution Provider for Renewable Energy Project Development, Construction and Lifetime Operations. The 2 flagships POWERCON® Ventures (Wind arm) and Po...Show moreLast updated: 21 days ago
    • Promoted
    • New!
    Scenario Validation Engineer

    Scenario Validation Engineer

    L&T Technology ServicesRepublic Of India, IN
    Hands-on exp in validation of ADAS ( L2, L2+) functions such as ACC, LKA, AEB, Planner Driving behaviour etc.Create and execute test case for ADAS functions in simulated and on real world environme...Show moreLast updated: 17 hours ago
    • Promoted
    ADAS Simulation and Validation Engineer

    ADAS Simulation and Validation Engineer

    TaggdPune, Republic Of India, IN
    Design, Develop & Validate ADAS functionalities through virtual simulations.Develop and maintain simulation models for ADAS functionalities using industry-standard tools and languages (e.CARSIM / MAT...Show moreLast updated: 21 days ago
    • Promoted
    Senior Verification Engineer

    Senior Verification Engineer

    VeriFast TechnologiesPune, Republic Of India, IN
    VeriFast Technologies is hiring Sr Verification Engineers in Pune with minimum 3+ to 5+ years of experience with PCIe / PCI-E, UCIe, CXL, strong SV, UVM, hands on AXI with ASIC / SoC.Do call me at 9934...Show moreLast updated: 30+ days ago
    • Promoted
    Validation & Performance Automation Eng

    Validation & Performance Automation Eng

    LTIMindtreeRepublic Of India, IN
    Senior Infrastructure Test & Validation Engineer (Zero-Touch GPU Cloud – GitOps Validation & Certification).Senior Infrastructure Test & Validation Engineer. Zero-Touch Validation, Upgrade, and Cert...Show moreLast updated: 21 days ago
    • Promoted
    Validation Engineer

    Validation Engineer

    HCLTechChennai, Republic Of India, IN
    Plan, design, and execute test strategies for cloud applications, ensuring comprehensive testing coverage.Implement and maintain automated testing frameworks for efficient and reliable testing proc...Show moreLast updated: 30+ days ago
    • Promoted
    Emulation and Validation Engineer

    Emulation and Validation Engineer

    L&T Technology ServicesRepublic Of India, IN
    The core responsibility of an emulation engineer is to.This involves using specialized hardware platforms, like.This "emulated" chip can run at near real-time speeds, allowing engineers to test lar...Show moreLast updated: 11 days ago
    • Promoted
    Lead Verification Engineer

    Lead Verification Engineer

    Tata Consultancy ServicesRepublic Of India, IN
    Must have very good System Verilog / UVM experience.Must have expertise in PCI gen6 and CXL3.Have experience in IP / SoC Verification. Expertise in AMBA / AXI bus protocols and ARM CPU.Experience in devel...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Verification and Validation Engineer

    Verification and Validation Engineer

    Signotron (India) Pvt. Ltd.Republic Of India, IN
    Founded in 1985 by technical entrepreneurs from IIT Kharagpur, Signotron has a long history of developing custom-made, cutting-edge hardware technology. These solutions now come integrated with IoT-...Show moreLast updated: 18 hours ago
    • Promoted
    • New!
    Post Silicon Validation Engineer

    Post Silicon Validation Engineer

    USTBhopal, Republic Of India, IN
    Perform characterization to analyze device performance and compliance to datasheet specification.Conceptualize, design and implement hardware and automation software for characterization of Automot...Show moreLast updated: 12 hours ago