Job Overview
Responsible to drive solutioning for GPU-as-a-Service (GaaS) and AI Cloud offerings. The ideal candidate will design, optimize, and deliver scalable GPU-based cloud solutions leveraging NVIDIA and other AI cloud platforms.
Responsibilities
- Architect and solution GPU-accelerated workloads for AI, ML, and HPC applications.
- Design and implement scalable GPU-as-a-Service offerings on NVIDIA AI Enterprise, DGX Cloud, or public / private cloud platforms.
- Collaborate with product, engineering, and sales teams to define AI cloud strategies and customer solutions.
- Benchmark GPU performance, optimize costs, and ensure seamless cloud integration.
- Engage with clients to understand workloads, recommend architectures, and support deployments.
Educational Qualifications
BE / B-Tech or equivalent with Computer Science or Electronics & Communication
RELEVANT EXPERIENCE
Relevant Experience in AI Cloud, GPU computing, or solution architecture.Hands-on experience with NVIDIA AI, DGX systems, CUDA, Triton Inference Server, and cloud platforms (AWS, Azure, GCP).Strong understanding of AI / ML pipelines, Kubernetes, and containerization.Excellent communication and pre-sales solutioning skills.