We are looking for bright minds to join us as AI Research Interns in Noida for a 6-month stint.
Key Responsibilities
- Benchmark open-source LLMs for information extraction, document reasoning, and OCR on different NVIDIA GPUs and Google TPUs
- Tune infrastructure and vLLM configurations to maximize tokens-per-second per dollar spent
- Continuously improve system performance through prompt refinement and rival prompt strategies
Technical Skills
Proficiency in Python and experience with deep learning frameworks (PyTorch / TensorFlow)Familiarity with LLM serving frameworks (vLLM, HuggingFace, TensorRT-LLM, Triton)Understanding of GPU / TPU architectures and performance tuningExposure to OCR tools (Tesseract, PaddleOCR, DocTR)Knowledge of prompt engineering and evaluation techniquesPreferred Profile
B.Tech Computer Science from a Tier-1 college (2025 & 2026 batch)Prior experience or internships in any of the above work streamsOwnership mindset, problem-solving attitude, ability to learn and experiment quicklyAttention to detail, proactive approach, and hunger for success