Talent.com
Senior Applied AI Scientist - Document Understanding
Senior Applied AI Scientist - Document UnderstandingPiazza Consulting Group • Hyderabad, Republic Of India, IN
Senior Applied AI Scientist - Document Understanding

Senior Applied AI Scientist - Document Understanding

Piazza Consulting Group • Hyderabad, Republic Of India, IN
1 day ago
Job description

We are looking for someone obsessed with turning messy real-world documents into perfectly structured, actionable data.

If you live and breathe document layout analysis, OCR post-processing, and visual + language models, this role is custom-built for you.

Core Responsibilities

  • Design and build production-grade Document Intelligence pipelines (invoices, contracts, forms, reports, handwritten, tables, multi-language, etc.)
  • Train / fine-tune and deploy Layout-aware models (LayoutLMv3, Donut, LLaMA-Adapter, Nougat, etc.)
  • Build and optimize Vision-Language models (VLLMs) on custom enterprise document datasets
  • Improve OCR accuracy using layout context, post-correction with LLMs, and geometric reasoning
  • Own the full stack : data preparation → model training (PyTorch) → evaluation → ONNX / TensorRT optimization → FastAPI deployment
  • Push boundaries on table extraction, key-value pairing, nested hierarchies, and multi-page document understanding

Must-Have Skills & Experience

  • Very strong understanding of document layout analysis (bounding boxes, reading order, logical blocks, nested tables, headers / footers, multi-column detection)
  • Hands-on experience with modern Document AI architectures :
  • LayoutLMv1 / v2 / v3, DocFormer, LayoutXLM, Donut, Pix2Struct, Nougat, UDOP, etc.
  • Vision-Language models (LLaVA, Qwen-VL, InternVL, PaliGemma, etc.)
  • Deep experience fine-tuning and serving LLMs & VLLMs (Llama-3, Mistral, Phi-3-vision, Qwen, etc.) using PEFT (LoRA / QLoRA), vLLM, TGI, or Ollama
  • Strong PyTorch proficiency (custom trainers, distributed training with DDP / FSDP, TorchCompile, mixed precision)
  • Solid grasp of OCR ecosystems and post-processing (Tesseract, EasyOCR, PaddleOCR, AWS Textract / Google Document AI limitations and how to beat them)
  • Experience building datasets from real enterprise documents (Labelling tools : UBIAI, Label Studio, Doccano, custom UI)
  • Good applied math : Transformers, attention mechanisms, positional encodings (especially 2D layouts), RoPE, ALiBi
  • Nice-to-Have (Big Bonus)

  • Published research or open-source contributions in Document AI / VLLM space
  • Experience with multimodal RAG over documents
  • ONNX / TensorRT / DeepSpeed optimization for low-latency inference
  • Kubernetes + GPU scheduling (we run our own bare-metal cluster)
  • Who thrives here?

    You get excited when you see a 50-page scanned purchase order with overlapping stamps and handwritten notes — because you already know exactly how you’re going to destroy it.

    Perks

  • Work directly on enterprise deals worth crores — your model = real revenue impact
  • Unlimited GPU access (A100s & H100s in-house)
  • Create a job alert for this search

    Applied Ai Scientist • Hyderabad, Republic Of India, IN

    Related jobs
    Senior Applied AI Scientist

    Senior Applied AI Scientist

    Confidential • Hyderabad / Secunderabad, Telangana, India
    Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a saf...Show more
    Last updated: 15 days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    ValueMatrix.AI • Hyderabad, IN
    A Behavioural sciences DeepTech company.This is a full-time remote role for a Senior Data Scientist.The Senior Data Scientist will be responsible for performing data analysis, data visualization, a...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI and Data Science Engineer

    Senior AI and Data Science Engineer

    DAZN India • Hyderabad, Republic Of India, IN
    Is your next career move to work in a team which uses data, reporting and analytical skills to help answer business questions to make DAZN a data-driven company?. DAZN is a tech-first sport streamin...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI Applied Scientist

    Senior AI Applied Scientist

    Confidential • Hyderabad / Secunderabad, Telangana, India
    As a Senior AI Applied Scientist for The Customer Service Applications Team, you will play a pivotal role in advancing Microsoft's mission to empower every individual and organization on the planet...Show more
    Last updated: 27 days ago • Promoted
    Senior Data Scientist SME & AI

    Senior Data Scientist SME & AI

    Information Tech Consultants • Hyderabad, IN
    Senior Data Scientist SME & AI Architect (10+ Years Experience) 🧠.We are seeking a highly accomplished and results-oriented. Senior Data Scientist Subject Matter Expert (SME).This is a pivotal role...Show more
    Last updated: 6 days ago • Promoted
    Senior Data Science Specialist

    Senior Data Science Specialist

    Prudent Technologies and Consulting, Inc. • Hyderabad, Republic Of India, IN
    We are seeking an experienced Senior Data Scientist to join our team and drive impactful data-driven solutions across the organization. This role focuses on advanced analytical problem-solving, buil...Show more
    Last updated: 3 days ago • Promoted
    Senior Bioanalysis Research Associate

    Senior Bioanalysis Research Associate

    Aragen Life Sciences • Hyderabad, Republic Of India, IN
    Job Title : Research Associate / Sr Research Associate / Associate Scientist.Should expertise in handling the HPLC and LCMS along with sound knowledge on troubleshooting. Well versed with method deve...Show more
    Last updated: 4 days ago • Promoted
    Senior Applied Scientist (Data Science & ML)

    Senior Applied Scientist (Data Science & ML)

    AdZeta • Hyderabad, IN
    AdZeta turns insights from your data lake to profit signals that seamlessly integrate with your ad stack.We unify your first-party data, predict lifetime value, activate high-value signals, and pro...Show more
    Last updated: 1 day ago • Promoted
    Applied AI ( Palantir Certification Program)

    Applied AI ( Palantir Certification Program)

    People Tech Group Inc • Hyderabad, Telangana, India
    Job Description : Semantic Data & AI Fellowship.Hiring – Full Time | Hyderabad Location.We are looking for skilled and enthusiastic professionals to join our growing team at People Tech Group.If you...Show more
    Last updated: 23 days ago • Promoted
    Principal Applied AI Scientist

    Principal Applied AI Scientist

    Tanla Platforms Limited • Hyderabad, Republic Of India, IN
    We are seeking an experienced AI / ML Engineer to design and implement cutting-edge solutions in the field of Generative AI and Large Language Models (LLMs). This role involves leading the development...Show more
    Last updated: 10 days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Syren • Hyderabad, Telangana, India
    Feature Engineering and Data Preprocessing : .Work with raw data to perform data cleaning, transformation, and feature engineering to prepare datasets for analysis. Collaborate with cross-functional t...Show more
    Last updated: 6 days ago • Promoted
    AI Engineer / Senior AI Engineer – Document Intelligence

    AI Engineer / Senior AI Engineer – Document Intelligence

    Piazza Consulting Group • Hyderabad, India
    We are looking for someone obsessed with turning messy real-world documents into perfectly structured, actionable data.If you live and breathe document layout analysis, OCR post-processing, and vis...Show more
    Last updated: 9 hours ago • Promoted • New!
    Senior ADME Studies Scientist

    Senior ADME Studies Scientist

    Aragen Life Sciences • Hyderabad, Republic Of India, IN
    Job Title : Senior Research Associate / Associate Scientist.Primary responsibility is to study design, execution, data analysis and reporting of ADME studies. Responsible for data integrity and good d...Show more
    Last updated: 7 days ago • Promoted
    Applied ML Scientist

    Applied ML Scientist

    SAIVA AI • Hyderabad, IN
    SAIVA AI applies machine learning to make optimal use of electronic health data for the most vulnerable healthcare population. Our mission is to improve patient outcomes by augmenting clinical decis...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist - Generative AI & LLM

    Senior Data Scientist - Generative AI & LLM

    PrimEra Medical Technologies • Hyderabad, Telangana, India
    Position : Senior Data Scientist - Generative AI & LLM.We are seeking a highly skilled and experienced Senior Data Scientist to join our innovative team. The ideal candidate will possess a strong bac...Show more
    Last updated: 11 days ago • Promoted
    Applied AI Scientist

    Applied AI Scientist

    Re Sustainability Limited • Hyderabad, Republic Of India, IN
    Proactively engage with business stakeholders, product managers, and domain experts to deeply understand key organizational challenges and strategic goals. Formulate and scope data science initiativ...Show more
    Last updated: 2 days ago • Promoted
    Senior Computer Science Writer with PhD

    Senior Computer Science Writer with PhD

    Fengkai Group Co., Limited • Hyderabad, IN
    We are seeking researchers and editors capable of providing in-depth assessments of academic manuscripts (similar to journal peer reviews). Leverage scientific expertise to write, proofread, and fac...Show more
    Last updated: 1 day ago • Promoted
    Ai Engineer / Senior Ai Engineer – Document Intelligence

    Ai Engineer / Senior Ai Engineer – Document Intelligence

    Piazza Consulting Group • Hyderabad, Republic Of India, IN
    We are looking for someone obsessed with turning messy real-world documents into perfectly structured, actionable data.If you live and breathe document layout analysis, OCR post-processing, and vis...Show more
    Last updated: 1 day ago • Promoted