We are looking for a Data Scientist with strong experience in Computer Vision, Document Parsing (LayoutLM), and Generative AI to join our growing AI team. If you love solving complex document-understanding problems and building scalable ML solutions, we’d love to meet you!
What You’ll Work On
Build and optimize Computer Vision and Document AI models for PDFs, scanned docs, and complex layouts.
Work with LayoutLM / LayoutLMv2 / LayoutLMv3, Donut, DocFormer, and other transformer-based architectures.
Develop GenAI applications (RAG pipelines, LLM fine-tuning, summarization, prompt engineering).
Design ML pipelines for large document datasets and integrate OCR tools (Tesseract, PaddleOCR, Textract, Google Vision).
Deploy models using MLOps best practices (Docker, MLflow, cloud ML services).
Collaborate with product & engineering teams to bring AI features to production.
What We’re Looking For
4–6 years of experience in Data Science / Machine Learning / Deep Learning.
Hands-on Computer Vision experience (CNNs, vision transformers, multimodal models).
Strong background in document parsing and document intelligence.
Experience with GenAI / LLMs (Hugging Face, OpenAI, LangChain, Llama).
Excellent skills in Python, PyTorch or TensorFlow, and ML libraries.
Experience working with OCR and production ML systems.
Good understanding of cloud platforms (AWS / GCP / Azure) & containerization.
Prior team management experience is a must.
✨ Nice to Have
Experience with vector databases (FAISS, Pinecone, Milvus).
Familiarity with Kubeflow, Airflow, or end-to-end MLOps pipelines.
Experience fine-tuning LLMs or multimodal models.
Education
Bachelor’s or Master’s in CS, Data Science, AI / ML, or a related field.
Senior Data Scientist • Delhi, India