We are looking for a Data Scientist with strong experience in Computer Vision , Document Parsing (LayoutLM) , and Generative AI to join our growing AI team. If you love solving complex document-understanding problems and building scalable ML solutions, we’d love to meet you!
What You’ll Work On
Build and optimize Computer Vision and Document AI models for PDFs, scanned docs, and complex layouts.
Work with LayoutLM / LayoutLMv2 / LayoutLMv3 , Donut, DocFormer, and transformer-based architectures.
Develop GenAI applications (RAG pipelines, LLM fine-tuning, summarization, prompt engineering).
Design ML pipelines for large document datasets and integrate OCR tools (Tesseract, PaddleOCR, Textract, Google Vision).
Deploy models using MLOps best practices (Docker, MLflow, cloud ML services).
Collaborate with product & engineering teams to bring AI features to production.
What We’re Looking For
4–6 years of experience in Data Science / Machine Learning / Deep Learning.
Hands-on Computer Vision experience (CNNs, vision transformers, multimodal models).
Strong background in document parsing and document intelligence .
Experience with GenAI / LLMs (Hugging Face, OpenAI, LangChain, Llama).
Excellent skills in Python, PyTorch or TensorFlow, and ML libraries.
Experience working with OCR + production ML systems.
Good understanding of cloud platforms (AWS / GCP / Azure) & containerization.
Prior team management experience is a must.
✨ Nice to Have
Experience with vector databases (FAISS, Pinecone, Milvus).
Familiarity with Kubeflow, Airflow, or end-to-end MLOps pipelines.
Experience fine-tuning LLMs or multimodal models.
Education
Bachelor’s or Master’s in CS, Data Science, AI / ML, or related field.
Senior Data Scientist • Dombivali, Maharashtra, India