We are looking for a Data Scientist with strong experience in Computer Vision , Document Parsing (LayoutLM) , and Generative AI to join our growing AI team. If you love solving complex document-understanding problems and building scalable ML solutions, we’d love to meet you!
🔍 What You’ll Work On
- Build and optimize Computer Vision and Document AI models for PDFs, scanned docs, and complex layouts.
- Work with LayoutLM / LayoutLMv2 / LayoutLMv3 , Donut, DocFormer, and transformer-based architectures.
- Develop GenAI applications (RAG pipelines, LLM fine-tuning, summarization, prompt engineering).
- Design ML pipelines for large document datasets and integrate OCR tools (Tesseract, PaddleOCR, Textract, Google Vision).
- Deploy models using MLOps best practices (Docker, MLflow, cloud ML services).
- Collaborate with product & engineering teams to bring AI features to production.
🧠 What We’re Looking For
4–6 years of experience in Data Science / Machine Learning / Deep Learning.Hands-on Computer Vision experience (CNNs, vision transformers, multimodal models).Strong background in document parsing and document intelligence .Experience with GenAI / LLMs (Hugging Face, OpenAI, LangChain, Llama).Excellent skills in Python, PyTorch or TensorFlow, and ML libraries.Experience working with OCR + production ML systems.Good understanding of cloud platforms (AWS / GCP / Azure) & containerization.Prior team management experience is a must.✨ Nice to Have
Experience with vector databases (FAISS, Pinecone, Milvus).Familiarity with Kubeflow, Airflow, or end-to-end MLOps pipelines.Experience fine-tuning LLMs or multimodal models.🎓 Education
Bachelor’s or Master’s in CS, Data Science, AI / ML, or related field.