About IDfy
IDfy is an Integrated Identity Platform offering products and solutions for KYC, KYB, Background Verifications, Risk Assessment, and Digital Onboarding. We establish trust while delivering a frictionless experience for you, your employees, customers and partners.
Only IDfy combines enterprise-grade technology with business understanding and has the widest breadth of offerings in the industry. With more than 12+ years of experience and 2 million verifications per day, we are pioneers in this industry.
Our clients include HDFC Bank, Induslnd Bank, Zomato, Amazon, PhonePe, Paytm, HUL and many others.
We have successfully raised $27M from Elev8 Venture Partners, KB Investments & Tenacity Ventures!
We work fully onsite on all days of the week from our office in Andheri East, Mumbai
We're a perfect match if you have-
- 4-8 years of experience in Machine Learning, NLP, and LLM-based solutions.
- Strong expertise in fine-tuning and deploying LLMs (GPT-4, Llama, Mistral, or custom models).
- Experience with RAG-based architectures, including vector embeddings (FAISS, ChromaDB, Weaviate, Pinecone, or similar).
- Hands-on with agentic RAG, sliding window chunking, and efficient context retrieval techniques.
- Deep understanding of privacy AI use cases, including contract analysis, regulatory classification, and PII mapping.
- Proficiency in Python and frameworks like PyTorch, TensorFlow, JAX, Hugging Face, or LangChain.
- Experience in building scalable AI APIs and microservices.
- Exposure to MLOps practices, including model monitoring, inference optimization, and API scalability.
- Experience working with at least one cloud provider (AWS, GCP, or Azure).
Good-to-Have Skills
Experience in hybrid AI architectures combining vector search + relational databases.Familiarity with functional programming languages (Go, Elixir, Rust, etc.).Understanding of privacy compliance frameworks (DPDP Act, GDPR, CCPA, ISO 27701).Exposure to Kubernetes, Docker, and ML deployment best practices.Contributions to open-source LLM projects or privacy AI research.In this role you will :
Develop and fine-tune LLMs for contract analysis, regulatory classification, and risk assessment.Implement Retrieval-Augmented Generation (RAG) using vector embeddings and hybrid DB-based querying to power DPIA and compliance workflows.Build AI-driven contract analysis systems to detect dark patterns, classify clauses, and provide remediation suggestions.Develop knowledge graph-based purpose taxonomies for privacy policies and PII classification.Automate data discovery for structured and unstructured data, classifying it into PII categories.Optimize sliding window chunking, token-efficient parsing, and context-aware summarization for legal and compliance texts.Build APIs and ML services for deploying models in a high-availability production environment.Collaborate with privacy, legal, and compliance teams to build AI solutions that power Privy’s data governance tools.Stay ahead of the curve with agentic RAG, multi-modal LLMs, and self-improving models in the compliance domain.