Job Description
Education and Work Experience Requirements :
- 5 to 8 years of experience as Data Scientist or GenAI specialist
- 2 to 3 years of experience in Generative AI solution development.
- Proven track record and experience with with GenAI technologies
- Open source LLMs like Llama, Gemma, Mixtral etco Closed source LLMs such as Open AI GPT, Azure Open AI, Claude, Gemini etco Prompt Engineering / Tuning, RAG, RAFT, LLM finetuning such as PEFT(LoRA, QLoRA ..)
- Understanding of SLMs such as Phi3, BERT and Transformer architectureo Vector databases like Pincone, Qdrant etc.
- Good knowledge of advanced statistical methods. Experience working with Text Data using transformer-based model
- Expertise with the following scripting languages : o Python, R, Tensorflow, Keras, Pytorch
- OpenNLP, CoreNLP, WordNet, NLTK, SpaCy, Gensim, Large Language Models, Knowledge Graphs
- Good and experience of machine learning algorithms and ability to apply them in supervised and un-supervised NLP tasks.
- Knowledge of NLP algorithms that can handle various NLP tasks such as intent recognition, entity extraction, language modeling, text classification, question answering, text summarization, topic modeling and so on.
- Experience building and fine-tuning Language Models (LMs), such as BERT, ELMo, XLNet etc to solve bespoke NLP tasks.
- Tech savy and willing to work with open-Source Tools
- Should have independently handled a project technically and provided directions to the other Team Members. Able to lead the project independently.
- Experience in turning ideas into actionable designs. Able to persuade stakeholders and champion effective techniques through development.
- Strong interpersonal and communication skills : ability to tell a clear, concise, actionable story with data, to folks across various levels of the company.
- Good to have foundational knowledge on Cloud, API frameworks like Flask, Fast API, Swagger / Postman tools
- Prior experience working on Mobility or Healthcare domain will be a plus
- Mandatory Skills :
- Design, develop, test, and deploy Machine Learning models using state-of-the-art algorithms with a strong focus on language models.
- Strong understanding of LLMs, and associated technologies like RAG, Agents, VectorDB and Guardrails
- Hand-on experience in GenAI frameworks like LlamaIndex, Langchain, Autogen, etc.
- Experience in cloud services like Azure, GCP and AWS
- Interact with our research team and with key partners in the market to build end-to-end AI / ML / NLP solutions : Conversational AI, document understanding and QnA.
- Mine and analyze data, applying statistical methods as necessary, pertaining to customers' discovery, and viewing experiences to identify critical product insights.
- Proactively develop new metrics and studies to quantify the value of different aspects of product.
- Drive efforts to enable product and engineering leaders to share your knowledge and insights through clear and concise communication, education, and data visualization.
- Translate analytic insights into concrete, actionable recommendations for business or product improvement.
- Build and improve reusable tools & modelling pipelines and support knowledge sharing across several teams.
- Define and deploy best practices in Machine Learning & MLOps / LLMOps, mentor and teach colleagues.
- Partner closely with product and engineering leaders throughout the lifecycle of project
Skills Required
Data Science