Job Title : Data Scientist - NLP
Client : Optimum Data Analytics
Location : Remote / Pune (Hybrid option available)
Experience : 5-7 Years
Employment Type : Full-time
About Optimum Data Analytics :
Optimum Data Analytics is a strategic technology partner delivering turnkey AI solutions. We help organizations accelerate decision-making by providing advanced analytics and AI-powered platforms. Our team of statisticians, computer scientists, data scientists, and product managers brings expertise, flexibility, and innovation to help businesses grow, transform, and achieve measurable outcomes.
Role Overview :
We are seeking an experienced Data Scientist - NLP to join our AI and advanced analytics team. The ideal candidate will have strong expertise in Natural Language Processing (NLP), Generative AI, and Large Language Models (LLMs). You will work on cutting-edge projects, building intelligent text-based solutions for applications like classification, sentiment analysis, information extraction, and conversational AI.
Key Responsibilities :
- Design, develop, and optimize NLP models for classification, sentiment analysis, entity recognition, and conversational AI.
- Work with large-scale unstructured text data to build insights and predictive solutions.
- Implement and fine-tune LLMs (Hugging Face, OpenAI, etc.) for real-world use cases.
- Experiment with embeddings, vector databases, and generative AI models.
- Collaborate with product and engineering teams to deploy models into production environments.
- Stay updated on latest NLP, LLM, and generative AI research, applying advancements to business problems.
Required Skills & Qualifications :
5-7 years of hands-on experience as a Data Scientist / NLP Engineer.Strong programming skills in Python with expertise in NLP libraries (NLTK, SpaCy, Hugging Face Transformers, Gensim).Solid experience with LLMs, embeddings, and vector databases.Proficiency in data preprocessing, feature engineering, and model evaluation techniques.Strong understanding of SQL and working experience with cloud platforms (AWS, GCP, or Azure).Strong problem-solving, communication, and collaboration skills.Good to Have :
Experience in MLOps tools (MLflow, Kubeflow).Exposure to retrieval-augmented generation (RAG) pipelines.Familiarity with deep learning frameworks (TensorFlow, PyTorch).(ref : hirist.tech)