Sr. Data Scientist
- Pune
- Full Time
- 02 / 01 / 2023.
- Experience : 3-5 years.
- Apply Now
Job description
We are looking for a candidate whose primary focus will be in applying Natural Language Processing (NLP) AI techniques, doing machine learning, and building high-quality prediction systems to classify data. Presenting information using data visualization techniques. Undertaking data collection, preprocessing, and harmonization
Roles and Responsibilities
Develop applications in machine learning and artificial intelligence. Selecting features, building and optimizing classifiers using machine learning techniques.Understanding business objectives and developing models that help to achieve them, along with metrics to track their progressManaging available resources such as hardware, data, and personnel so that deadlines are metAnalyzing the ML algorithms that could be used to solve a given problem and ranking them by their success probabilityExploring and visualizing data to gain an understanding of it, then identifying differences in data distribution that could affect performance when deploying the model in the real worldVerifying data quality, and / or ensuring it via data cleaningSupervising the data acquisition process if more data is neededFinding available datasets online that could be used for trainingDefining validation strategiesDefining the pre-processing or feature engineering to be done on a given datasetDefining data augmentation pipelinesTraining models and tuning their hyperparametersAnalyzing the errors of the model and designing strategies to overcome them Deploying models to productionDesired Candidate Profile
Sound understanding of ML and DL algorithmArchitecture level understanding of CNN RNN algorithmExperience with NLP data models and librariesGood understanding of entity extraction using NLPHandson tensorflow, scikit learn, spacy libraries etcGood knowledge of transfer learningGood scripting and programming skills in Python and StreamlightExperience with common data science toolkits, such as Python, NumPy, Transformers, Fast.AI, etc.Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, etc.Proficiency in using query languagesGood applied statistics skills, such as distributions, statistical testing, regression, etc.Data-oriented personality. Data Wrangling and Data ExplorationTableau, DataPrep is a PLUSNote – Preference for immediate joiner and salary no bar for right candidate.