Sr Data Scientist - Innovation Lab

Genzeon Global, Hyderabad, TG, IN
21 days ago
Job description

Sr Data Scientist

Job Responsibilities:

  • LLM Architecture: Good understanding of the architecture underlying large language models, such as Transformer-based models and their variants. Design and implement deep learning model architectures using PyTorch.
  • Language Model Training and Fine-Tuning: Experience in training large-scale language models from scratch, as well as fine-tuning pre-trained models on domain data.
  • Data Preprocessing for NLP: Skilled in preprocessing textual data, including tokenization, stemming, lemmatization, and handling of different text encodings.
  • Transfer Learning and Adaptation: Proficiency in applying transfer learning techniques to adapt existing LLMs to new languages, domains, or specific business needs.
  • Data Annotation and Evaluation: Skills in designing and implementing data annotation strategies for training LLMs and evaluating their performance using appropriate metrics.
  • Scalability and Deployment: Experience in scaling LLMs for production environments, ensuring efficiency and robustness in deployment.
  • Model Training, Optimization, and Evaluation: Evaluate the performance of PyTorch models using appropriate metrics and techniques such as cross-validation, holdout sets, or online evaluation. This encompasses the complete cycle of training, fine-tuning, and validating language models. You will design and adapt LLMs for use in virtual assistants, information retrieval, information extraction, and similar applications.
  • Experimentation with Emerging Technologies and Methods: Actively exploring new technologies and methodologies in language model development, including experimental frameworks and software tools.
  • LLM Alignment: Understanding of alignment algorithms such as DPO, PPO, KPO, and RLHF, and using them to build guardrails.
  • AI Data Retrieval: Retrieve data from unstructured sources and extract key-value pairs using techniques such as Donut, LayoutLM, and Table Transformer.
  • Analyze data and perform exploratory data analysis (EDA) to identify data patterns.
  • Hands-on experience with, and a strong understanding of, concepts in deep learning and NLP.
  • Proficient in TensorFlow and similar libraries.

Required Qualifications

  • 5+ years of hands-on experience developing and deploying large language models and machine learning systems, and working with PyTorch.
  • A thorough understanding of machine learning, particularly deep learning techniques, including knowledge of neural network architectures, training methods, and optimization algorithms.
  • Proficiency in AI technology and Python, including experience with NLP libraries (e.g., Hugging Face Transformers, NLTK, spaCy) and text classification.
  • Experience with frameworks: PyTorch or TensorFlow.
  • Experience with cloud services (AWS, Azure) and the ML deployment tool Docker.
  • Familiarity with model fine-tuning and optimization techniques for LLMs.
  • Proven track record of innovative solutions in the field of LLMs.
  • Strong communication skills, with the ability to explain complex AI concepts to non-expert audiences.
Additional Good-to-Have Qualifications

  • 4+ years' experience in data analytics, data science, or quantitative analysis, using statistical programming languages to draw insights from large data sets.
  • 3+ years' experience in Python development, preferably delivering production code for data applications.
  • Experience with unstructured data or computer vision models is a plus.
  • Experience with SQL is a big plus.
  • Extensive model implementation experience using Scikit.
  • Experience designing and developing security-critical applications; experience with the specifics of HIPAA / PHI / PII / GDPR is a big plus.
  • Basic experience with Linux, Git, and Jupyter Notebooks is a must.
  • Knowledge of Agile development practices.
  • Flexibility and adaptability to respond to a rapidly changing environment.
  • Experience with distributed computing techniques and job orchestration tools and platforms (e.g., Airflow) is very valuable.
Requirements

Data Science, LLM, Generative AI, NLP, PyTorch, TensorFlow, Machine Learning, Deep Learning, Artificial Intelligence, Hugging Face, RLHF, AI Alignment, Cloud AI
