Talent.com
AI Engineer - GPT / LangChain / RAG / Data Pipelines
AI Engineer - GPT / LangChain / RAG / Data PipelinesPeak Trust Global Real Estate • baddi, himachal pradesh, in
AI Engineer - GPT / LangChain / RAG / Data Pipelines

AI Engineer - GPT / LangChain / RAG / Data Pipelines

Peak Trust Global Real Estate • baddi, himachal pradesh, in
14 hours ago
Job description

Location : Remote

Type : Full-time

Experience : 3+ Years

Salary : up to 70K / Month based on experience

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical) 1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl , Playwright , Scrapy , or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2 , pymupdf , BeautifulSoup , etc.
  • Prepare data in formats such as JSON, JSONL, or CSV
  • 3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks
  • 4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases ( Qdrant , Pinecone , Weaviate )
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency
  • 5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers , LoRA / QLoRA , or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models
  • 6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using :
  • Pandas
  • Matplotlib
  • Seaborn
  • Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights
  • 7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability
  • Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl , Playwright, Scrapy, or similar tools
  • Strong background in document parsing , text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace , Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib , Seaborn , Plotly , or other visualization tools
  • Comfort using Docker and Git
  • Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)
  • Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch
  • Create a job alert for this search

    Ai Data Engineer • baddi, himachal pradesh, in

    Related jobs
    Architect

    Architect

    Veltris • baddi, himachal pradesh, in
    AI Architect - Telecom & Networking.Routing, Switching / SD-WAN / Provider Edge).ML Algorithms; Graph Neural Networks, Time-series Forecasting Algorithms (ARIMA, LSTM…). ML / DL libraries (PyTorch, Te...Show more
    Last updated: 14 hours ago • Promoted • New!
    Python Developer

    Python Developer

    TekXera • baddi, himachal pradesh, in
    Senior Python Engineer – Service Implementation.India | Pakistan | Nigeria | Kenya | Egypt | Ghana | Bangladesh | Turkey | Mexico. Full-Time Contract (9 Months, Extendable).San Francisco–based AI re...Show more
    Last updated: 14 hours ago • Promoted • New!
    AI Engineer Intern

    AI Engineer Intern

    Qureal AI • baddi, himachal pradesh, in
    Hiring : AI Engineer Intern (Remote + Paid).We are looking for a dedicated AI Engineer Intern.This role offers a real opportunity to transition into a full-time position based on performance.PPO / F...Show more
    Last updated: 14 hours ago • Promoted • New!
    Machine Learning Engineer

    Machine Learning Engineer

    Recro • baddi, himachal pradesh, in
    What would you be doing / Expected from this role?.Collaborate with cross-functional teams including data scientists, engineers, and product managers to deliver AI-driven solutions.Drive the archite...Show more
    Last updated: 30+ days ago • Promoted
    Sr. Azure Data Architect & Presales Solution

    Sr. Azure Data Architect & Presales Solution

    Programmers.io • baddi, himachal pradesh, in
    Job Title : Azure Data Architect.Location : Hyderabad, Pune, Jaipur.Experience required : 12+ years.We are seeking a highly experienced. The ideal candidate should bring strong expertise in SQL, ETL / EL...Show more
    Last updated: 17 days ago • Promoted
    Lead Full-Stack + AI Engineer (Founding Team)

    Lead Full-Stack + AI Engineer (Founding Team)

    Grovio AI • baddi, himachal pradesh, in
    We’re building an autonomous, multi-agent AI OS that plans, executes, and optimizes marketing across modern digital ecosystems. Think : an AI that acts like a virtual CMO — planning, writing, analyz...Show more
    Last updated: 6 hours ago • Promoted • New!
    Data Engineer

    Data Engineer

    Grantify • baddi, himachal pradesh, in
    Grantify is an innovative education platform that connects students and universities through a transparent admissions and tuition-matching ecosystem. By aligning student budgets and academic aspirat...Show more
    Last updated: 14 hours ago • Promoted • New!
    Senior Software Engineer

    Senior Software Engineer

    Programmers.io • baddi, himachal pradesh, in
    We are seeking a highly skilled and experienced Senior Azure Data Engineer to join our team.The ideal candidate will have deep expertise in Microsoft Azure data services, cloud-based data engineeri...Show more
    Last updated: 30+ days ago • Promoted
    Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

    Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

    Peak Trust Global Real Estate • Baddi, Republic Of India, IN
    This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.Key Responsibilities (Technical). Build automated data collection workflows using tools suc...Show more
    Last updated: 9 hours ago • Promoted • New!
    Oracle Analytics & AI Solutions Architect

    Oracle Analytics & AI Solutions Architect

    TribolaTech Inc • baddi, himachal pradesh, in
    Oracle Analytics & AI Solutions Architect.Our client believes in connecting people and business to Insurance in ways that are Innovative, Hyper-Relevant, Compelling and Personal.They bring together...Show more
    Last updated: 15 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Live Connections • baddi, India
    Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field; Master’s or PhD preferred. ML / Data Science with a focus on generative AI, LLMs, or co...Show more
    Last updated: 7 hours ago • Promoted • New!
    Generative Ai Engineer

    Generative Ai Engineer

    Live Connections • Baddi, Republic Of India, IN
    Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field;.ML / Data Science with a focus on generative AI, LLMs, or computer vision.Expertise in...Show more
    Last updated: 16 days ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Primesoft Inc • baddi, himachal pradesh, in
    Primesoft Enterprise IT Services Pvt.As a Software Engineer II - Data, you will contribute to the design and development of data systems including pipelines, APIs, analytics, AI and machine learnin...Show more
    Last updated: 30+ days ago • Promoted
    Python Developer With Test Driven Development (TDD)

    Python Developer With Test Driven Development (TDD)

    ENCORE IT SOLUTIONS • baddi, himachal pradesh, in
    Job Description – Senior Python Developer – Service Implementation (TDD) (Contract).Short-term Contract (9 months).Flexible (8 hours / day with 4 hours PST overlap). Candidate should be comfortable wo...Show more
    Last updated: 14 hours ago • Promoted • New!
    Python for Machine Learning

    Python for Machine Learning

    People Prime Worldwide • baddi, himachal pradesh, in
    Our client is a trusted global innovator of IT and business services.They help clients transform through consulting, industry solutions, business process services, digital & IT modernisation and ma...Show more
    Last updated: 2 days ago • Promoted
    Freelance Data Engineer

    Freelance Data Engineer

    Leading MNC • baddi, himachal pradesh, in
    Looking for a Freelance Data Engineer to join a team of rockstar developers.The candidate should have a minimum of 8+ yrs. If you're looking for freelance / part time opportunity (along with your day...Show more
    Last updated: 13 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Turing • baddi, himachal pradesh, in
    Turing is looking for people with LLM experience to join us in solving business problems for our Fortune 500 customers.You will be a key member of the Turing GenAI delivery organization and part of...Show more
    Last updated: 21 days ago • Promoted
    Python Web Scraping Engineer – Automation (3 to 10 yrs)

    Python Web Scraping Engineer – Automation (3 to 10 yrs)

    AIMLEAP • baddi, himachal pradesh, in
    Python Web Scraping Engineer – Advanced Automation (WFH).Bachelor’s degree in Computer Science, IT, or related field .IT / Software Services / Data & AI . Strong hands-on experience handling.Seleniu...Show more
    Last updated: 14 hours ago • Promoted • New!