Talent.com
Ai Engineer - Gpt / Langchain / Rag / Data Pipelines
Ai Engineer - Gpt / Langchain / Rag / Data PipelinesPeak Trust Global Real Estate • Hosūr, Republic Of India, IN
Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

Peak Trust Global Real Estate • Hosūr, Republic Of India, IN
4 hours ago
Job description

Location : Remote

Type : Full-time

Experience : 3+ Years

Salary : up to 70K / Month based on experience

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical) 1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl , Playwright , Scrapy , or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2 , pymupdf , BeautifulSoup , etc.
  • Prepare data in formats such as JSON, JSONL, or CSV
  • 3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks
  • 4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases ( Qdrant , Pinecone , Weaviate )
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency
  • 5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers , LoRA / QLoRA , or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models
  • 6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using :
  • Pandas
  • Matplotlib
  • Seaborn
  • Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights
  • 7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability
  • Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl , Playwright, Scrapy, or similar tools
  • Strong background in document parsing , text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace , Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib , Seaborn , Plotly , or other visualization tools
  • Comfort using Docker and Git
  • Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)
  • Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch
  • Create a job alert for this search

    Ai Data Engineer • Hosūr, Republic Of India, IN

    Related jobs
    AI Engineer - GPT / LangChain / RAG / Data Pipelines

    AI Engineer - GPT / LangChain / RAG / Data Pipelines

    Peak Trust Global Real Estate • hosur, tamil nadu, in
    This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.Key Responsibilities (Technical). Build automated data collection workflows using tools suc...Show more
    Last updated: 9 hours ago • Promoted • New!
    AI Engineer Intern

    AI Engineer Intern

    Qureal AI • hosur, tamil nadu, in
    Hiring : AI Engineer Intern (Remote + Paid).We are looking for a dedicated AI Engineer Intern.This role offers a real opportunity to transition into a full-time position based on performance.PPO / F...Show more
    Last updated: 9 hours ago • Promoted • New!
    Senior Software Engineer

    Senior Software Engineer

    Programmers.io • hosur, tamil nadu, in
    We are seeking a highly skilled and experienced Senior Azure Data Engineer to join our team.The ideal candidate will have deep expertise in Microsoft Azure data services, cloud-based data engineeri...Show more
    Last updated: 30+ days ago • Promoted
    Freelance Data Engineer

    Freelance Data Engineer

    Leading MNC • hosur, tamil nadu, in
    Looking for a Freelance Data Engineer to join a team of rockstar developers.The candidate should have a minimum of 8+ yrs. If you're looking for freelance / part time opportunity (along with your day...Show more
    Last updated: 13 days ago • Promoted
    Python Developer With Test Driven Development (TDD)

    Python Developer With Test Driven Development (TDD)

    ENCORE IT SOLUTIONS • hosur, tamil nadu, in
    Job Description – Senior Python Developer – Service Implementation (TDD) (Contract).Short-term Contract (9 months).Flexible (8 hours / day with 4 hours PST overlap). Candidate should be comfortable wo...Show more
    Last updated: 9 hours ago • Promoted • New!
    Generative AI Engineer

    Generative AI Engineer

    Live Connections • hosur, tamil nadu, in
    Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field; Master’s or PhD preferred. ML / Data Science with a focus on generative AI, LLMs, or co...Show more
    Last updated: 16 days ago • Promoted
    Architect

    Architect

    Veltris • hosur, tamil nadu, in
    AI Architect - Telecom & Networking.Routing, Switching / SD-WAN / Provider Edge).ML Algorithms; Graph Neural Networks, Time-series Forecasting Algorithms (ARIMA, LSTM…). ML / DL libraries (PyTorch, Te...Show more
    Last updated: 9 hours ago • Promoted • New!
    Data Engineer

    Data Engineer

    Grantify • hosur, tamil nadu, in
    Grantify is an innovative education platform that connects students and universities through a transparent admissions and tuition-matching ecosystem. By aligning student budgets and academic aspirat...Show more
    Last updated: 9 hours ago • Promoted • New!
    AI Engineer

    AI Engineer

    Recruin • Hosur, Tamil Nadu, India
    Our Client is a global leader in diversified electronics for the semiconductor manufacturing ecosystem.Virtually every electronic device in the world is produced using our technologies.No laptop, s...Show more
    Last updated: 6 hours ago • Promoted • New!
    Python Web Scraping Engineer – Automation (3 to 10 yrs)

    Python Web Scraping Engineer – Automation (3 to 10 yrs)

    AIMLEAP • hosur, tamil nadu, in
    Python Web Scraping Engineer – Advanced Automation (WFH).Bachelor’s degree in Computer Science, IT, or related field .IT / Software Services / Data & AI . Strong hands-on experience handling.Seleniu...Show more
    Last updated: 9 hours ago • Promoted • New!
    Generative Ai Engineer

    Generative Ai Engineer

    Live Connections • Hosūr, Republic Of India, IN
    Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field;.ML / Data Science with a focus on generative AI, LLMs, or computer vision.Expertise in...Show more
    Last updated: 16 days ago • Promoted
    Python Developer

    Python Developer

    TekXera • hosur, tamil nadu, in
    Senior Python Engineer – Service Implementation.India | Pakistan | Nigeria | Kenya | Egypt | Ghana | Bangladesh | Turkey | Mexico. Full-Time Contract (9 Months, Extendable).San Francisco–based AI re...Show more
    Last updated: 9 hours ago • Promoted • New!
    Generative Ai Engineer

    Generative Ai Engineer

    Turing • Hosūr, Republic Of India, IN
    Turing is looking for people with LLM experience to join us in solving business problems for our Fortune 500 customers.You will be a key member of the Turing GenAI delivery organization and part of...Show more
    Last updated: 21 days ago • Promoted
    Lead Full-Stack + AI Engineer (Founding Team)

    Lead Full-Stack + AI Engineer (Founding Team)

    Grovio AI • hosur, tamil nadu, in
    We’re building an autonomous, multi-agent AI OS that plans, executes, and optimizes marketing across modern digital ecosystems. Think : an AI that acts like a virtual CMO — planning, writing, analyz...Show more
    Last updated: 1 hour ago • Promoted • New!
    Senior Data Engineer

    Senior Data Engineer

    Primesoft Inc • hosur, tamil nadu, in
    Primesoft Enterprise IT Services Pvt.As a Software Engineer II - Data, you will contribute to the design and development of data systems including pipelines, APIs, analytics, AI and machine learnin...Show more
    Last updated: 30+ days ago • Promoted
    Python for Machine Learning

    Python for Machine Learning

    People Prime Worldwide • hosur, tamil nadu, in
    Our client is a trusted global innovator of IT and business services.They help clients transform through consulting, industry solutions, business process services, digital & IT modernisation and ma...Show more
    Last updated: 2 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Recro • hosur, tamil nadu, in
    What would you be doing / Expected from this role?.Collaborate with cross-functional teams including data scientists, engineers, and product managers to deliver AI-driven solutions.Drive the archite...Show more
    Last updated: 30+ days ago • Promoted
    AI Architect

    AI Architect

    TekPillar® • Hosur, Tamil Nadu, India
    Job Title : AI Architect Experience : 8+ Years Location : Bangalore Mandatory Skills : Cloud platforms (AWS or equivalent) Artificial Intelligence / Large Language Models (LLMs) Key Responsibilities : ...Show more
    Last updated: 4 hours ago • Promoted • New!