Talent.com
AI Engineer - GPT / LangChain / RAG / Data Pipelines
Peak Trust Global Real Estate • India
No longer accepting applications
19 hours ago
Job description

Location: Remote

Type: Full-time

Experience: 3+ Years

Salary: Up to 70K / month, based on experience

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical)

1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl, Playwright, Scrapy, or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2, PyMuPDF, BeautifulSoup, etc.
  • Prepare data in formats such as JSON, JSONL, or CSV

3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks

4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases (Qdrant, Pinecone, Weaviate)
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency

5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using Hugging Face Transformers, LoRA / QLoRA, or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models

6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using Pandas, Matplotlib, Seaborn, and Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights

7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability

Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl, Playwright, Scrapy, or similar tools
  • Strong background in document parsing, text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with Hugging Face, Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib, Seaborn, Plotly, or other visualization tools
  • Comfort using Docker and Git

Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch