Talent.com
AI Engineer - GPT / LangChain / RAG / Data Pipelines
AI Engineer - GPT / LangChain / RAG / Data PipelinesPeak Trust Global Real Estate • Narela, Delhi, India
AI Engineer - GPT / LangChain / RAG / Data Pipelines

AI Engineer - GPT / LangChain / RAG / Data Pipelines

Peak Trust Global Real Estate • Narela, Delhi, India
15 hours ago
Job description

Location : Remote

Type : Full-time

Experience : 3+ Years

Salary : up to 70K / Month based on experience

Role Summary We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical) 1. Data Acquisition & Automation Build automated data collection workflows using tools such as Firecrawl , Playwright , Scrapy , or similar frameworks

Extract multi-format documents (PDFs, HTML, text, images)

Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation Clean and process unstructured documents

Apply OCR (Tesseract, PaddleOCR) for scanned files

Convert and structure data using PyPDF2 , pymupdf , BeautifulSoup , etc.

Prepare data in formats such as JSON, JSONL, or CSV

3. Dataset Preparation Segment and structure text for ML training

Create Q&A datasets, summaries, instruction-response pairs, and labeled text

Build high-quality datasets compatible with fine-tuning frameworks

4. Retrieval & Indexing Pipelines Implement document chunking strategies

Generate embeddings and manage vector databases ( Qdrant , Pinecone , Weaviate )

Build retrieval workflows using LangChain or LlamaIndex

Optimize retrieval accuracy and latency

5. Model Training & Fine-Tuning Run fine-tuning jobs using HuggingFace Transformers , LoRA / QLoRA , or similar methods

Monitor training performance and refine datasets

Package and deploy fine-tuned models

6. Data Visualization & Analytics Create analytical charts, trends, and insights using :

Pandas

Matplotlib

Seaborn

Plotly

Build simple internal dashboards or visual summaries for reports

Transform raw datasets into meaningful visual insights

7. Automation & Infrastructure Write modular, maintainable Python scripts

Containerize workflows with Docker

Maintain version control with Git

Ensure reproducibility and pipeline stability

Required Technical Skills Strong proficiency in Python

Experience with Firecrawl , Playwright, Scrapy, or similar tools

Strong background in document parsing , text processing, and OCR

Familiarity with LangChain or LlamaIndex

Experience with vector databases

Hands-on experience with HuggingFace , Transformer models, and fine-tuning

Ability to write clean, efficient data pipelines

Experience with Matplotlib , Seaborn , Plotly , or other visualization tools

Comfort using Docker and Git

Nice to Have Experience serving models or building small APIs (FastAPI)

Exposure to GPU training environments

Background in large-scale unstructured data work

Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate Comfortable owning full pipelines independently

Detail-oriented and analytical

Strong problem-solving ability

Can work with minimal supervision

Enjoys building structured systems from scratch

Create a job alert for this search

Ai Data Engineer • Narela, Delhi, India

Related jobs
Generative AI Engineer

Generative AI Engineer

Live Connections • narela, delhi, in
Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field; Master’s or PhD preferred. ML / Data Science with a focus on generative AI, LLMs, or co...Show more
Last updated: 16 days ago • Promoted
AI Engineer - GPT / LangChain / RAG / Data Pipelines

AI Engineer - GPT / LangChain / RAG / Data Pipelines

Peak Trust Global Real Estate • narela, delhi, in
This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.Key Responsibilities (Technical). Build automated data collection workflows using tools suc...Show more
Last updated: 18 hours ago • Promoted • New!
Lead Data Scientist + Instructor - AI&ML

Lead Data Scientist + Instructor - AI&ML

Newton School of Technology • Sonipat, Haryana, India
Newton School of Technology (NST) is a new-age institution redefining technical education in India.Founded by IIT alumni, NST offers a 4-year B. Tech in Computer Science and AI, focused on hands-on ...Show more
Last updated: 24 days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Recro • narela, delhi, in
Job Description : AI / ML Engineer (3D Geometry & Manufacturing).We are seeking an exceptionally talented and entrepreneurial. Design for Manufacturability (DFM).If you are passionate about leveraging ...Show more
Last updated: 30+ days ago • Promoted
Architect

Architect

Veltris • narela, delhi, in
AI Architect - Telecom & Networking.Routing, Switching / SD-WAN / Provider Edge).ML Algorithms; Graph Neural Networks, Time-series Forecasting Algorithms (ARIMA, LSTM…). ML / DL libraries (PyTorch, Te...Show more
Last updated: 18 hours ago • Promoted • New!
Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

Peak Trust Global Real Estate • Narela, Republic Of India, IN
This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.Key Responsibilities (Technical). Build automated data collection workflows using tools suc...Show more
Last updated: 11 hours ago • Promoted • New!
Generative AI Engineer

Generative AI Engineer

telerapp • narela, delhi, in
As a Gen AI Engineer at Telerapp, you will play a crucial role in designing, developing, and implementing cutting-edge generative artificial intelligence models and systems.Your expertise will cont...Show more
Last updated: 4 days ago • Promoted
Data Engineer

Data Engineer

Grantify • narela, delhi, in
Grantify is an innovative education platform that connects students and universities through a transparent admissions and tuition-matching ecosystem. By aligning student budgets and academic aspirat...Show more
Last updated: 18 hours ago • Promoted • New!
Freelance Data Engineer

Freelance Data Engineer

Leading MNC • narela, delhi, in
Looking for a Freelance Data Engineer to join a team of rockstar developers.The candidate should have a minimum of 8+ yrs. If you're looking for freelance / part time opportunity (along with your day...Show more
Last updated: 13 days ago • Promoted
Python Developer

Python Developer

TekXera • narela, delhi, in
Senior Python Engineer – Service Implementation.India | Pakistan | Nigeria | Kenya | Egypt | Ghana | Bangladesh | Turkey | Mexico. Full-Time Contract (9 Months, Extendable).San Francisco–based AI re...Show more
Last updated: 18 hours ago • Promoted • New!
Generative AI Engineer

Generative AI Engineer

Turing • narela, delhi, in
Turing is looking for people with LLM experience to join us in solving business problems for our Fortune 500 customers.You will be a key member of the Turing GenAI delivery organization and part of...Show more
Last updated: 21 days ago • Promoted
Senior Data Engineer

Senior Data Engineer

Primesoft Inc • narela, delhi, in
Primesoft Enterprise IT Services Pvt.As a Software Engineer II - Data, you will contribute to the design and development of data systems including pipelines, APIs, analytics, AI and machine learnin...Show more
Last updated: 30+ days ago • Promoted
Python for Machine Learning

Python for Machine Learning

People Prime Worldwide • narela, delhi, in
Our client is a trusted global innovator of IT and business services.They help clients transform through consulting, industry solutions, business process services, digital & IT modernisation and ma...Show more
Last updated: 2 days ago • Promoted
Lead Full-Stack + AI Engineer (Founding Team)

Lead Full-Stack + AI Engineer (Founding Team)

Grovio AI • narela, delhi, in
We’re building an autonomous, multi-agent AI OS that plans, executes, and optimizes marketing across modern digital ecosystems. Think : an AI that acts like a virtual CMO — planning, writing, analyz...Show more
Last updated: 10 hours ago • Promoted • New!
Oracle Analytics & AI Solutions Architect

Oracle Analytics & AI Solutions Architect

TribolaTech Inc • narela, delhi, in
Oracle Analytics & AI Solutions Architect.Our client believes in connecting people and business to Insurance in ways that are Innovative, Hyper-Relevant, Compelling and Personal.They bring together...Show more
Last updated: 15 days ago • Promoted
Python Web Scraping Engineer – Automation (3 to 10 yrs)

Python Web Scraping Engineer – Automation (3 to 10 yrs)

AIMLEAP • narela, delhi, in
Python Web Scraping Engineer – Advanced Automation (WFH).Bachelor’s degree in Computer Science, IT, or related field .IT / Software Services / Data & AI . Strong hands-on experience handling.Seleniu...Show more
Last updated: 18 hours ago • Promoted • New!
Data Engineer + Instructor

Data Engineer + Instructor

Newton School of Technology • Sonipat, Haryana, India
We are looking for a passionate and experienced.AI concepts, with a strong emphasis on Python programming.You will guide learners in building skills to visualize, analyze, and model data, preparing...Show more
Last updated: 24 days ago • Promoted
AI Engineer Intern

AI Engineer Intern

Qureal AI • narela, delhi, in
Hiring : AI Engineer Intern (Remote + Paid).We are looking for a dedicated AI Engineer Intern.This role offers a real opportunity to transition into a full-time position based on performance.PPO / F...Show more
Last updated: 18 hours ago • Promoted • New!