Talent.com
AI Engineer - GPT / LangChain / RAG / Data Pipelines
Peak Trust Global Real Estate • Sangli, Maharashtra, India
7 hours ago
Job description

Location: Remote

Type: Full-time

Experience: 3+ years

Salary: up to 70K / month, based on experience

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical)

1. Data Acquisition & Automation

Build automated data collection workflows using tools such as Firecrawl, Playwright, Scrapy, or similar frameworks

Extract multi-format documents (PDFs, HTML, text, images)

Manage large-scale crawling, including rate limits, error handling, and scheduling

2. Document Processing & Transformation

Clean and process unstructured documents

Apply OCR (Tesseract, PaddleOCR) for scanned files

Convert and structure data using PyPDF2, PyMuPDF, BeautifulSoup, etc.

Prepare data in formats such as JSON, JSONL, or CSV

3. Dataset Preparation

Segment and structure text for ML training

Create Q&A datasets, summaries, instruction-response pairs, and labeled text

Build high-quality datasets compatible with fine-tuning frameworks

4. Retrieval & Indexing Pipelines

Implement document chunking strategies

Generate embeddings and manage vector databases (Qdrant, Pinecone, Weaviate)

Build retrieval workflows using LangChain or LlamaIndex

Optimize retrieval accuracy and latency

5. Model Training & Fine-Tuning

Run fine-tuning jobs using HuggingFace Transformers, LoRA / QLoRA, or similar methods

Monitor training performance and refine datasets

Package and deploy fine-tuned models

6. Data Visualization & Analytics

Create analytical charts, trends, and insights using Pandas, Matplotlib, Seaborn, and Plotly

Build simple internal dashboards or visual summaries for reports

Transform raw datasets into meaningful visual insights

7. Automation & Infrastructure

Write modular, maintainable Python scripts

Containerize workflows with Docker

Maintain version control with Git

Ensure reproducibility and pipeline stability

Required Technical Skills

Strong proficiency in Python

Experience with Firecrawl, Playwright, Scrapy, or similar tools

Strong background in document parsing, text processing, and OCR

Familiarity with LangChain or LlamaIndex

Experience with vector databases

Hands-on experience with HuggingFace, Transformer models, and fine-tuning

Ability to write clean, efficient data pipelines

Experience with Matplotlib, Seaborn, Plotly, or other visualization tools

Comfort using Docker and Git

Nice to Have

Experience serving models or building small APIs (FastAPI)

Exposure to GPU training environments

Background in large-scale unstructured data work

Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate

Comfortable owning full pipelines independently

Detail-oriented and analytical

Strong problem-solving ability

Can work with minimal supervision

Enjoys building structured systems from scratch
