Talent.com
AI Engineer - GPT / LangChain / RAG / Data Pipelines
AI Engineer - GPT / LangChain / RAG / Data PipelinesPeak Trust Global Real Estate • Guwahati, Assam, India
AI Engineer - GPT / LangChain / RAG / Data Pipelines

AI Engineer - GPT / LangChain / RAG / Data Pipelines

Peak Trust Global Real Estate • Guwahati, Assam, India
1 day ago
Job description

Location : Remote

Type : Full-time

Experience : 3+ Years

Salary : up to 70K / Month based on experience

Role Summary We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical) 1. Data Acquisition & Automation Build automated data collection workflows using tools such as Firecrawl , Playwright , Scrapy , or similar frameworks

Extract multi-format documents (PDFs, HTML, text, images)

Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation Clean and process unstructured documents

Apply OCR (Tesseract, PaddleOCR) for scanned files

Convert and structure data using PyPDF2 , pymupdf , BeautifulSoup , etc.

Prepare data in formats such as JSON, JSONL, or CSV

3. Dataset Preparation Segment and structure text for ML training

Create Q&A datasets, summaries, instruction-response pairs, and labeled text

Build high-quality datasets compatible with fine-tuning frameworks

4. Retrieval & Indexing Pipelines Implement document chunking strategies

Generate embeddings and manage vector databases ( Qdrant , Pinecone , Weaviate )

Build retrieval workflows using LangChain or LlamaIndex

Optimize retrieval accuracy and latency

5. Model Training & Fine-Tuning Run fine-tuning jobs using HuggingFace Transformers , LoRA / QLoRA , or similar methods

Monitor training performance and refine datasets

Package and deploy fine-tuned models

6. Data Visualization & Analytics Create analytical charts, trends, and insights using :

Pandas

Matplotlib

Seaborn

Plotly

Build simple internal dashboards or visual summaries for reports

Transform raw datasets into meaningful visual insights

7. Automation & Infrastructure Write modular, maintainable Python scripts

Containerize workflows with Docker

Maintain version control with Git

Ensure reproducibility and pipeline stability

Required Technical Skills Strong proficiency in Python

Experience with Firecrawl , Playwright, Scrapy, or similar tools

Strong background in document parsing , text processing, and OCR

Familiarity with LangChain or LlamaIndex

Experience with vector databases

Hands-on experience with HuggingFace , Transformer models, and fine-tuning

Ability to write clean, efficient data pipelines

Experience with Matplotlib , Seaborn , Plotly , or other visualization tools

Comfort using Docker and Git

Nice to Have Experience serving models or building small APIs (FastAPI)

Exposure to GPU training environments

Background in large-scale unstructured data work

Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate Comfortable owning full pipelines independently

Detail-oriented and analytical

Strong problem-solving ability

Can work with minimal supervision

Enjoys building structured systems from scratch

Create a job alert for this search

Ai Data Engineer • Guwahati, Assam, India

Related jobs
Generative AI Engineer

Generative AI Engineer

Live Connections • guwahati, assam, in
Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field; Master’s or PhD preferred. ML / Data Science with a focus on generative AI, LLMs, or co...Show more
Last updated: 17 days ago • Promoted
Generative AI Engineer

Generative AI Engineer

Turing • guwahati, assam, in
Turing is looking for people with LLM experience to join us in solving business problems for our Fortune 500 customers.You will be a key member of the Turing GenAI delivery organization and part of...Show more
Last updated: 22 days ago • Promoted
Artificial Intelligence Engineer

Artificial Intelligence Engineer

Cozzera • guwahati, assam, in
Job Title : AI Software Developer.Remote (1–2 days / month onsite) [Bangalore / Chennai / Hyderabad / Pune / Noida / Gurugram]. AI / ML solutions for enterprise healthcare applications.The ideal candidat...Show more
Last updated: 8 hours ago • Promoted • New!
Artificial Intelligence Engineer

Artificial Intelligence Engineer

InstaSupply.ca • guwahati, assam, in
InstaSupply (Construction supply + logistics platform).InstaSupply is building a full ecosystem for construction materials : . Customer app with AI-powered search.Supplier app with automated catalog s...Show more
Last updated: 8 hours ago • Promoted • New!
AI Business Analyst

AI Business Analyst

Aventis Solutions • guwahati, assam, in
Aventis Solutions is igniting the AI revolution : .They have just launched The AI Executive podcast, which can be found here : . MMQBvaKxQSuXcZ2MLnv?si=f8fb3c2cd9ee4d12.Now, our tech partner is establis...Show more
Last updated: 8 hours ago • Promoted • New!
AI Analyst

AI Analyst

Aventis Solutions • guwahati, assam, in
Aventis Solutions is igniting the AI revolution : .They have just launched The AI Executive podcast, which can be found here : . Now, our tech partner is establishing a new AI Innovation Hub in Pune, In...Show more
Last updated: 30+ days ago • Promoted
Data Engineer

Data Engineer

Grantify • guwahati, assam, in
Grantify is an innovative education platform that connects students and universities through a transparent admissions and tuition-matching ecosystem. By aligning student budgets and academic aspirat...Show more
Last updated: 1 day ago • Promoted
Oracle Analytics & AI Solutions Architect

Oracle Analytics & AI Solutions Architect

TribolaTech Inc • guwahati, assam, in
Oracle Analytics & AI Solutions Architect.Our client believes in connecting people and business to Insurance in ways that are Innovative, Hyper-Relevant, Compelling and Personal.They bring together...Show more
Last updated: 16 days ago • Promoted
Senior AI Engineer

Senior AI Engineer

Xtnsion.AI • guwahati, assam, in
AI is building the agentic CX layer for modern businesses — AI voice + chat agents that autonomously handle bookings, lead follow-up, support workflows, CRM actions, and more across phone, WhatsApp...Show more
Last updated: 8 hours ago • Promoted • New!
Artificial Intelligence Engineer

Artificial Intelligence Engineer

AllysAI | AI Lab-as-a-Service • guwahati, assam, in
AllysAI is not a typical AI company.We're an AI Lab-as-a-Service that helps enterprises escape "pilot hell" and ship production AI in 60-90 days—not 12+ months. Our clients include Al Futtaim, Merz ...Show more
Last updated: 8 hours ago • Promoted • New!
AI / ML Developer

AI / ML Developer

Cozzera • guwahati, assam, in
Job Title : AI / ML Builder – Salesforce + Generative AI.We are seeking a highly skilled.The ideal candidate will design and implement intelligent, secure, and scalable AI-driven solutions using.Einst...Show more
Last updated: 8 hours ago • Promoted • New!
Generative Ai Engineer

Generative Ai Engineer

Live Connections • Guwahati, Republic Of India, IN
Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field;.ML / Data Science with a focus on generative AI, LLMs, or computer vision.Expertise in...Show more
Last updated: 16 days ago • Promoted
AI / ML Engineer

AI / ML Engineer

Cozzera • guwahati, assam, in
We are looking for an experienced AI / ML Engineer with a strong background in machine learning and deep learning, especially in time-series, sensor, and behavioral data. Strong foundation in ML and d...Show more
Last updated: 8 hours ago • Promoted • New!
Freelance Senior Data Engineer (ADF • Databricks • Vectr • Cribl)

Freelance Senior Data Engineer (ADF • Databricks • Vectr • Cribl)

ThreatXIntel • guwahati, assam, in
ThreatXIntel is a startup cybersecurity company focused on delivering advanced and tailored solutions to protect businesses and organizations from cyber threats. Our expertise spans cloud security, ...Show more
Last updated: 8 hours ago • Promoted • New!
Machine Learning Engineer

Machine Learning Engineer

Recro • guwahati, assam, in
What would you be doing / Expected from this role?.Collaborate with cross-functional teams including data scientists, engineers, and product managers to deliver AI-driven solutions.Drive the archite...Show more
Last updated: 30+ days ago • Promoted
Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

Peak Trust Global Real Estate • Guwahati, Republic Of India, IN
This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.Key Responsibilities (Technical). Build automated data collection workflows using tools suc...Show more
Last updated: 1 day ago • Promoted
Lead Full-Stack + AI Engineer (Founding Team)

Lead Full-Stack + AI Engineer (Founding Team)

Grovio AI • guwahati, assam, in
We’re building an autonomous, multi-agent AI OS that plans, executes, and optimizes marketing across modern digital ecosystems. Think : an AI that acts like a virtual CMO — planning, writing, analyz...Show more
Last updated: 20 hours ago • Promoted • New!
Senior Data Engineer

Senior Data Engineer

Primesoft Inc • guwahati, assam, in
Primesoft Enterprise IT Services Pvt.As a Software Engineer II - Data, you will contribute to the design and development of data systems including pipelines, APIs, analytics, AI and machine learnin...Show more
Last updated: 30+ days ago • Promoted