Location: Remote
Type: Full-time
Experience: 3+ years
Salary: Up to 60K/month
Role Summary
We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.
This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.
Key Responsibilities (Technical)

1. Data Acquisition & Automation
- Build automated data collection workflows using tools such as Firecrawl, Playwright, Scrapy, or similar frameworks (a minimal sketch follows this list)
- Extract multi-format documents (PDFs, HTML, text, images)
- Handle large-scale crawling, rate limits, error handling, and scheduling
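To give a concrete flavor of this kind of work, the sketch below fetches rendered page text with Playwright's sync API. The target URL is a hypothetical placeholder, and a real crawler would add retries, rate limiting, and scheduling around this function.

```python
# Minimal Playwright collection step (sync API). The URL below is a
# hypothetical placeholder; a production crawler would wrap this in
# retry, rate-limiting, and scheduling logic.
from playwright.sync_api import sync_playwright

def fetch_page_text(url: str) -> str:
    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        page = browser.new_page()
        page.goto(url, wait_until="networkidle")  # wait for JS-rendered content
        text = page.inner_text("body")
        browser.close()
        return text

if __name__ == "__main__":
    print(fetch_page_text("https://example.com")[:500])
```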
2. Document Processing & Transformation
- Clean and process unstructured documents
- Apply OCR (Tesseract, PaddleOCR) to scanned files
- Convert and structure data using PyPDF2, pymupdf, BeautifulSoup, etc.
- Prepare data in formats such as JSON, JSONL, or CSV (a minimal sketch follows this list)
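As one illustration of this step, the sketch below extracts text per page with pymupdf, falls back to Tesseract OCR for scanned pages, and writes JSONL. The file paths are hypothetical, and it assumes pymupdf, pytesseract, and Pillow are installed.

```python
# PDF-to-JSONL conversion with an OCR fallback for scanned pages.
# "input.pdf" and "pages.jsonl" are hypothetical placeholder paths.
import json
import fitz  # pymupdf
import pytesseract
from PIL import Image

with fitz.open("input.pdf") as doc, open("pages.jsonl", "w", encoding="utf-8") as out:
    for i, page in enumerate(doc):
        text = page.get_text().strip()
        if not text:  # no embedded text: likely a scan, so render and OCR it
            pix = page.get_pixmap(dpi=300)
            img = Image.frombytes("RGB", (pix.width, pix.height), pix.samples)
            text = pytesseract.image_to_string(img)
        out.write(json.dumps({"page": i + 1, "text": text}) + "\n")
```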
3. Dataset Preparation
- Segment and structure text for ML training
- Create Q&A datasets, summaries, instruction-response pairs, and labeled text
- Build high-quality datasets compatible with fine-tuning frameworks (see the sketch after this list)
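A minimal example of the expected output format: instruction-response pairs serialized as JSONL, one record per line, which most fine-tuning frameworks consume directly. The records here are made up purely for illustration.

```python
# Write instruction-response pairs as JSONL (one JSON object per line).
# The example records below are hypothetical.
import json

pairs = [
    {"instruction": "Summarize the refund policy.",
     "response": "Refunds are issued within 14 days of purchase."},
    {"instruction": "What file formats are supported?",
     "response": "PDF, HTML, plain text, and common image formats."},
]

with open("train.jsonl", "w", encoding="utf-8") as f:
    for pair in pairs:
        f.write(json.dumps(pair, ensure_ascii=False) + "\n")
```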
4. Retrieval & Indexing Pipelines
- Implement document chunking strategies
- Generate embeddings and manage vector databases (Qdrant, Pinecone, Weaviate)
- Build retrieval workflows using LangChain or LlamaIndex
- Optimize retrieval accuracy and latency (a minimal sketch follows this list)
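The sketch below strings these pieces together under stated assumptions: LangChain's recursive splitter for chunking, a sentence-transformers model for embeddings, and an in-memory Qdrant collection for search. The model name, file path, and query are placeholders.

```python
# Chunk -> embed -> index -> search, using an in-memory Qdrant instance.
# Assumes langchain-text-splitters, sentence-transformers, and qdrant-client.
from langchain_text_splitters import RecursiveCharacterTextSplitter
from sentence_transformers import SentenceTransformer
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, PointStruct, VectorParams

splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
chunks = splitter.split_text(open("document.txt", encoding="utf-8").read())

model = SentenceTransformer("all-MiniLM-L6-v2")  # 384-dim embeddings
vectors = model.encode(chunks)

client = QdrantClient(":memory:")  # swap for a real server in production
client.create_collection("docs", vectors_config=VectorParams(size=384, distance=Distance.COSINE))
client.upsert("docs", points=[
    PointStruct(id=i, vector=v.tolist(), payload={"text": c})
    for i, (v, c) in enumerate(zip(vectors, chunks))
])

hits = client.search("docs", query_vector=model.encode("What is the refund policy?").tolist(), limit=3)
for hit in hits:
    print(round(hit.score, 3), hit.payload["text"][:80])
```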
5. Model Training & Fine-Tuning
- Run fine-tuning jobs using HuggingFace Transformers, LoRA/QLoRA, or similar methods
- Monitor training performance and refine datasets
- Package and deploy fine-tuned models (see the sketch after this list)
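As an illustrative setup, the sketch below attaches LoRA adapters to a small causal LM with HuggingFace Transformers and PEFT. The base model and target module names are assumptions that would change with the actual architecture being fine-tuned.

```python
# Minimal LoRA setup with Transformers + PEFT. The base model and
# target_modules below are placeholder assumptions.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")  # placeholder base model

lora = LoraConfig(
    r=8,                                  # rank of the low-rank update
    lora_alpha=16,                        # scaling factor for the update
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # typically well under 1% of weights
```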
6. Data Visualization & Analytics
- Create analytical charts, trends, and insights using Pandas, Matplotlib, Seaborn, and Plotly
- Build simple internal dashboards or visual summaries for reports
- Transform raw datasets into meaningful visual insights (a minimal sketch follows this list)
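For example, a simple reporting chart might be produced as below with Pandas and Matplotlib; the CSV path and column names are hypothetical.

```python
# Turn a raw CSV into a weekly trend chart. "metrics.csv" and its
# column names are hypothetical placeholders.
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("metrics.csv", parse_dates=["date"])
weekly = df.set_index("date")["documents_processed"].resample("W").sum()

fig, ax = plt.subplots(figsize=(8, 4))
weekly.plot(ax=ax, marker="o")
ax.set_title("Documents Processed per Week")
ax.set_xlabel("Week")
ax.set_ylabel("Documents")
fig.tight_layout()
fig.savefig("weekly_trend.png", dpi=150)
```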
7. Automation & Infrastructure
- Write modular, maintainable Python scripts
- Containerize workflows with Docker
- Maintain version control with Git
- Ensure reproducibility and pipeline stability

Required Technical Skills
- Strong proficiency in Python
- Experience with Firecrawl, Playwright, Scrapy, or similar tools
- Strong background in document parsing, text processing, and OCR
- Familiarity with LangChain or LlamaIndex
- Experience with vector databases
- Hands-on experience with HuggingFace, Transformer models, and fine-tuning
- Ability to write clean, efficient data pipelines
- Experience with Matplotlib, Seaborn, Plotly, or other visualization tools
- Comfort using Docker and Git

Nice to Have
- Experience serving models or building small APIs (FastAPI)
- Exposure to GPU training environments
- Background in large-scale unstructured data work
- Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate
- Comfortable owning full pipelines independently
- Detail-oriented and analytical
- Strong problem-solving ability
- Able to work with minimal supervision
- Enjoys building structured systems from scratch