Talent.com
AI Data Engineer

Peak Trust Global Real Estate • Erode, IN
14 hours ago
Job description

Location: Remote

Type: Full-time

Experience: 3+ years

Salary: up to 60K / month

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical)

1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl, Playwright, Scrapy, or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Manage large-scale crawling, including rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2, pymupdf, BeautifulSoup, etc.
  • Prepare data in formats such as JSON, JSONL, or CSV

3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks

4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases (Qdrant, Pinecone, Weaviate)
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency

5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers, LoRA / QLoRA, or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models

6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using Pandas, Matplotlib, Seaborn, and Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights

7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability

Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl, Playwright, Scrapy, or similar tools
  • Strong background in document parsing, text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace, Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib, Seaborn, Plotly, or other visualization tools
  • Comfort using Docker and Git

Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch