Talent.com
AI Data Engineer
AI Data EngineerPeak Trust Global Real Estate • narela, delhi, in
AI Data Engineer

AI Data Engineer

Peak Trust Global Real Estate • narela, delhi, in
1 day ago
Job description

Location : Remote

Type : Full-time

Experience : 3+ Years

Salary : up to 60K / Month

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical) 1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl , Playwright , Scrapy , or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2 , pymupdf , BeautifulSoup , etc.
  • Prepare data in formats such as JSON, JSONL, or CSV
  • 3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks
  • 4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases ( Qdrant , Pinecone , Weaviate )
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency
  • 5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers , LoRA / QLoRA , or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models
  • 6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using :
  • Pandas
  • Matplotlib
  • Seaborn
  • Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights
  • 7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability
  • Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl , Playwright, Scrapy, or similar tools
  • Strong background in document parsing , text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace , Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib , Seaborn , Plotly , or other visualization tools
  • Comfort using Docker and Git
  • Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)
  • Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch
  • Create a job alert for this search

    Ai Data Engineer • narela, delhi, in

    Related jobs
    Lead AI Engineer

    Lead AI Engineer

    Blend • Ghaziabad, IN
    We are looking for an AI Engineer with hands-on experience designing and deploying scalable AI solutions.In this role, you will be part of a cross-functional team working on cutting-edge projects i...Show more
    Last updated: 10 days ago • Promoted
    Agentic AI Engineer

    Agentic AI Engineer

    BeGig • Delhi, IN
    Agentic AI Engineer – Real Estate & Construction.We are seeking a talented engineer to.You will own the full lifecycle of agent frameworks, including toolchain architecture, prompt engineering, dat...Show more
    Last updated: 12 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Sesheng Company • Delhi, IN
    GenAI Engineer (Semantic Search & RAG Systems).Remote in India (to work in US Time zone).You will be instrumental in designing and deploying a cutting-edge semantic search capability to power our n...Show more
    Last updated: 14 days ago • Promoted
    Agentic AI Engineer

    Agentic AI Engineer

    Intellectt Inc • Delhi, IN
    Agentic AI Engineer (100% Remote).Intellectt is seeking a highly experienced.The ideal candidate will have hands-on expertise in. LLMs, LangChain, LangGraph, RAG.AI applications for real-world use.O...Show more
    Last updated: 8 days ago • Promoted
    AI and Databricks Engineer

    AI and Databricks Engineer

    Quantum Integrators • Delhi, IN
    Please find the detailed Job Description below : .Strong experience developing industry grade LLM and Gen-AI applications using OpenAI, Anthropic (Claude), or other major providers.Hands-on experienc...Show more
    Last updated: 17 hours ago • Promoted • New!
    AI Engineer

    AI Engineer

    MightyBot • Ghaziabad, IN
    Join our team as an AI Engineer, where we're focused on graduating AI from interesting demos to indispensable products.You will build reliable, self-improving systems that empower subject matter ex...Show more
    Last updated: 30+ days ago • Promoted
    AI Data Engineer

    AI Data Engineer

    Peak Trust Global Real Estate • Delhi, IN
    This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.Key Responsibilities (Technical). Build automated data collection workflows using tools suc...Show more
    Last updated: 1 day ago • Promoted
    AI Engineer

    AI Engineer

    Aura Recruitment Solutions • Ghaziabad, IN
    Pay starts from 150,000 INR per Month.We’re hiring on behalf of our client, a fast-growing, AI-first company building cutting-edge AI-native applications that transform complex, real-world data int...Show more
    Last updated: 8 days ago • Promoted
    AI Engineer

    AI Engineer

    Idea Elan India • Delhi, IN
    AI Engineer (2 - 4 Years Experience).Idea Elan LLC is a product based company that provides comprehensive software solutions for. Universities and Institutions worldwide.We are seeking a skilled AI ...Show more
    Last updated: 17 hours ago • Promoted • New!
    AI Engineer

    AI Engineer

    TechWaves Recruitment • Delhi, IN
    Location : Role is remote, however you must be based within a City location in India.Our client who provides world’s leading Agent and Agentic solution platform designed for teams looking to streaml...Show more
    Last updated: 14 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Digivance Solutions • Ghaziabad, IN
    Minimum 8 years (Total & Relevant).We are seeking an experienced leader to.GenAI Expertise & Development : .Design, develop, and deploy GenAI solutions for business use cases (content generation, sum...Show more
    Last updated: 17 hours ago • Promoted • New!
    Data Engineer

    Data Engineer

    Recro • Ghaziabad, IN
    Data Pipeline Engineering : Design, build, and maintain ingestion, transformation, and storage pipelines using Azure Data Factory, Synapse Analytics, and Data Lake. AI Data Enablement : Collaborate wi...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    Digitalzone • Delhi, IN
    As a Data Engineer, you will design, build, and optimize data pipelines and real-time systems that power AI-driven decisioning and analytics. Develop and maintain scalable ETL / ELT pipelines using Py...Show more
    Last updated: 14 days ago • Promoted
    AI Platform Engineer

    AI Platform Engineer

    BayOne Solutions • Delhi, IN
    We are seeking a highly skilled.In this role, you will work on advanced AI systems including.Retrieval-Augmented Generation (RAG). Model Context Protocol (MCP) tools.OpenWebUI or custom-built soluti...Show more
    Last updated: 5 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Live Connections • Delhi, IN
    We’re Hiring | Generative AI Lead / Principal Engineer.Are you passionate about building cutting-edge.Generative AI and LLM solutions. We’re looking for an experienced.Generative AI Lead / Principal...Show more
    Last updated: 6 days ago • Promoted
    AI Engineer

    AI Engineer

    Sutra.AI • Ghaziabad, IN
    Our mission is to help enterprises transform raw data into intelligent, actionable insights through AI, automation, and decision intelligence. The ideal candidate is hands-on, detail-oriented, and t...Show more
    Last updated: 6 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Philodesign Technologies Inc • Delhi, IN
    Retrieval-Augmented Generation) or.Generative AI solution performance.Show more
    Last updated: 8 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Capgemini • Delhi, IN
    HANDS ON EXPERIENCE ON GENAI, MACHINE LEARNING, NLP, LLM, PYTHO.Show more
    Last updated: 17 hours ago • Promoted • New!