Talent.com
AI Data Engineer
AI Data EngineerPeak Trust Global Real Estate • baddi, himachal pradesh, in
AI Data Engineer

AI Data Engineer

Peak Trust Global Real Estate • baddi, himachal pradesh, in
1 day ago
Job description

Location : Remote

Type : Full-time

Experience : 3+ Years

Salary : up to 60K / Month

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical) 1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl , Playwright , Scrapy , or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2 , pymupdf , BeautifulSoup , etc.
  • Prepare data in formats such as JSON, JSONL, or CSV
  • 3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks
  • 4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases ( Qdrant , Pinecone , Weaviate )
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency
  • 5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers , LoRA / QLoRA , or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models
  • 6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using :
  • Pandas
  • Matplotlib
  • Seaborn
  • Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights
  • 7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability
  • Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl , Playwright, Scrapy, or similar tools
  • Strong background in document parsing , text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace , Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib , Seaborn , Plotly , or other visualization tools
  • Comfort using Docker and Git
  • Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)
  • Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch
  • Create a job alert for this search

    Ai Data Engineer • baddi, himachal pradesh, in

    Related jobs
    Engineer-AI

    Engineer-AI

    Sakon • baddi, himachal pradesh, in
    Role : AI Engineer – Agentic Systems & LLM Applications.We’re looking for a well-rounded, forward-thinking AI Engineer who can design, build, and deploy intelligent systems powered by LLMs, retrieva...Show more
    Last updated: 14 days ago • Promoted
    AI Data Engineer - 17852

    AI Data Engineer - 17852

    Turing • Chandigarh, India, India
    We’re looking for experienced AI data engineers skilled in Python to collaborate with one of the world’s top Large Language Model (LLM) companies. Your work will directly help improve how AI models ...Show more
    Last updated: 4 days ago • Promoted
    AI Engineer

    AI Engineer

    Aura Recruitment Solutions • panchkula, haryana, in
    Pay starts from 150,000 INR per Month.We’re hiring on behalf of our client, a fast-growing, AI-first company building cutting-edge AI-native applications that transform complex, real-world data int...Show more
    Last updated: 8 days ago • Promoted
    Agentic AI Engineer

    Agentic AI Engineer

    Intellectt Inc • baddi, himachal pradesh, in
    Agentic AI Engineer (100% Remote).Intellectt is seeking a highly experienced.The ideal candidate will have hands-on expertise in. LLMs, LangChain, LangGraph, RAG.AI applications for real-world use.O...Show more
    Last updated: 8 days ago • Promoted
    Lead AI Engineer

    Lead AI Engineer

    Genisys Group • baddi, himachal pradesh, in
    As an AI / ML Engineer at Genisys Group, you will be instrumental in developing and.AI solutions, with a strong focus on Large Language Models. LLMs) and Retrieval-Augmented Generation (RAG) technique...Show more
    Last updated: 14 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Turing • panchkula, haryana, in
    Turing is looking for people with LLM experience to join us in solving business problems for our Fortune 500 customers.You will be a key member of the Turing GenAI delivery organization and part of...Show more
    Last updated: 10 days ago • Promoted
    Gen AI Engineer / Senior Gen AI Engineer

    Gen AI Engineer / Senior Gen AI Engineer

    Piramal Finance • panchkula, haryana, in
    Collaborate with cross-functional teams to integrate generative models into data products.Develop, implement, and optimize generative models for enhanced data-driven solutions.Preprocess, clean, an...Show more
    Last updated: 14 days ago • Promoted
    Lead AI Engineer

    Lead AI Engineer

    Blend • panchkula, haryana, in
    We are looking for an AI Engineer with hands-on experience designing and deploying scalable AI solutions.In this role, you will be part of a cross-functional team working on cutting-edge projects i...Show more
    Last updated: 9 days ago • Promoted
    AI and Databricks

    AI and Databricks

    Quantum Integrators • panchkula, haryana, in
    Role : AI and Databricks Engineer.Strong experience developing industry grade LLM and Gen-AI applications using OpenAI, Anthropic (Claude), or other major providers. Spark, Delta Lake, MLflow, Databr...Show more
    Last updated: 4 hours ago • Promoted • New!
    AI Engineer

    AI Engineer

    Idea Elan India • panchkula, haryana, in
    AI Engineer (2 - 4 Years Experience).Idea Elan LLC is a product based company that provides comprehensive software solutions for. Universities and Institutions worldwide.We are seeking a skilled AI ...Show more
    Last updated: 4 hours ago • Promoted • New!
    Generative AI Engineer

    Generative AI Engineer

    Digivance Solutions • panchkula, haryana, in
    Minimum 8 years (Total & Relevant).We are seeking an experienced leader to.GenAI Expertise & Development : .Design, develop, and deploy GenAI solutions for business use cases (content generation, sum...Show more
    Last updated: 5 hours ago • Promoted • New!
    AI / ML Engineer (Data Science & Data Engineering – LLMs / GenAI)

    AI / ML Engineer (Data Science & Data Engineering – LLMs / GenAI)

    ReSun Technologies Inc • baddi, himachal pradesh, in
    Data Science, Data Engineering, and Generative AI / LLM development.This role will work on end-to-end AI solutions — from building datasets and pipelines, to developing LLM applications, to deploying...Show more
    Last updated: 4 hours ago • Promoted • New!
    Full-stack AI Engineer - Founding Engineer

    Full-stack AI Engineer - Founding Engineer

    Taglynk • panchkula, haryana, in
    Build full-stack features end-to-end for our AI hiring platform.Work with LLMs, agentic systems, and voice / speech models to create magical user experiences. Architect scalable systems on AWS and own...Show more
    Last updated: 5 hours ago • Promoted • New!
    Azure Generative AI Engineer

    Azure Generative AI Engineer

    LTIMindtree • panchkula, haryana, in
    Required Skills & Qualifications : .Strong programming experience in Python.RAG (Retrieval-Augmented Generation).Familiarity with ML / DL frameworks such as. Solid knowledge of machine learning algorith...Show more
    Last updated: 5 hours ago • Promoted • New!
    AI Data Engineer

    AI Data Engineer

    Peak Trust Global Real Estate • panchkula, haryana, in
    This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.Key Responsibilities (Technical). Build automated data collection workflows using tools suc...Show more
    Last updated: 1 day ago • Promoted
    AI and Databricks Engineer

    AI and Databricks Engineer

    Quantum Integrators • baddi, himachal pradesh, in
    Please find the detailed Job Description below : .Strong experience developing industry grade LLM and Gen-AI applications using OpenAI, Anthropic (Claude), or other major providers.Hands-on experienc...Show more
    Last updated: 1 hour ago • Promoted • New!
    AI / ML Engineer

    AI / ML Engineer

    Reqpedia • panchkula, haryana, in
    Proficiency in enterprise-approved AI tools as part of their day-to-day responsibilities.This includes, but is not limited to : . Consistent Use : Maintain a minimum of 90% weekly usage of AI tools suc...Show more
    Last updated: 4 hours ago • Promoted • New!
    AWS Data Engineer

    AWS Data Engineer

    Atyeti Inc • baddi, himachal pradesh, in
    Looking for Data Engineer who will be responsible for design, build and maintenance of data pipelines running on Airflow, Spark on the AWS Cloud platform. Build and maintain all facets of Data Pipel...Show more
    Last updated: 4 hours ago • Promoted • New!