Talent.com
No longer accepting applications
Ai Data Engineer

Ai Data Engineer

Peak Trust Global Real EstateLudhiāna, Republic Of India, IN
4 days ago
Job description

Location : Remote

Type : Full-time

Experience : 3+ Years

Salary : up to 60K / Month

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical) 1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl , Playwright , Scrapy , or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2 , pymupdf , BeautifulSoup , etc.
  • Prepare data in formats such as JSON, JSONL, or CSV
  • 3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks
  • 4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases ( Qdrant , Pinecone , Weaviate )
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency
  • 5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers , LoRA / QLoRA , or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models
  • 6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using :
  • Pandas
  • Matplotlib
  • Seaborn
  • Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights
  • 7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability
  • Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl , Playwright, Scrapy, or similar tools
  • Strong background in document parsing , text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace , Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib , Seaborn , Plotly , or other visualization tools
  • Comfort using Docker and Git
  • Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)
  • Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch
  • Create a job alert for this search

    Ai Data Engineer • Ludhiāna, Republic Of India, IN

    Related jobs
    • Promoted
    • New!
    AWS Data Engineer

    AWS Data Engineer

    COVET IT INCludhiana, punjab, in
    Please go through the below requirements and let me know your interest and forward your resume along with your contact information to raja@covetitinc. We are seeking a highly skilled AWS Data Engine...Show moreLast updated: 7 hours ago
    • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Sesheng Companyludhiana, punjab, in
    GenAI Engineer (Semantic Search & RAG Systems).Remote in India (to work in US Time zone).You will be instrumental in designing and deploying a cutting-edge semantic search capability to power our n...Show moreLast updated: 17 days ago
    • Promoted
    Senior AI Engineer

    Senior AI Engineer

    MindBrainludhiana, punjab, in
    We are looking for an experienced AI Engineer with strong hands-on expertise in building and deploying autonomous agent systems. The ideal candidate will contribute to designing, coding, and optimiz...Show moreLast updated: 30+ days ago
    • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Qubrid AIludhiana, punjab, in
    Work from Home! Compensation 8-9 LPA.Plus Please do not apply if compensation is not acceptable.Qubrid AI is a leading provider of high-performance GPU cloud infrastructure and advanced AI software...Show moreLast updated: 1 day ago
    • Promoted
    Lead AI / ML Engineer

    Lead AI / ML Engineer

    Optumludhiana, punjab, in
    Lead AI / ML Engineer – Clinical AI systems.Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will ...Show moreLast updated: 14 days ago
    • Promoted
    • New!
    AI Engineer

    AI Engineer

    Asiteludhiana, punjab, in
    We start with a simple idea : the built environment should be smarter, safer and more sustainable.Everything we do is about helping the people behind major construction and infrastructure projects w...Show moreLast updated: 7 hours ago
    • Promoted
    Senior GenAI Engineer

    Senior GenAI Engineer

    Mitra AIludhiana, punjab, in
    AI System Design & Development : .Architect, develop, and deploy large-scale Generative AI, LLM-based systems, including intelligent agents and automation workflows. LLM Integration & Optimization : .In...Show moreLast updated: 1 day ago
    • Promoted
    Senior Full-Stack AI Engineer

    Senior Full-Stack AI Engineer

    BayInfotechludhiana, punjab, in
    In order to proceed further, Please take the test.Senior Full-Stack AI Engineer – AI-Enabled Help Desk (GCP).This test is mandatory as part of this role. Please share a working public URL.Own archit...Show moreLast updated: 1 day ago
    • Promoted
    AI Engineer

    AI Engineer

    MightyBotludhiana, punjab, in
    Join our team as an AI Engineer, where we're focused on graduating AI from interesting demos to indispensable products.You will build reliable, self-improving systems that empower subject matter ex...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    AI Engineer

    AI Engineer

    Tensor Pilotludhiana, punjab, in
    Tensor Pilot, through its flagship product Tensor AI, provides a sophisticated desktop-based AI assistant for interacting with local files such as code, documents, images, and videos.Tensor AI emph...Show moreLast updated: 3 hours ago
    • Promoted
    AI Software Engineer

    AI Software Engineer

    Taskify AIludhiana, punjab, in
    This role is ideal for professionals passionate about artificial intelligence, machine learning, and software engineering who want to make a tangible impact on real-world applications.As an AI Soft...Show moreLast updated: 2 days ago
    • Promoted
    AI Data Engineer - 17852

    AI Data Engineer - 17852

    Turingludhiana, punjab, in
    We’re looking for experienced AI data engineers skilled in Python to collaborate with one of the world’s top Large Language Model (LLM) companies. Your work will directly help improve how AI models ...Show moreLast updated: 16 days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    TransPerfectludhiana, punjab, in
    We are seeking a Senior AI / ML Engineer to join our client’s AI team and contribute to the development of cutting-edge intelligent systems. In this role, you’ll be responsible for designing, training...Show moreLast updated: 30+ days ago
    • Promoted
    Full Stack Engineer AI (4-6 YOE)

    Full Stack Engineer AI (4-6 YOE)

    Redica Systemsludhiana, punjab, in
    Redica Systems is a SaaS start-up serving more than 200 customers within the life science sector, with a specific focus on Pharmaceuticals and MedTech. Our workforce is distributed globally, with he...Show moreLast updated: 3 days ago
    • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Turingludhiana, punjab, in
    Turing is looking for people with LLM experience to join us in solving business problems for our Fortune 500 customers.You will be a key member of the Turing GenAI delivery organization and part of...Show moreLast updated: 14 days ago
    • Promoted
    ML / Gen AI Engineer

    ML / Gen AI Engineer

    Intuition IT – Intuitive Technology Recruitmentludhiana, punjab, in
    Design, deploy, and manage scalable ML and GenAI workloads using AWS services including SageMaker Studio and Bedrock.Implement and maintain infrastructure using AWS Lambda, EKS, ECS on Fargate, and...Show moreLast updated: 1 day ago
    • Promoted
    Senior AI Engineer

    Senior AI Engineer

    eTeamludhiana, punjab, in
    We’re building AI products that are transforming the.Our mission is to revolutionize how organizations discover, develop, and engage talent through intelligent, human-centered AI solutions.We combi...Show moreLast updated: 3 days ago
    • Promoted
    AI Engineer

    AI Engineer

    TechKareerludhiana, punjab, in
    Mumbai / Bengaluru / Gurgaon (Hybrid : 3 days / week in office).Remote option for exceptional candidates.We’re building production-grade AI workflows and agentic applications that power real user expe...Show moreLast updated: 30+ days ago