Talent.com
Ai Data Engineer
Ai Data EngineerPeak Trust Global Real Estate • Guwahati, Republic Of India, IN
No longer accepting applications
Ai Data Engineer

Ai Data Engineer

Peak Trust Global Real Estate • Guwahati, Republic Of India, IN
8 days ago
Job description

Location : Remote

Type : Full-time

Experience : 3+ Years

Salary : up to 60K / Month

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical) 1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl , Playwright , Scrapy , or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2 , pymupdf , BeautifulSoup , etc.
  • Prepare data in formats such as JSON, JSONL, or CSV
  • 3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks
  • 4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases ( Qdrant , Pinecone , Weaviate )
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency
  • 5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers , LoRA / QLoRA , or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models
  • 6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using :
  • Pandas
  • Matplotlib
  • Seaborn
  • Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights
  • 7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability
  • Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl , Playwright, Scrapy, or similar tools
  • Strong background in document parsing , text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace , Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib , Seaborn , Plotly , or other visualization tools
  • Comfort using Docker and Git
  • Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)
  • Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch
  • Create a job alert for this search

    Ai Data Engineer • Guwahati, Republic Of India, IN

    Related jobs
    Applied AI Engineer

    Applied AI Engineer

    Strategic Talent Partner • guwahati, assam, in
    Design and deploy advanced multi-agent pipelines for credit analysis.Optimize inference and prompt chains using frameworks like DSPy, GEPA, and LangChain. Implement reasoning techniques (CoT, ToT, G...Show more
    Last updated: 21 days ago • Promoted
    Edge AI Engineer - Healthcare Applications

    Edge AI Engineer - Healthcare Applications

    Cancard Inc. • guwahati, assam, in
    Position : Edge AI Engineer - Cancard Inc.Cancard Inc and Advaa Health are seeking an experienced, engaged, and hands-on healthcare industry Edge Computing / IOT / CV expert for the role of Edge AI ...Show more
    Last updated: 3 days ago • Promoted
    AI Platform Engineer

    AI Platform Engineer

    Antriksh Cloud Private Limited • guwahati, assam, in
    Antriksh Cloud Private Limited is a leader in sustainable AI infrastructure, specializing in eco-efficient GPU data centers powered by hydroelectric energy. We enable global AI innovation with scala...Show more
    Last updated: 3 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Turing • guwahati, assam, in
    Turing is looking for people with LLM experience to join us in solving business problems for our Fortune 500 customers.You will be a key member of the Turing GenAI delivery organization and part of...Show more
    Last updated: 18 days ago • Promoted
    AIML Engineer

    AIML Engineer

    Tata Consultancy Services • guwahati, assam, in
    Competencies (Technical / Behavioral Competency).AI / ML, Azure ML Studio, AI / ML On Databricks, Python & CICD Devops.Supervised and unsupervised ML and Predictive Analytics using Python • Feature gener...Show more
    Last updated: 21 days ago • Promoted
    AI Lead Engineer

    AI Lead Engineer

    TekGenio • guwahati, assam, in
    Experience : 5+ Years | Type : Full-Time | Location : WFH.Minimum of 5+ years of experience in AI / ML engineering, data science, or algorithm development. Strong experience in machine learning, deep lea...Show more
    Last updated: 10 hours ago • Promoted • New!
    Generative AI Engineer

    Generative AI Engineer

    Philodesign Technologies Inc • guwahati, assam, in
    Gen AI Engineer | Remote | 4+ Years Experience | Budget : 1 LPM.We are looking for an experienced.GenAI solutions for global clients. If you have a solid background in AI / ML engineering and have deli...Show more
    Last updated: 15 days ago • Promoted
    Machine Learning Engineer - Agentic AI & AIOps

    Machine Learning Engineer - Agentic AI & AIOps

    Platform9 • guwahati, assam, in
    Platform9 is a leader in simplifying enterprise private clouds.Our flagship product, Private Cloud Director, turns existing infrastructure into a full-featured private cloud.Enterprise IT teams can...Show more
    Last updated: 19 days ago • Promoted
    Data Engineer

    Data Engineer

    System Soft Technologies • guwahati, assam, in
    Location : Remote (3–4-hour time zone overlaps with EST if off shore).Experience with next flow is required, as the consultant will make targeted enhancements to existing workflows and pipelines.Whi...Show more
    Last updated: 1 day ago • Promoted
    Senior AI Engineer

    Senior AI Engineer

    BeGig • guwahati, assam, in
    We are looking for an experienced.The ideal candidate should be proficient in FastAPI, Python, LangChain, and modern AI application design. You will be responsible for designing, developing, and dep...Show more
    Last updated: 1 day ago • Promoted
    AI Engineer

    AI Engineer

    TechKareer • guwahati, assam, in
    Mumbai / Bengaluru / Gurgaon (Hybrid : 3 days / week in office).Remote option for exceptional candidates.We’re building production-grade AI workflows and agentic applications that power real user expe...Show more
    Last updated: 30+ days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    telerapp • guwahati, assam, in
    As a Gen AI Engineer at Telerapp, you will play a crucial role in designing, developing, and implementing cutting-edge generative artificial intelligence models and systems.Your expertise will cont...Show more
    Last updated: 1 day ago • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Edstem Technologies • guwahati, assam, in
    The ideal candidate will have hands-on expertise across the full ML lifecycle—from data exploration and feature engineering to model training, optimization, and production deployment.You will work ...Show more
    Last updated: 21 days ago • Promoted
    AI Engineer

    AI Engineer

    Asite • guwahati, assam, in
    We start with a simple idea : the built environment should be smarter, safer and more sustainable.Everything we do is about helping the people behind major construction and infrastructure projects w...Show more
    Last updated: 4 days ago • Promoted
    AI Software Engineer

    AI Software Engineer

    Taskify AI • guwahati, assam, in
    This role is ideal for professionals passionate about artificial intelligence, machine learning, and software engineering who want to make a tangible impact on real-world applications.As an AI Soft...Show more
    Last updated: 6 days ago • Promoted
    AI Engineer

    AI Engineer

    MightyBot • guwahati, assam, in
    Join our team as an AI Engineer, where we're focused on graduating AI from interesting demos to indispensable products.You will build reliable, self-improving systems that empower subject matter ex...Show more
    Last updated: 30+ days ago • Promoted
    AI / ML Engineer

    AI / ML Engineer

    TransPerfect • guwahati, assam, in
    We are seeking a Senior AI / ML Engineer to join our client’s AI team and contribute to the development of cutting-edge intelligent systems. In this role, you’ll be responsible for designing, training...Show more
    Last updated: 30+ days ago • Promoted
    PySpark Data Engineer

    PySpark Data Engineer

    EXTRAGIG • guwahati, assam, in
    Contract Assistant – Data Engineer Support (Remote, EST Hours).PySpark Data Engineer with daily activities.This is a remote contract role. Execute creative software and data solutions, including des...Show more
    Last updated: 30+ days ago • Promoted