Talent.com
AI Engineer - GPT / LangChain / RAG / Data Pipelines
No longer accepting applications
Peak Trust Global Real Estate • Amritsar, Punjab, IN
6 days ago
Job description

Location: Remote

Type: Full-time

Experience: 3+ years

Salary: Up to 70K / month, based on experience

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical)

1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl, Playwright, Scrapy, or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2, PyMuPDF, BeautifulSoup, etc.
  • Prepare data in formats such as JSON, JSONL, or CSV

3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks

4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases (Qdrant, Pinecone, Weaviate)
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency

5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using Hugging Face Transformers, LoRA / QLoRA, or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models

6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using Pandas, Matplotlib, Seaborn, and Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights

7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability

Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl, Playwright, Scrapy, or similar tools
  • Strong background in document parsing, text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with Hugging Face, Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib, Seaborn, Plotly, or other visualization tools
  • Comfort using Docker and Git

Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch