AI Engineer - GPT / LangChain / RAG / Data Pipelines
Peak Trust Global Real Estate • Hyderabad, IN
6 hours ago
Job description

Location: Remote

Type: Full-time

Experience: 3+ years

Salary: Up to 70K / month, based on experience

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical)

1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl, Playwright, Scrapy, or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2, pymupdf, BeautifulSoup, etc.
  • Prepare data in formats such as JSON, JSONL, or CSV

3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks

4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases (Qdrant, Pinecone, Weaviate)
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency

5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers, LoRA / QLoRA, or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models

6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using Pandas, Matplotlib, Seaborn, and Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights

7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability

Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl, Playwright, Scrapy, or similar tools
  • Strong background in document parsing, text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace, Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib, Seaborn, Plotly, or other visualization tools
  • Comfort using Docker and Git

Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch