Talent.com
AI Engineer - GPT / LangChain / RAG / Data Pipelines
AI Engineer - GPT / LangChain / RAG / Data PipelinesPeak Trust Global Real Estate • secunderabad, telangana, in
AI Engineer - GPT / LangChain / RAG / Data Pipelines

AI Engineer - GPT / LangChain / RAG / Data Pipelines

Peak Trust Global Real Estate • secunderabad, telangana, in
11 hours ago
Job description

Location : Remote

Type : Full-time

Experience : 3+ Years

Salary : up to 70K / Month based on experience

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization.

This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical) 1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl , Playwright , Scrapy , or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2 , pymupdf , BeautifulSoup , etc.
  • Prepare data in formats such as JSON, JSONL, or CSV
  • 3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks
  • 4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases ( Qdrant , Pinecone , Weaviate )
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency
  • 5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers , LoRA / QLoRA , or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models
  • 6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using :
  • Pandas
  • Matplotlib
  • Seaborn
  • Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights
  • 7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability
  • Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl , Playwright, Scrapy, or similar tools
  • Strong background in document parsing , text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace , Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib , Seaborn , Plotly , or other visualization tools
  • Comfort using Docker and Git
  • Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)
  • Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch
  • Create a job alert for this search

    Ai Data Engineer • secunderabad, telangana, in

    Related jobs
    AI Engineer (Data Pipelines & RAG)

    AI Engineer (Data Pipelines & RAG)

    BeGig • Hyderabad, IN
    Job Role- AI Engineer (Data Pipelines & RAG).Work Mode- Remote(6 days working).We are looking for a hands-on AI / Data Engineer (4–7 years) to build and scale data pipelines powering GenAI and agenti...Show more
    Last updated: 11 days ago • Promoted
    AI Engineer

    AI Engineer

    Intellectt Inc • Hyderabad, Telangana, India
    The ideal candidate will design, build, and deploy intelligent, scalable AI applications that leverage.OpenAI, Anthropic, Gemini, or Llama APIs. Retrieval-Augmented Generation (RAG).FAISS, Pinecone)...Show more
    Last updated: 15 days ago • Promoted
    Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

    Ai Engineer - Gpt / Langchain / Rag / Data Pipelines

    Peak Trust Global Real Estate • Secunderabad, Republic Of India, IN
    This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.Key Responsibilities (Technical). Build automated data collection workflows using tools suc...Show more
    Last updated: 6 hours ago • Promoted • New!
    Senior AI / ML Engineer

    Senior AI / ML Engineer

    Luxoft • Hyderabad, IN
    Our client, one of the leading Agriculture Companies, is modernising their landscape and adopting AI and innovations in their process. We are seeking a highly skilled and innovative Python / AI Engine...Show more
    Last updated: 21 days ago • Promoted
    AI Solutions Engineer

    AI Solutions Engineer

    NeerInfo Solutions • Hyderabad, Republic Of India, IN
    Good To Have Skills : Experience with Python (Programming Language),.Maintain and update existing documentation, including technical guides and manuals, and FAQs / onboarding material, to support user...Show more
    Last updated: 9 days ago • Promoted
    AI Engineer (Gen AI, Agentic AI)

    AI Engineer (Gen AI, Agentic AI)

    Chetty Technologies • Hyderabad, IN
    Chetty Technologies is a specialized technology company with expertise in delivering advanced consulting services to industries including Banking & Finance, Telecom, Healthcare, and Hospitality.Wit...Show more
    Last updated: 1 day ago • Promoted
    Databricks Gen AI Engineer

    Databricks Gen AI Engineer

    Syren • Hyderabad, Telangana, India
    Model Serving, Vector Search, and embedding workflows.Clustering, Unity Catalog, Delta Lake).OpenAI / Azure OpenAI), and GenAI app patterns (RAG / Agents). Proficiency in SQL, Spark performance tuning...Show more
    Last updated: 7 days ago • Promoted
    Agentic AI Engineer

    Agentic AI Engineer

    Intellectt Inc • Hyderabad, Telangana, India
    Minimum 5+ years in Data Science & 5+ years in Agentic AI).We are seeking an accomplished.LLMs, RAG, and multi-agent architectures. The ideal candidate will have deep expertise in.LangGraph, LangCha...Show more
    Last updated: 24 days ago • Promoted
    AI-Enhanced Data Pipeline Engineer

    AI-Enhanced Data Pipeline Engineer

    Metasys Technologies • Hyderabad, Republic Of India, IN
    Hyderabad, Bangalore, NCR, Pune.Hands-on with Kafka / Kinesis for real-time data processing.Strong SQL skills with experience in database design and data modeling. Experience with Hadoop, Hive, Spark ...Show more
    Last updated: 14 hours ago • Promoted • New!
    AI Engineer - GPT / LangChain / RAG / Data Pipelines

    AI Engineer - GPT / LangChain / RAG / Data Pipelines

    Peak Trust Global Real Estate • Secunderabad, Telangana, India
    Location : Remote Type : Full-time Experience : 3+ Years Salary : up to 70K / Month based on experience Role Summary We are looking for a hands-on AI Data Engineer who can independently manage en...Show more
    Last updated: 8 hours ago • Promoted • New!
    Big Data Engineer with Gen AI

    Big Data Engineer with Gen AI

    Metasys Technologies • Hyderabad, Telangana, India
    Hyderabad, Bangalore, NCR, Pune.Hands-on with Kafka / Kinesis for real-time data processing.Strong SQL skills with experience in database design and data modeling. Experience with Hadoop, Hive, Spark ...Show more
    Last updated: 13 hours ago • Promoted • New!
    Sr Python Gen AI engineers

    Sr Python Gen AI engineers

    Adecco • Hyderabad, Telangana, India
    Develop generative AI solutions on AWS, focusing on LLMs, prompt engineering, RAG, and agentic AI using n8n and Python libraries like LangChain. Anthropic Claude) on AWS Bedrock with boto3.Fine-tune...Show more
    Last updated: 14 days ago • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Intellectt Inc • Hyderabad, India
    Our client is hiring an Agentic AI Engineer to design and deploy autonomous AI systems for healthcare-focused enterprise applications. The role involves building agentic workflows using LangChain / La...Show more
    Last updated: 3 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    NeerInfo Solutions • Hyderabad, India
    Good To Have Skills : Experience with Python (Programming Language),.Maintain and update existing documentation, including technical guides and manuals, and FAQs / onboarding material, to support user...Show more
    Last updated: 8 days ago • Promoted
    AI Lead Engineer

    AI Lead Engineer

    TekGenio • Hyderabad, IN
    Experience : 5+ Years | Type : Full-Time | Location : WFH.Minimum of 5+ years of experience in AI / ML engineering, data science, or algorithm development. Strong experience in machine learning, deep lea...Show more
    Last updated: 3 days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Live Connections • Hyderabad, IN
    Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field; Master’s or PhD preferred. ML / Data Science with a focus on generative AI, LLMs, or co...Show more
    Last updated: 16 days ago • Promoted
    AI Engineer

    AI Engineer

    NyxaLabs • Hyderabad, IN
    We're seeking an exceptional AI Engineer with deep expertise in TensorFlow model training to design and build next-generation AI systems. This role focuses on developing sophisticated machine learni...Show more
    Last updated: 2 days ago • Promoted
    Generative AI, Python

    Generative AI, Python

    Tata Consultancy Services • Hyderabad, Telangana, India
    LOCATION : Chennai / Hyderabad / Kolkata.Notice Period- • • • 30 DAYS • • •.Experience in automation preferably in DevOps or workflow automation using platforms like ServiceNow. Experience with AI / GenAI to...Show more
    Last updated: 14 days ago • Promoted