Location: Remote
Type: Full-time
Experience: 3+ years
Salary: up to 70K/month, based on experience

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization. This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical)

1. Data Acquisition & Automation
- Build automated data collection workflows using tools such as Firecrawl, Playwright, Scrapy, or similar frameworks
- Extract multi-format documents (PDFs, HTML, text, images)
- Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation
- Clean and process unstructured documents
- Apply OCR (Tesseract, PaddleOCR) to scanned files
- Convert and structure data using PyPDF2, PyMuPDF, BeautifulSoup, etc.
- Prepare data in formats such as JSON, JSONL, or CSV

3. Dataset Preparation
- Segment and structure text for ML training
- Create Q&A datasets, summaries, instruction-response pairs, and labeled text
- Build high-quality datasets compatible with fine-tuning frameworks

4. Retrieval & Indexing Pipelines
- Implement document chunking strategies
- Generate embeddings and manage vector databases (Qdrant, Pinecone, Weaviate)
- Build retrieval workflows using LangChain or LlamaIndex
- Optimize retrieval accuracy and latency

5. Model Training & Fine-Tuning
- Run fine-tuning jobs using Hugging Face Transformers, LoRA/QLoRA, or similar methods
- Monitor training performance and refine datasets
- Package and deploy fine-tuned models

6. Data Visualization & Analytics
- Create analytical charts, trends, and insights using Pandas, Matplotlib, Seaborn, and Plotly
- Build simple internal dashboards or visual summaries for reports
- Transform raw datasets into meaningful visual insights

7. Automation & Infrastructure
- Write modular, maintainable Python scripts
- Containerize workflows with Docker
- Maintain version control with Git
- Ensure reproducibility and pipeline stability

Required Technical Skills
- Strong proficiency in Python
- Experience with Firecrawl, Playwright, Scrapy, or similar tools
- Strong background in document parsing, text processing, and OCR
- Familiarity with LangChain or LlamaIndex
- Experience with vector databases
- Hands-on experience with Hugging Face, Transformer models, and fine-tuning
- Ability to write clean, efficient data pipelines
- Experience with Matplotlib, Seaborn, Plotly, or other visualization tools
- Comfort using Docker and Git

Nice to Have
- Experience serving models or building small APIs (FastAPI)
- Exposure to GPU training environments
- Background in large-scale unstructured data work
- Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate
- Comfortable owning full pipelines independently
- Detail-oriented and analytical
- Strong problem-solving ability
- Can work with minimal supervision
- Enjoys building structured systems from scratch
AI Data Engineer • Sangli, Maharashtra, India