Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Sixteen Alpha AI • Gurgaon, Haryana, India
21 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources, including sites behind authentication, pages with infinite scroll, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI, NLP, or LLM-based components for contextual data extraction.

Key Responsibilities

Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.

Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.

Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.

Integrate with vector databases, LLM-based data labeling, or automated content enrichment modules.

Optimize crawling logic for speed, reliability, and stealth across distributed environments.

Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).

Strong understanding of browser automation, session management, and anti-detection mechanisms.

Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.

Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).

Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.

Ability to design robust data flows that connect crawling → AI inference → storage / visualization.

Nice to Have

Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models).

Experience with data cleaning, deduplication, and normalization pipelines.

Familiarity with distributed crawling frameworks (Ray, Celery, Kafka).

Prior experience integrating real-time analytics dashboards or monitoring tools.

What We Offer

Competitive freelance pay based on expertise and delivery.

Flexible, async-first remote collaboration.

Opportunity to shape an AI-first data platform from the ground up.

Potential for long-term partnership if the collaboration is successful.
