Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)Sixteen Alpha AI • New Delhi, Delhi, India
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AI • New Delhi, Delhi, India
14 days ago
Job description

About the Project We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.

Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.

Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.

Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.

Optimize crawling logic for speed, reliability, and stealth across distributed environments.

Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).

Strong understanding of browser automation, session management, and anti-detection mechanisms .

Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.

Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).

Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.

Ability to design robust data flows that connect crawling → AI inference → storage / visualization.

Nice to Have Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .

Experience with data cleaning, deduplication, and normalization pipelines .

Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .

Prior experience integrating real-time analytics dashboards or monitoring tools.

What We Offer Competitive freelance pay based on expertise and delivery.

Flexible, async-first remote collaboration.

Opportunity to shape an AI-first data platform from the ground up.

Potential for long-term partnership if the collaboration is successful.

Create a job alert for this search

Engineer • New Delhi, Delhi, India

Related jobs
Data Engineer - Web Scraping

Data Engineer - Web Scraping

Alternative Path • Ghaziabad, IN
Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm.In this role, you will collaborate with individuals across various company de...Show more
Last updated: 30+ days ago • Promoted
Full Stack AI engineer

Full Stack AI engineer

AnswerThis (YC F25) • Ghaziabad, IN
Remote (Applications open worldwide).Semantic Search, Vector Databases, Prompt Engineering, GenAI Frameworks, React Agents, Graph Agents, Document Parsing, Python, Scalable APIs.AnswerThis is an AI...Show more
Last updated: 30+ days ago • Promoted
Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems

Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems

ISIR AI • Noida, Uttar Pradesh, India
Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems.Noida (Onsite) | 🕒 Full-Time | 🚀 Immediate Start.AI is building next-generation. Senior Full Stack Engineer (Lead).UI engineering ...Show more
Last updated: 13 days ago • Promoted
Web Development Engineer

Web Development Engineer

InstaAstro • Noida, Republic Of India, IN
The ideal candidate will have strong web development skills and a solid understanding of programming using.HTML, CSS, JavaScript, jQuery, and either JavaScript or Python. You will be responsible for...Show more
Last updated: 2 days ago • Promoted
Web Crawling Engineer

Web Crawling Engineer

Forage AI • Ghaziabad, IN
The ideal candidate will have strong Python programming skills and experience in web scraping frameworks, browser automation tools, and handling anti-scraping mechanisms. Forage AI is a pioneering A...Show more
Last updated: 15 days ago • Promoted
Full Stack and AI Engineer

Full Stack and AI Engineer

Loam.ai • Ghaziabad, IN
AI Consulting startup that designs and deploys custom artificial‑intelligence solutions for forward‑thinking businesses.We couple state‑of‑the‑art GenAI techniques with rock‑solid engineering to tu...Show more
Last updated: 2 days ago • Promoted
AI Web Scraping Engineer

AI Web Scraping Engineer

S2T AI - AI-Powered Investigations • Ghaziabad, IN
We're seeking a forward-thinking.AI tools to accelerate development and streamline data extraction processes.Join our India team and work at the intersection of traditional scraping expertise and c...Show more
Last updated: 30+ days ago • Promoted
Web3 Engineer

Web3 Engineer

{xpay} • Delhi, India
Agent to Agent payments in the Agentic Economy with its cutting-edge control plane for managing x402 payments.The platform enables businesses to prevent runaway agent costs, monetize APIs instantly...Show more
Last updated: 15 days ago • Promoted
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AI • New Delhi, Delhi, India
The crawler will be integrated with an.AI or NLP / LLM-based components.JavaScript, infinite scrolling, or APIs.LLM-based data labeling, or automated content enrichment modules.Airflow, Prefect, or c...Show more
Last updated: 13 days ago • Promoted
Senior Web Scraping Engineer

Senior Web Scraping Engineer

Zomunk • Ghaziabad, IN
We're building a product that relies heavily on collecting structured data from a number of known websites.We need someone experienced who can own this part of the system end-to-end; from writing s...Show more
Last updated: 8 days ago • Promoted
Senior Full-Stack Web Developer (Remote)

Senior Full-Stack Web Developer (Remote)

Neo Media Group Ltd • Noida, Uttar Pradesh, India
We are seeking a highly experienced Senior Full-Stack Web Developer with expert-level skills in Laravel, Node.This is a fully remote position for a self-starter who can work independently, deliver ...Show more
Last updated: 9 days ago • Promoted
Data Engineer (Webscraping)

Data Engineer (Webscraping)

Solytics Partners • Delhi, India
Company Profile : Solytics Partners is a Global Analytics firm, recognized with multiple industry awards for innovation and excellence. Our team comprises experts with deep knowledge in risk, analyti...Show more
Last updated: 16 days ago • Promoted
Senior Web Applications Engineer

Senior Web Applications Engineer

Neo Media Group Ltd • Noida, Republic Of India, IN
We are seeking a highly experienced Senior Full-Stack Web Developer with expert-level skills in Laravel, Node.This is a fully remote position for a self-starter who can work independently, deliver ...Show more
Last updated: 9 days ago • Promoted
Web Scraping Engineer

Web Scraping Engineer

noon • Delhi, India
Job title : Web Scraping Engineer Location : Gurgaon.About the Role We are looking for a.The ideal candidate will design and implement robust scrapers to collect, clean, and normalize product data (p...Show more
Last updated: 8 days ago • Promoted
Full-stack AI Engineer - Founding Engineer

Full-stack AI Engineer - Founding Engineer

Taglynk • Delhi, India
Founding Engineer , you won’t just write code — you will help.You will : Build full-stack features end-to-end for our AI hiring platform. Work with LLMs, agentic systems, and voice / speech models to c...Show more
Last updated: 8 days ago • Promoted
Web Developer Search Engine Optimization

Web Developer Search Engine Optimization

Orchid Hotel And Catering Supplies • Karol Bagh, Delhi, India
Orchid Dinex is a leading supplier of premium tableware and buffetware for the HoReCa industry.We specialize in porcelain crockery, innovative buffet solutions, banquet and catering buffet displays...Show more
Last updated: 30+ days ago • Promoted
Python Automation & Web Scraping Engineer (2 to 4 yrs)

Python Automation & Web Scraping Engineer (2 to 4 yrs)

AIMLEAP • Delhi, India
Python Automation & Web Scraping Engineer (WFH).Bachelor’s degree in Computer Science / Information Technology.Selenium, BeautifulSoup, Requests , and automation-driven scraping workflows.HTML, CSS...Show more
Last updated: 2 days ago • Promoted
Senior Full Stack Engineer (Lead) - AI Platform & Agentic Systems

Senior Full Stack Engineer (Lead) - AI Platform & Agentic Systems

ISIR AI • Noida, Gautam Buddha Nagar (district)
Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems.Noida (Onsite) | 🕒 Full-Time | 🚀 Immediate Start.AI is building next-generation. Senior Full Stack Engineer (Lead).UI engineering ...Show more
Last updated: 13 days ago • Promoted