Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)Sixteen Alpha AI • Gurgaon, Haryana, India
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AI • Gurgaon, Haryana, India
16 days ago
Job description

About the Project We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.

Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.

Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.

Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.

Optimize crawling logic for speed, reliability, and stealth across distributed environments.

Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).

Strong understanding of browser automation, session management, and anti-detection mechanisms .

Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.

Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).

Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.

Ability to design robust data flows that connect crawling → AI inference → storage / visualization.

Nice to Have Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .

Experience with data cleaning, deduplication, and normalization pipelines .

Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .

Prior experience integrating real-time analytics dashboards or monitoring tools.

What We Offer Competitive freelance pay based on expertise and delivery.

Flexible, async-first remote collaboration.

Opportunity to shape an AI-first data platform from the ground up.

Potential for long-term partnership if the collaboration is successful.

Create a job alert for this search

Engineer • Gurgaon, Haryana, India

Related jobs
AI Engineer

AI Engineer

Recro • Gurugram, Haryana, India
Designing & deploying agentic workflows (Semantic Kernel / LangGraph / AutoGen / CrewAI).Building tool-calling flows, RAG pipelines, and hybrid search. Deploying AI agents on cloud (containers, iden...Show more
Last updated: 30+ days ago • Promoted
Lead Applied AI Engineer

Lead Applied AI Engineer

Taggd • Gurugram, Haryana, India
We’re building agentic AI for recruitment workflows—sourcing, screening, interview assistance, and offer orchestration.You’ll own LLM / agent design, retrieval, evaluation, safety, and targeted.Featu...Show more
Last updated: 6 days ago • Promoted
Python Web Scraping Engineer – Automation (3 to 10 yrs)

Python Web Scraping Engineer – Automation (3 to 10 yrs)

AIMLEAP • gurugram, uttar pradesh, in
Python Web Scraping Engineer – Advanced Automation (WFH).Bachelor’s degree in Computer Science, IT, or related field .IT / Software Services / Data & AI . Strong hands-on experience handling.Seleniu...Show more
Last updated: 11 hours ago • Promoted • New!
AI Engineer

AI Engineer

Magna Hire • Gurgaon
Job Description : We're building the future of AI-driven security automation and are looking for a hands-on AI Engineer who can design, deploy, and scal...Show more
Last updated: 30+ days ago • Promoted
Web Scraping Engineer

Web Scraping Engineer

noon • Gurugram, Haryana, India
Job title : Web Scraping Engineer.The ideal candidate will design and implement robust scrapers to collect, clean, and normalize product data (pricing, availability, reviews, images, etc.Develop and...Show more
Last updated: 6 days ago • Promoted
Generative AI Engineer

Generative AI Engineer

Reqpedia • Gurugram, Haryana, India
We seek a motivated Junior Generative AI Developer to design, implement, and optimize cutting-edge generative AI solutions. You’ll work closely with senior engineers to build applications leveraging...Show more
Last updated: 6 days ago • Promoted
Full Stack Developer - FastAPI

Full Stack Developer - FastAPI

Nagarro • Gurugram, Haryana, India
FastAPI (Capable),Node JS,React (Expert),Java.Define and lead application architecture across complex digital systems, emphasizing modular, reusable, and scalable designs.Architect solutions using ...Show more
Last updated: 17 days ago • Promoted
Senior Web Developer (Full Stack)

Senior Web Developer (Full Stack)

Gem3s Technologies Pvt. Ltd. • Gurgaon, Haryana, India
Job Summary : We are seeking a highly skilled Full Stack Developer who is proficient in both front-end and back-end development. The ideal candidate will have experience with all stages of software ...Show more
Last updated: 8 hours ago • Promoted • New!
Full-Stack Developer - 20414

Full-Stack Developer - 20414

Turing • gurgaon, India
Turing is looking for experienced Full Stack Developers to build modern solutions that power AI products and evaluation workflows. LLM behavior with real-world user needs.This is a remote, flexible ...Show more
Last updated: 30+ days ago • Promoted
Full Stack Engineer

Full Stack Engineer

Programmers.io • gurgaon, haryana, in
Job Title : Senior Full Stack Developer (Laravel + Vue).We are seeking highly skilled Senior Full Stack Developers with 7–10 years of experience in Laravel and modern frontend frameworks (Vue.The ca...Show more
Last updated: 14 days ago • Promoted
Generative AI Engineer

Generative AI Engineer

Live Connections • gurgaon, haryana, in
Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field; Master’s or PhD preferred. ML / Data Science with a focus on generative AI, LLMs, or co...Show more
Last updated: 16 days ago • Promoted
Full Stack Web Developer (Agentic AI Application)

Full Stack Web Developer (Agentic AI Application)

Aryng • Gurugram, HR, IN
Remote
Quick Apply
Welcome! You made it to the job description page!.This is a 100% REMOTE job opportunity.You can work from anywhere, given that you have strong internet connectivity and a personal device (laptop) t...Show more
Last updated: 30+ days ago
Lead Full-Stack + AI Engineer (Founding Team)

Lead Full-Stack + AI Engineer (Founding Team)

Grovio AI • gurugram, uttar pradesh, in
We’re building an autonomous, multi-agent AI OS that plans, executes, and optimizes marketing across modern digital ecosystems. Think : an AI that acts like a virtual CMO — planning, writing, analyz...Show more
Last updated: 3 hours ago • Promoted • New!
Founding AI Engineer

Founding AI Engineer

Ourguide.ai • Gurugram, Haryana, India
Jobs Title : Founding AI Engineer – Computer / Browser Use Systems.Type : Full-time | Start : Immediate.We’re building an AI desktop app that can see your screen and take the next step for you—a true co...Show more
Last updated: 5 days ago • Promoted
Web & Product Engineer

Web & Product Engineer

Kapable • Gurgaon, Haryana, India
About Kapable Kapable is a leadership transformation platform helping CXOs, founders, and senior professionals from top global companies become better leaders through our “Thinkable, Speakable, Wor...Show more
Last updated: 8 hours ago • Promoted • New!
Fullstack AI / ML Developer

Fullstack AI / ML Developer

Cybotic System • Gurgaon, Haryana, India
Role Overview : Full Stack Engineer As an Full Stack Engineer within the Digital department, you will be responsible for designing, building, and deploying advanced AI solutions.These include chatbo...Show more
Last updated: 4 days ago • Promoted
CoreOps.AI - Generative AI Engineer

CoreOps.AI - Generative AI Engineer

COREOPS.AI PRIVATE LIMITED • Gurugram
Job Description - Generative AI Software Engineer Overview : CoreOps.AI is looking for an experienced Generat...Show more
Last updated: 9 days ago • Promoted
Full Stack Engineer

Full Stack Engineer

AideWiser SolTek • gurgaon, haryana, in
AWS (EC2, Lambda, S3, RDS, DynamoDB, etc.Design, develop, and maintain backend services using.Net Core / MVC and frontend components using React. Build and scale backend systems on AWS cloud infrastru...Show more
Last updated: 30+ days ago • Promoted