Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
No longer accepting applications
Sixteen Alpha AI • Panipat, India
9 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources, including sites behind authentication, pages with infinite scroll, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.

Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.

Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.

Integrate with vector databases, LLM-based data labeling, or automated content enrichment modules.

Optimize crawling logic for speed, reliability, and stealth across distributed environments.

Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).

Strong understanding of browser automation, session management, and anti-detection mechanisms.

Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.

Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).

Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.

Ability to design robust data flows that connect crawling → AI inference → storage / visualization.

Nice to Have

Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models).

Experience with data cleaning, deduplication, and normalization pipelines.

Familiarity with distributed crawling frameworks (Ray, Celery, Kafka).

Prior experience integrating real-time analytics dashboards or monitoring tools.

What We Offer

Competitive freelance pay based on expertise and delivery.

Flexible, async-first remote collaboration.

Opportunity to shape an AI-first data platform from the ground up.

Potential for long-term partnership if the collaboration is successful.
