Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)Sixteen Alpha AI • Ludhiana, Punjab, India
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AI • Ludhiana, Punjab, India
14 days ago
Job description

About the Project We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.

Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.

Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.

Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.

Optimize crawling logic for speed, reliability, and stealth across distributed environments.

Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).

Strong understanding of browser automation, session management, and anti-detection mechanisms .

Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.

Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).

Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.

Ability to design robust data flows that connect crawling → AI inference → storage / visualization.

Nice to Have Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .

Experience with data cleaning, deduplication, and normalization pipelines .

Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .

Prior experience integrating real-time analytics dashboards or monitoring tools.

What We Offer Competitive freelance pay based on expertise and delivery.

Flexible, async-first remote collaboration.

Opportunity to shape an AI-first data platform from the ground up.

Potential for long-term partnership if the collaboration is successful.

Create a job alert for this search

Engineer • Ludhiana, Punjab, India

Related jobs
Web Developer (Freelance)

Web Developer (Freelance)

Sweet • ludhiana, punjab, in
Sweet is the AI-native business platform built for creators — a business partner that clears the clutter, automates the back-office, and gives creators the freedom to focus on craft, while Sweet gr...Show more
Last updated: 2 days ago • Promoted
Data Engineer

Data Engineer

System Soft Technologies • ludhiana, punjab, in
Location : Remote (3–4-hour time zone overlaps with EST if off shore).Experience with next flow is required, as the consultant will make targeted enhancements to existing workflows and pipelines.Whi...Show more
Last updated: 2 days ago • Promoted
Full Stack Engineer

Full Stack Engineer

Insight Global • ludhiana, punjab, in
Contract with Insight Global Client.React, React Native, TypeScript.React, React Native, and TypeScript.Deploy containerized solutions using. Ensure high-quality deliverables through.CI / CD pipelines...Show more
Last updated: 30+ days ago • Promoted
Full Stack and AI Engineer

Full Stack and AI Engineer

Loam.ai • ludhiana, punjab, in
AI Consulting startup that designs and deploys custom artificial‑intelligence solutions for forward‑thinking businesses.We couple state‑of‑the‑art GenAI techniques with rock‑solid engineering to tu...Show more
Last updated: 2 days ago • Promoted
Senior Snowflake Data Engineer

Senior Snowflake Data Engineer

Luxoft • ludhiana, punjab, in
We are seeking a highly skilled Snowflake Data Engineer with 7 years of IT experience to design, build, and optimize scalable data pipelines and cloud-based solutions across AWS, Azure, and GCP.The...Show more
Last updated: 2 days ago • Promoted
Full Stack Engineer

Full Stack Engineer

Beast Insights • ludhiana, punjab, in
We’re building the Payment Command Center for high-risk merchants — a platform that helps businesses recover failed payments, prevent chargebacks, and boost approval rates using data and intelligen...Show more
Last updated: 2 days ago • Promoted
Search Engineer

Search Engineer

YourTribe • ludhiana, punjab, in
Design & implement search solutions.Architect and develop advanced search features using.OpenSearch / Elasticsearch, including custom analysers, tokenisers, and scoring algorithms.Create and maintain...Show more
Last updated: 30+ days ago • Promoted
Full Stack Engineer (4-6 YOE)

Full Stack Engineer (4-6 YOE)

Redica Systems • Ludhiana, Punjab, India
About UsRedica Systems is a SaaS start-up serving more than 200 customers within the life science sector, with a specific focus on Pharmaceuticals and MedTech. Our workforce is distributed globally,...Show more
Last updated: 6 hours ago • Promoted • New!
Full Stack Developer

Full Stack Developer

ShopTrade (Shopify Premier Agency) • ludhiana, punjab, in
Full Stack Developer Location : Bengaluru, Remote.We are a full-service Shopify-focused eCommerce agency and a featured official Shopify Plus s...Show more
Last updated: 23 days ago • Promoted
Artificial Intelligence Engineer

Artificial Intelligence Engineer

ACL Digital • ludhiana, punjab, in
We are Hiring : AI Engineer : Remote Opportunity.Design, develop and deploy scalable.Machine Learning and AI models.Perform data extraction, cleaning, transformation and modeling using.Develop end-to...Show more
Last updated: 2 days ago • Promoted
Full Stack Engineer

Full Stack Engineer

Programmers.io • ludhiana, punjab, in
Job Title : Senior Full Stack Developer (Laravel + Vue).We are seeking highly skilled Senior Full Stack Developers with 7–10 years of experience in Laravel and modern frontend frameworks (Vue.The ca...Show more
Last updated: 12 days ago • Promoted
AI Data Engineer - 17852

AI Data Engineer - 17852

Turing • ludhiana, punjab, in
We’re looking for experienced AI data engineers skilled in Python to collaborate with one of the world’s top Large Language Model (LLM) companies. Your work will directly help improve how AI models ...Show more
Last updated: 21 days ago • Promoted
Ai Engineer

Ai Engineer

NyxaLabs • Ludhiāna, Republic Of India, IN
We're seeking an exceptional AI Engineer with deep expertise in TensorFlow model training to design and build next-generation AI systems. This role focuses on developing sophisticated machine learni...Show more
Last updated: 4 hours ago • Promoted • New!
Python Automation & Web Scraping Engineer (2 to 4 yrs)

Python Automation & Web Scraping Engineer (2 to 4 yrs)

AIMLEAP • Ludhiana, Punjab, India
Python Automation & Web Scraping Engineer (WFH)Experience : 2–4 YearsLocation : RemoteMode of Engagement : Full-timeNo of Positions : 3Educational Qualifications : Bachelor’s degree in Computer Sci...Show more
Last updated: 6 hours ago • Promoted • New!
Web Developer

Web Developer

Smart Moves Consultants • Ludhiana, Punjab, India
Key Responsibilities : Design and develop high-performance, responsive web portals using React.Build scalable backend services and APIs with Node. Integrate and optimize Snowflake for secure data stor...Show more
Last updated: 2 hours ago • Promoted • New!
Full-stack Product Engineer

Full-stack Product Engineer

Rocket Equities • ludhiana, punjab, in
US$5,000 per month + performance bonus tied to shipped deliverables and product impact.We are hiring a Full-Stack Product Engineer to build, ship, and iterate on our company-investor matchmaking pl...Show more
Last updated: 17 hours ago • Promoted • New!
Full Stack Engineer

Full Stack Engineer

UsefulBI Corporation • Ludhiana, Punjab, India
About UsefulBI : UsefulBI is a leading AI-driven data solutions provider specializing in data engineering, cloud transformations, and AI-powered analytics for Fortune 500 companies.We help busines...Show more
Last updated: 22 days ago • Promoted
Full Stack Engineer

Full Stack Engineer

ValueLabs • Ludhiana, Punjab, India
Role Overview : We are seeking an experienced Full Stack Developer who will own the complete development lifecycle of our application, ensuring robust, scalable, and secure backend and frontend s...Show more
Last updated: 30+ days ago • Promoted