Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)Sixteen Alpha AI • New Delhi, Delhi, India
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AI • New Delhi, Delhi, India
15 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

  • Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.
  • Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.
  • Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.
  • Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.
  • Optimize crawling logic for speed, reliability, and stealth across distributed environments.
  • Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

  • Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).
  • Strong understanding of browser automation, session management, and anti-detection mechanisms .
  • Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.
  • Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).
  • Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.
  • Ability to design robust data flows that connect crawling → AI inference → storage / visualization.
  • Nice to Have

  • Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .
  • Experience with data cleaning, deduplication, and normalization pipelines .
  • Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .
  • Prior experience integrating real-time analytics dashboards or monitoring tools.
  • What We Offer

  • Competitive freelance pay based on expertise and delivery.
  • Flexible, async-first remote collaboration.
  • Opportunity to shape an AI-first data platform from the ground up.
  • Potential for long-term partnership if the collaboration is successful.
  • Create a job alert for this search

    Engineer • New Delhi, Delhi, India

    Related jobs
    Full Stack AI engineer

    Full Stack AI engineer

    AnswerThis (YC F25) • Ghaziabad, IN
    Remote (Applications open worldwide).Semantic Search, Vector Databases, Prompt Engineering, GenAI Frameworks, React Agents, Graph Agents, Document Parsing, Python, Scalable APIs.AnswerThis is an AI...Show more
    Last updated: 30+ days ago • Promoted
    Senior Web Developer (Full Stack)

    Senior Web Developer (Full Stack)

    Gem3s Technologies Pvt. Ltd. • Ghaziabad, IN
    We are seeking a highly skilled Full Stack Developer who is proficient in both front-end and back-end development.The ideal candidate will have experience with all stages of software development an...Show more
    Last updated: 7 hours ago • Promoted • New!
    Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems

    Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems

    ISIR AI • Noida, Uttar Pradesh, India
    Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems.Noida (Onsite) | 🕒 Full-Time | 🚀 Immediate Start.AI is building next-generation. Senior Full Stack Engineer (Lead).UI engineering ...Show more
    Last updated: 15 days ago • Promoted
    AI Web Scraping Engineer

    AI Web Scraping Engineer

    S2T AI - AI-Powered Investigations • Delhi, India
    We're seeking a forward-thinking.AI tools to accelerate development and streamline data extraction processes.Join our India team and work at the intersection of traditional scraping expertise and c...Show more
    Last updated: 30+ days ago • Promoted
    Web Crawling Engineer

    Web Crawling Engineer

    Forage AI • Ghaziabad, IN
    The ideal candidate will have strong Python programming skills and experience in web scraping frameworks, browser automation tools, and handling anti-scraping mechanisms. Forage AI is a pioneering A...Show more
    Last updated: 17 days ago • Promoted
    Full Stack and AI Engineer

    Full Stack and AI Engineer

    Loam.ai • Ghaziabad, IN
    AI Consulting startup that designs and deploys custom artificial‑intelligence solutions for forward‑thinking businesses.We couple state‑of‑the‑art GenAI techniques with rock‑solid engineering to tu...Show more
    Last updated: 4 days ago • Promoted
    Webflow Developer

    Webflow Developer

    Parentheses • Delhi, India
    We’re looking for a Webflow Developer with 2–4 years of experience in bringing Figma designs to life through responsive, motion-rich web experiences. Full-time | Work-from-studio | Bengaluru.WHAT WE...Show more
    Last updated: 16 days ago • Promoted
    Webflow Developer

    Webflow Developer

    Zemoso Technologies • Delhi, India
    Job Title : Webflow Developer (Marketing & Ops).Zemoso Labs is an innovation studio offering innovation-as-a-service, with design, engineering, and growth services. Our clientele includes Fortune 500...Show more
    Last updated: 21 days ago • Promoted
    Web3 Engineer

    Web3 Engineer

    {xpay} • Delhi, India
    Agent to Agent payments in the Agentic Economy with its cutting-edge control plane for managing x402 payments.The platform enables businesses to prevent runaway agent costs, monetize APIs instantly...Show more
    Last updated: 16 days ago • Promoted
    Senior Web Scraping Engineer

    Senior Web Scraping Engineer

    Zomunk • Ghaziabad, IN
    We're building a product that relies heavily on collecting structured data from a number of known websites.We need someone experienced who can own this part of the system end-to-end; from writing s...Show more
    Last updated: 10 days ago • Promoted
    Senior Full-Stack Web Developer (Remote)

    Senior Full-Stack Web Developer (Remote)

    Neo Media Group Ltd • Noida, Uttar Pradesh, India
    We are seeking a highly experienced Senior Full-Stack Web Developer with expert-level skills in Laravel, Node.This is a fully remote position for a self-starter who can work independently, deliver ...Show more
    Last updated: 11 days ago • Promoted
    Data Engineer (Webscraping)

    Data Engineer (Webscraping)

    Solytics Partners • Delhi, India
    Company Profile : Solytics Partners is a Global Analytics firm, recognized with multiple industry awards for innovation and excellence. Our team comprises experts with deep knowledge in risk, analyti...Show more
    Last updated: 17 days ago • Promoted
    AI Engineer

    AI Engineer

    Recro • Delhi, India
    What you’ll work on : • Designing & deploying agentic workflows (Semantic Kernel / LangGraph / AutoGen / CrewAI) • Building tool-calling flows, RAG pipelines, and hybrid search • Deploying AI agents...Show more
    Last updated: 19 hours ago • Promoted • New!
    Web Developer (Freelance)

    Web Developer (Freelance)

    Sweet • Ghaziabad, IN
    Sweet is the AI-native business platform built for creators — a business partner that clears the clutter, automates the back-office, and gives creators the freedom to focus on craft, while Sweet gr...Show more
    Last updated: 4 days ago • Promoted
    Web Scraping Engineer

    Web Scraping Engineer

    noon • Delhi, India
    Job title : Web Scraping Engineer Location : Gurgaon.About the Role We are looking for a.The ideal candidate will design and implement robust scrapers to collect, clean, and normalize product data (p...Show more
    Last updated: 9 days ago • Promoted
    Full-stack AI Engineer - Founding Engineer

    Full-stack AI Engineer - Founding Engineer

    Taglynk • Delhi, India
    Founding Engineer , you won’t just write code — you will help.You will : Build full-stack features end-to-end for our AI hiring platform. Work with LLMs, agentic systems, and voice / speech models to c...Show more
    Last updated: 9 days ago • Promoted
    Web Developer Search Engine Optimization

    Web Developer Search Engine Optimization

    Orchid Hotel And Catering Supplies • Karol Bagh, Delhi, India
    Orchid Dinex is a leading supplier of premium tableware and buffetware for the HoReCa industry.We specialize in porcelain crockery, innovative buffet solutions, banquet and catering buffet displays...Show more
    Last updated: 30+ days ago • Promoted
    Web & Product Engineer

    Web & Product Engineer

    Kapable • Ghaziabad, IN
    Kapable is a leadership transformation platform helping CXOs, founders, and senior professionals from top global companies become better leaders through our “Thinkable, Speakable, Workable” program...Show more
    Last updated: 7 hours ago • Promoted • New!