Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)Sixteen Alpha AI • New Delhi, Delhi, India
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AI • New Delhi, Delhi, India
16 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

  • Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.
  • Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.
  • Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.
  • Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.
  • Optimize crawling logic for speed, reliability, and stealth across distributed environments.
  • Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

  • Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).
  • Strong understanding of browser automation, session management, and anti-detection mechanisms .
  • Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.
  • Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).
  • Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.
  • Ability to design robust data flows that connect crawling → AI inference → storage / visualization.
  • Nice to Have

  • Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .
  • Experience with data cleaning, deduplication, and normalization pipelines .
  • Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .
  • Prior experience integrating real-time analytics dashboards or monitoring tools.
  • What We Offer

  • Competitive freelance pay based on expertise and delivery.
  • Flexible, async-first remote collaboration.
  • Opportunity to shape an AI-first data platform from the ground up.
  • Potential for long-term partnership if the collaboration is successful.
  • Create a job alert for this search

    Engineer • New Delhi, Delhi, India

    Related jobs
    Full Stack AI engineer

    Full Stack AI engineer

    AnswerThis (YC F25) • Ghaziabad, IN
    Remote (Applications open worldwide).Semantic Search, Vector Databases, Prompt Engineering, GenAI Frameworks, React Agents, Graph Agents, Document Parsing, Python, Scalable APIs.AnswerThis is an AI...Show more
    Last updated: 30+ days ago • Promoted
    Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems

    Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems

    ISIR AI • Noida, Uttar Pradesh, India
    Senior Full Stack Engineer (Lead) – AI Platform & Agentic Systems.Noida (Onsite) | 🕒 Full-Time | 🚀 Immediate Start.AI is building next-generation. Senior Full Stack Engineer (Lead).UI engineering ...Show more
    Last updated: 16 days ago • Promoted
    Full Stack and AI Engineer

    Full Stack and AI Engineer

    Loam.ai • Delhi, India
    AI Consulting startup that designs and deploys custom artificial‑intelligence solutions for forward‑thinking businesses.We couple state‑of‑the‑art GenAI techniques with rock‑solid engineering to tu...Show more
    Last updated: 4 days ago • Promoted
    Lead Full-Stack + AI Engineer (Founding Team)

    Lead Full-Stack + AI Engineer (Founding Team)

    Grovio AI • delhi, delhi, in
    We’re building an autonomous, multi-agent AI OS that plans, executes, and optimizes marketing across modern digital ecosystems. Think : an AI that acts like a virtual CMO — planning, writing, analyz...Show more
    Last updated: 15 hours ago • Promoted • New!
    Senior Web Scraping Engineer

    Senior Web Scraping Engineer

    Zomunk • Ghaziabad, IN
    We're building a product that relies heavily on collecting structured data from a number of known websites.We need someone experienced who can own this part of the system end-to-end; from writing s...Show more
    Last updated: 11 days ago • Promoted
    Senior Full-Stack Web Developer (Remote)

    Senior Full-Stack Web Developer (Remote)

    Neo Media Group Ltd • Noida, Uttar Pradesh, India
    We are seeking a highly experienced Senior Full-Stack Web Developer with expert-level skills in Laravel, Node.This is a fully remote position for a self-starter who can work independently, deliver ...Show more
    Last updated: 12 days ago • Promoted
    Data Engineer (Webscraping)

    Data Engineer (Webscraping)

    Solytics Partners • Delhi, India
    Company Profile : Solytics Partners is a Global Analytics firm, recognized with multiple industry awards for innovation and excellence. Our team comprises experts with deep knowledge in risk, analyti...Show more
    Last updated: 18 days ago • Promoted
    AI Engineer

    AI Engineer

    Recro • Delhi, India
    What you’ll work on : • Designing & deploying agentic workflows (Semantic Kernel / LangGraph / AutoGen / CrewAI) • Building tool-calling flows, RAG pipelines, and hybrid search • Deploying AI agents...Show more
    Last updated: 1 day ago • Promoted
    Senior Web Developer (Full Stack)

    Senior Web Developer (Full Stack)

    Gem3s Technologies Pvt. Ltd. • Delhi, India
    We are seeking a highly skilled Full Stack Developer who is proficient in both front-end and back-end development.The ideal candidate will have experience with all stages of software development an...Show more
    Last updated: 17 hours ago • Promoted • New!
    AI Engineer - SDE 2

    AI Engineer - SDE 2

    Attri • Delhi, India
    We are looking for an ambitious and skilled Software Development Engineer II to join our specialized AI / ML engineering team. This role is crucial for developing and deploying scalable, production-re...Show more
    Last updated: 17 hours ago • Promoted • New!
    Web Developer (Freelance)

    Web Developer (Freelance)

    Sweet • Ghaziabad, IN
    Sweet is the AI-native business platform built for creators — a business partner that clears the clutter, automates the back-office, and gives creators the freedom to focus on craft, while Sweet gr...Show more
    Last updated: 5 days ago • Promoted
    Web Scraping Engineer

    Web Scraping Engineer

    noon • Delhi, India
    Job title : Web Scraping Engineer Location : Gurgaon.About the Role We are looking for a.The ideal candidate will design and implement robust scrapers to collect, clean, and normalize product data (p...Show more
    Last updated: 10 days ago • Promoted
    Sr. Back End Engineer (Python)

    Sr. Back End Engineer (Python)

    Jeeva AI • Delhi, India
    This is an on site position in Kharadi, Pune.PM - 3 AM Remote / Hybrid options are.Notice Period - Immediate joiners or those with a maximum notice period of 30 days are preferred.We’re a fast-growin...Show more
    Last updated: 24 days ago • Promoted
    Full-stack AI Engineer - Founding Engineer

    Full-stack AI Engineer - Founding Engineer

    Taglynk • Delhi, India
    Founding Engineer , you won’t just write code — you will help.You will : Build full-stack features end-to-end for our AI hiring platform. Work with LLMs, agentic systems, and voice / speech models to c...Show more
    Last updated: 10 days ago • Promoted
    Python Web Scraping Engineer – Automation (3 to 10 yrs)

    Python Web Scraping Engineer – Automation (3 to 10 yrs)

    AIMLEAP • Delhi, India
    Python Web Scraping Engineer – Advanced Automation (WFH).Remote (Work from Home) Mode of Engagement : .Bachelor’s degree in Computer Science, IT, or related field Industry : .IT / Software Services / D...Show more
    Last updated: 17 hours ago • Promoted • New!
    Web Developer Search Engine Optimization

    Web Developer Search Engine Optimization

    Orchid Hotel And Catering Supplies • Karol Bagh, Delhi, India
    Orchid Dinex is a leading supplier of premium tableware and buffetware for the HoReCa industry.We specialize in porcelain crockery, innovative buffet solutions, banquet and catering buffet displays...Show more
    Last updated: 30+ days ago • Promoted
    Web & Product Engineer

    Web & Product Engineer

    Kapable • Delhi, India, India
    Kapable is a leadership transformation platform helping CXOs, founders, and senior professionals from top global companies become better leaders through our “Thinkable, Speakable, Workable” program...Show more
    Last updated: 19 hours ago • Promoted • New!
    Lead AI Engineer

    Lead AI Engineer

    APPIT Software Inc • Delhi, India
    What You’ll Do Architect, build, and optimize production-grade Generative AI applications using modern frameworks such as LangChain, LlamaIndex, Semantic Kernel, or custom orchestration layers.Lead...Show more
    Last updated: 3 days ago • Promoted