Talent.com
Freelance Deep Web Crawler Engineer (Ai-Integrated Data Pipeline)
Freelance Deep Web Crawler Engineer (Ai-Integrated Data Pipeline)Sixteen Alpha AI • Narela, Republic Of India, IN
Freelance Deep Web Crawler Engineer (Ai-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (Ai-Integrated Data Pipeline)

Sixteen Alpha AI • Narela, Republic Of India, IN
14 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

  • Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.
  • Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.
  • Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.
  • Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.
  • Optimize crawling logic for speed, reliability, and stealth across distributed environments.
  • Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

  • Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).
  • Strong understanding of browser automation, session management, and anti-detection mechanisms .
  • Experience integrating AI / ML / NLP pipelines — e.G., text classification, entity recognition, or embedding-based similarity.
  • Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).
  • Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.
  • Ability to design robust data flows that connect crawling → AI inference → storage / visualization.
  • Nice to Have

  • Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .
  • Experience with data cleaning, deduplication, and normalization pipelines .
  • Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .
  • Prior experience integrating real-time analytics dashboards or monitoring tools.
  • What We Offer

  • Competitive freelance pay based on expertise and delivery.
  • Flexible, async-first remote collaboration.
  • Opportunity to shape an AI-first data platform from the ground up.
  • Potential for long-term partnership if the collaboration is successful.
  • Create a job alert for this search

    Engineer • Narela, Republic Of India, IN

    Related jobs
    Full Stack and AI Engineer

    Full Stack and AI Engineer

    Loam.ai • narela, delhi, in
    AI Consulting startup that designs and deploys custom artificial‑intelligence solutions for forward‑thinking businesses.We couple state‑of‑the‑art GenAI techniques with rock‑solid engineering to tu...Show more
    Last updated: 2 days ago • Promoted
    Founding Engineer at JustCopy.AI

    Founding Engineer at JustCopy.AI

    JustCopy Inc • Narela, Delhi, India
    AI provides a platform for cloning production-ready software applications instantly, eliminating the need for extensive coding and AI prompting. Our innovative solution allows users to copy battle-t...Show more
    Last updated: 12 days ago • Promoted
    Full Stack Engineer

    Full Stack Engineer

    Allianze Infosoft • Narela, Delhi, India
    About the Role : We’re looking for a highly skilled Full Stack Developer with strong experience in WordPress and Next.You’ll independently design, develop, and deploy fully functional, high-per...Show more
    Last updated: 22 days ago • Promoted
    Full Stack Engineer

    Full Stack Engineer

    Soopra.ai • Narela, Delhi, India
    AI Personas — digital twins that can think, speak, and engage just like their human counterparts.Join us as we create a world where anyone can build their own AI twin that learns, earns, and lives...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer / AI Engineer

    Software Engineer / AI Engineer

    FlairX • Narela, Delhi, India
    FlairX is a fast-growing Interview-as-a-Service platform helping companies streamline their hiring process through expert-led technical interviews. We are building scalable, AI-driven systems to tra...Show more
    Last updated: 1 day ago • Promoted
    Full Stack Developer

    Full Stack Developer

    WeConnect • narela, delhi, in
    We are seeking an experienced Full-Stack Developer to join a dynamic and globally recognized academic institution on a 9-month contractual engagement. The role involves developing and maintaining we...Show more
    Last updated: 2 days ago • Promoted
    Senior Deep Learning Engineer

    Senior Deep Learning Engineer

    Nanonets • Narela, Delhi, India
    Join Nanonets to push the boundaries of what's possible with deep learning.We're not just implementing models – we're setting new benchmarks in document AI, with our open-source models achieving n...Show more
    Last updated: 30+ days ago • Promoted
    Senior Front-End Web Developer (HTML & Bootstrap)

    Senior Front-End Web Developer (HTML & Bootstrap)

    KBM Resorts • Narela, Delhi, India
    HOW TO APPLY Copy the following with answers and email to raghav@kbmresorts.Number of years and version experience in : A) Bootstrap : B) Angular : C) UI / CSS Design : D) UI Testing Frameworks 2) Des...Show more
    Last updated: 30+ days ago • Promoted
    Senior Snowflake Data Engineer

    Senior Snowflake Data Engineer

    Luxoft • narela, delhi, in
    We are seeking a highly skilled Snowflake Data Engineer with 7 years of IT experience to design, build, and optimize scalable data pipelines and cloud-based solutions across AWS, Azure, and GCP.The...Show more
    Last updated: 2 days ago • Promoted
    Web3 Engineer

    Web3 Engineer

    {xpay} • Narela, Delhi, India
    Agent to Agent payments in the Agentic Economy with its cutting-edge control plane for managing x402 payments.The platform enables businesses to prevent runaway agent costs, monetize APIs instantly...Show more
    Last updated: 15 days ago • Promoted
    Web Developer

    Web Developer

    Smart Moves Consultants • Narela, Delhi, India
    Key Responsibilities : Design and develop high-performance, responsive web portals using React.Build scalable backend services and APIs with Node. Integrate and optimize Snowflake for secure data sto...Show more
    Last updated: 1 day ago • Promoted
    AI Agent Developer

    AI Agent Developer

    Sikich India • Narela, Delhi, India
    Sikich is seeking a talented and driven developers with 3-5 years of experience to help us design, build, and deploy intelligent agents using Microsoft’s ecosystem. This role involves working with M...Show more
    Last updated: 30+ days ago • Promoted
    Full Stack Engineer

    Full Stack Engineer

    Awign Expert • Narela, Delhi, India
    Duration : Permanent Location : Remote Timings : Full Time - IST (As per company timings) Notice Period : (Immediate Joiner - Only) Experience : 4-6 Years Requirements 4+ years of experience building co...Show more
    Last updated: 10 hours ago • Promoted • New!
    Web Crawling Engineer

    Web Crawling Engineer

    Forage AI • Narela, Delhi, India
    We are seeking a Web Crawling Engineer who will be responsible for building and maintaining web crawlers, extracting valuable insights from the web, and ensuring data quality.The ideal candidate ...Show more
    Last updated: 15 days ago • Promoted
    Python Automation and Web Scraping Specialist

    Python Automation and Web Scraping Specialist

    Foresiet • Narela, Delhi, India
    Company Description Foresiet is an AI-enabled SaaS-based Cybersecurity company that provides a comprehensive solution for Digital Risk Prevention. Leveraging the Cyber Digital Investigator platform,...Show more
    Last updated: 30+ days ago • Promoted
    Senior IoT Full Stack Engineer

    Senior IoT Full Stack Engineer

    IntraEdge • Narela, Delhi, India
    We’re rebuilding a legacy IoT monolith into a modern microservices-based platform on Azure.Looking for a hands-on IoT engineer who can own development across cloud and edge services.This role focus...Show more
    Last updated: 22 days ago • Promoted
    Sr Full Stack developer AWS

    Sr Full Stack developer AWS

    Falkondata • Narela, Delhi, India
    ONLY IMMEDIATE SR Joiners apply Company Description Falkondata specializes in delivering innovative cloud solutions that seamlessly connect fragmented healthcare systems, improve workflows, and en...Show more
    Last updated: 11 days ago • Promoted
    Senior.Net Web Developer and SQL Expert

    Senior.Net Web Developer and SQL Expert

    Atigro • Narela, Delhi, India
    Net developer with a passion for cutting edge? Join Atigro and play a key role in shaping the future of AI-powered enterprise solutions! We’re a fast-growing AI team working on innovative, challeng...Show more
    Last updated: 22 days ago • Promoted