Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)Sixteen Alpha AI • baddi, himachal pradesh, in
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AI • baddi, himachal pradesh, in
17 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

  • Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.
  • Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.
  • Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.
  • Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.
  • Optimize crawling logic for speed, reliability, and stealth across distributed environments.
  • Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

  • Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).
  • Strong understanding of browser automation, session management, and anti-detection mechanisms .
  • Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.
  • Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).
  • Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.
  • Ability to design robust data flows that connect crawling → AI inference → storage / visualization.
  • Nice to Have

  • Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .
  • Experience with data cleaning, deduplication, and normalization pipelines .
  • Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .
  • Prior experience integrating real-time analytics dashboards or monitoring tools.
  • What We Offer

  • Competitive freelance pay based on expertise and delivery.
  • Flexible, async-first remote collaboration.
  • Opportunity to shape an AI-first data platform from the ground up.
  • Potential for long-term partnership if the collaboration is successful.
  • Create a job alert for this search

    Engineer • baddi, himachal pradesh, in

    Related jobs
    Full Stack Engineer

    Full Stack Engineer

    Programmers.io • baddi, himachal pradesh, in
    Job Title : Senior Full Stack Developer (Laravel + Vue).We are seeking highly skilled Senior Full Stack Developers with 7–10 years of experience in Laravel and modern frontend frameworks (Vue.The ca...Show more
    Last updated: 15 days ago • Promoted
    Junior Software Engineer

    Junior Software Engineer

    Tilda Research • baddi, himachal pradesh, in
    A passion for building scalable AI agents.Build scalable back-end services using.Optimize graph database queries and models in Neo4j. Collaborate cross-functionally with Product, Engineering, and Cl...Show more
    Last updated: 11 hours ago • Promoted • New!
    Back End Developer

    Back End Developer

    InstaSupply.ca • baddi, India
    InstaSupply (Construction supply + logistics platform).InstaSupply is building a full marketplace ecosystem for construction materials : . Supplier sync with Shopify, Clover, QuickBooks and more.You w...Show more
    Last updated: 3 hours ago • Promoted • New!
    Full Stack Engineer

    Full Stack Engineer

    Insight Global • baddi, himachal pradesh, in
    Duration : 6 month contract with potential to convert permanent.JS; primary codebase is frontend-heavy.Proficient with Git for source code management. Hands on experience with AWS Elastic Beans, EC2,...Show more
    Last updated: 30+ days ago • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Live Connections • baddi, himachal pradesh, in
    Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field; Master’s or PhD preferred. ML / Data Science with a focus on generative AI, LLMs, or co...Show more
    Last updated: 17 days ago • Promoted
    Full Stack Engineer

    Full Stack Engineer

    AideWiser SolTek • baddi, himachal pradesh, in
    AWS (EC2, Lambda, S3, RDS, DynamoDB, etc.Design, develop, and maintain backend services using.Net Core / MVC and frontend components using React. Build and scale backend systems on AWS cloud infrastru...Show more
    Last updated: 30+ days ago • Promoted
    Freelance Scala Data Engineer (Airflow • SQL)

    Freelance Scala Data Engineer (Airflow • SQL)

    ThreatXIntel • baddi, himachal pradesh, in
    ThreatXIntel is a cybersecurity startup that specializes in providing tailored, cost-effective solutions for businesses and organizations to safeguard their digital assets.As experts in cloud secur...Show more
    Last updated: 11 hours ago • Promoted • New!
    Software Engineer (Full Stack) - 17853

    Software Engineer (Full Stack) - 17853

    Turing • baddi, himachal pradesh, in
    Turing is seeking experienced Full Stack Software Engineers to help build end-to-end AI-driven applications for US customers — spanning backend services, web frontends, and evaluation tooling.In th...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI Engineer

    Senior AI Engineer

    Xtnsion.AI • baddi, himachal pradesh, in
    AI is building the agentic CX layer for modern businesses — AI voice + chat agents that autonomously handle bookings, lead follow-up, support workflows, CRM actions, and more across phone, WhatsApp...Show more
    Last updated: 11 hours ago • Promoted • New!
    Full-Stack Developer - 20414

    Full-Stack Developer - 20414

    Turing • Baddi, Himachal Pradesh, India
    Role Overview : Turing is looking for experienced Full Stack Developers to build modern solutions that power AI products and evaluation workflows. React / Angular / Vue) to implement features, improve c...Show more
    Last updated: 30+ days ago • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Cozzera • baddi, himachal pradesh, in
    We are looking for an experienced AI / ML Engineer with a strong background in machine learning and deep learning, especially in time-series, sensor, and behavioral data. Strong foundation in ML and d...Show more
    Last updated: 11 hours ago • Promoted • New!
    Senior Software Engineer

    Senior Software Engineer

    Programmers.io • baddi, himachal pradesh, in
    We are seeking a highly skilled and experienced Senior Azure Data Engineer to join our team.The ideal candidate will have deep expertise in Microsoft Azure data services, cloud-based data engineeri...Show more
    Last updated: 30+ days ago • Promoted
    Lead Software Engineer

    Lead Software Engineer

    DigiFocal IT Solutions Pvt Ltd • Baddi, Himachal Pradesh, India
    Lead Software Engineer (TypeScript | React | Node.AWS Serverless) We are looking only for Immediate Joiner.Experience : 7+ Years Mode : Full-Time Location : Remote working About the Role We’re l...Show more
    Last updated: 16 days ago • Promoted
    Ai Web Scraping Engineer

    Ai Web Scraping Engineer

    Alternative Path • Baddi, Republic Of India, IN
    Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm.In this role, you will collaborate with individuals across various company de...Show more
    Last updated: 19 hours ago • Promoted • New!
    Freelance Data Engineer

    Freelance Data Engineer

    Leading MNC • baddi, himachal pradesh, in
    Looking for a Freelance Data Engineer to join a team of rockstar developers.The candidate should have a minimum of 8+ yrs. If you're looking for freelance / part time opportunity (along with your day...Show more
    Last updated: 14 days ago • Promoted
    Blockchain Developer

    Blockchain Developer

    GoQuant • baddi, himachal pradesh, in
    SOLANA BLOCKCHAIN ENGINEER (RUST) (Paid).Job Title : Solana Blockchain Engineer - Smart Contracts & Settlement Infrastructure. Company : GoQuant Technologies Inc.Smart Contract Development (60%).Desig...Show more
    Last updated: 11 hours ago • Promoted • New!
    Freelance Senior Data Engineer (ADF • Databricks • Vectr • Cribl)

    Freelance Senior Data Engineer (ADF • Databricks • Vectr • Cribl)

    ThreatXIntel • baddi, himachal pradesh, in
    ThreatXIntel is a startup cybersecurity company focused on delivering advanced and tailored solutions to protect businesses and organizations from cyber threats. Our expertise spans cloud security, ...Show more
    Last updated: 11 hours ago • Promoted • New!
    Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)

    Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)

    AIMLEAP • baddi, himachal pradesh, in
    Data Engineering Manager – Web Crawling & Pipeline Architecture.Tech / MCA / Computer Science / IT .IT / Data / AI / E-commerce / FinTech / Healthcare . Experience working with cloud platforms such ...Show more
    Last updated: 11 hours ago • Promoted • New!