Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)Sixteen Alpha AI • rajahmundry, andhra pradesh, in
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AI • rajahmundry, andhra pradesh, in
11 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

  • Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.
  • Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.
  • Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.
  • Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.
  • Optimize crawling logic for speed, reliability, and stealth across distributed environments.
  • Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

  • Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).
  • Strong understanding of browser automation, session management, and anti-detection mechanisms .
  • Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.
  • Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).
  • Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.
  • Ability to design robust data flows that connect crawling → AI inference → storage / visualization.
  • Nice to Have

  • Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .
  • Experience with data cleaning, deduplication, and normalization pipelines .
  • Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .
  • Prior experience integrating real-time analytics dashboards or monitoring tools.
  • What We Offer

  • Competitive freelance pay based on expertise and delivery.
  • Flexible, async-first remote collaboration.
  • Opportunity to shape an AI-first data platform from the ground up.
  • Potential for long-term partnership if the collaboration is successful.
  • Create a job alert for this search

    Engineer • rajahmundry, andhra pradesh, in

    Related jobs
    Data Engineer

    Data Engineer

    Recro • Rajahmundry, IN
    Data Pipeline Engineering : Design, build, and maintain ingestion, transformation, and storage pipelines using Azure Data Factory, Synapse Analytics, and Data Lake. AI Data Enablement : Collaborate wi...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Innodata Inc. • rajahmundry, andhra pradesh, in
    Design and develop frontend interfaces (React, Angular) tailored for AI-driven workflows and visualization of model outputs. Python Flask / FastAPI) that serve AI models and manage user data securely....Show more
    Last updated: 19 days ago • Promoted
    Web Crawling Engineer

    Web Crawling Engineer

    Forage AI • Rajahmundry, IN
    The ideal candidate will have strong Python programming skills and experience in web scraping frameworks, browser automation tools, and handling anti-scraping mechanisms. Forage AI is a pioneering A...Show more
    Last updated: 12 days ago • Promoted
    Remote GenAI Engineer

    Remote GenAI Engineer

    EazyML • Rajahmundry, IN
    Remote
    Founded by Bell Labs research veterans, and associated with breakthrough startups like Amelia, EazyML, specializes in Transparent Machine Learning. Early on EazyML founders saw the need for Transpa...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer-Agentic AI

    Machine Learning Engineer-Agentic AI

    Innodata Inc. • Rajahmundry, IN
    Design and implement multi-agent systems using LangChain, LangGraph, CrewAI, AutoGen or similar frameworks.Build A2A (agent-to-agent) orchestration and implement MCP (multi-context protocol) for co...Show more
    Last updated: 19 days ago • Promoted
    Lead AI Engineer

    Lead AI Engineer

    Blend • Rajahmundry, IN
    We are looking for an AI Engineer with hands-on experience designing and deploying scalable AI solutions.In this role, you will be part of a cross-functional team working on cutting-edge projects i...Show more
    Last updated: 15 days ago • Promoted
    Full Stack AI Developer

    Full Stack AI Developer

    HJ Recruitment • rajahmundry, andhra pradesh, in
    TypeScript • LLMs • Agents • RAG).We’re building next-generation AI systems real agents, real intelligence, real impact.If you want to push the frontier of what’s possible with LLMs, autonomous wor...Show more
    Last updated: 6 days ago • Promoted
    Technical Lead - Gen AI

    Technical Lead - Gen AI

    Aceolution • Rajahmundry, IN
    Freelance Remote Opportunity : Tech Lead – GenAI Code Initiatives.Tech Lead / Senior Software Engineer.AI-driven code generation systems. Write, evaluate, and refine complex code solutions.This is a ...Show more
    Last updated: 30+ days ago • Promoted
    Snowflake Data Engineer

    Snowflake Data Engineer

    Live Connections • rajahmundry, andhra pradesh, in
    Role - Snowflake Data Engineer.Required Notice Period - Immediate Joiner.To apply, connect with Abhishek via abhishek.Show more
    Last updated: 9 days ago • Promoted
    Looker Developer

    Looker Developer

    RiDiK (a Subsidiary of CLPS. Nasdaq : CLPS) • Rajahmundry, IN
    We are looking for a skilled BI Developer to support a large client in the Healthcare domain.The role will involve enhancing and maintaining reports using Looker / LookML, developing dimensional data...Show more
    Last updated: 19 days ago • Promoted
    AI Platform Engineer

    AI Platform Engineer

    BayOne Solutions • Rajahmundry, IN
    We are seeking a highly skilled.In this role, you will work on advanced AI systems including.Retrieval-Augmented Generation (RAG). Model Context Protocol (MCP) tools.OpenWebUI or custom-built soluti...Show more
    Last updated: 10 days ago • Promoted
    Web Designer

    Web Designer

    Sweet • Rajahmundry, IN
    Project-based, potential for ongoing work).Sweet is the AI-native business platform built for creators — a business partner that clears the clutter, automates the back-office, and gives creators th...Show more
    Last updated: 11 hours ago • Promoted • New!
    WebMethod Developer

    WebMethod Developer

    MSH India • Rajahmundry, IN
    Hiring : Senior webMethods Consultant | 8+ Years Experience | Offshore (India).Join our Integration team as a Senior webMethods Consultant!. We’re looking for an experienced professional who can desi...Show more
    Last updated: 30+ days ago • Promoted
    Python Automation & Web Scraping Engineer (2 to 4 yrs)

    Python Automation & Web Scraping Engineer (2 to 4 yrs)

    AIMLEAP • Rajahmundry, IN
    Python Automation & Web Scraping Engineer (WFH).Bachelor’s degree in Computer Science / Information Technology.Selenium, BeautifulSoup, Requests. Experience in backend / API development using.Strong s...Show more
    Last updated: 11 hours ago • Promoted • New!
    Full Stack Engineer

    Full Stack Engineer

    Programmers.io • rajahmundry, andhra pradesh, in
    Job Title : Senior Full Stack Developer (Laravel + Vue).We are seeking highly skilled Senior Full Stack Developers with 7–10 years of experience in Laravel and modern frontend frameworks (Vue.The ca...Show more
    Last updated: 9 days ago • Promoted
    (Laravel / PHP) Web developer with React Native Experience

    (Laravel / PHP) Web developer with React Native Experience

    TellByte • Rajahmundry, IN
    PHP / Laravel applications into a.The ideal candidate will have a solid background in backend development, database management, and API design, with hands-on experience enabling smooth integration wi...Show more
    Last updated: 1 day ago • Promoted
    Senior.Net Web Developer and SQL Expert

    Senior.Net Web Developer and SQL Expert

    Atigro • Rajahmundry, IN
    Net developer with a passion for cutting edge? Join Atigro and play a key role in shaping the future of AI-powered enterprise solutions! We’re a fast-growing AI team working on innovative, challeng...Show more
    Last updated: 19 days ago • Promoted
    Full Stack AI engineer

    Full Stack AI engineer

    AnswerThis (YC F25) • Rajahmundry, IN
    Remote (Applications open worldwide).Semantic Search, Vector Databases, Prompt Engineering, GenAI Frameworks, React Agents, Graph Agents, Document Parsing, Python, Scalable APIs.AnswerThis is an AI...Show more
    Last updated: 30+ days ago • Promoted