Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AInagpur, maharashtra, in
8 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

  • Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.
  • Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.
  • Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.
  • Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.
  • Optimize crawling logic for speed, reliability, and stealth across distributed environments.
  • Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

  • Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).
  • Strong understanding of browser automation, session management, and anti-detection mechanisms .
  • Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.
  • Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).
  • Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.
  • Ability to design robust data flows that connect crawling → AI inference → storage / visualization.
  • Nice to Have

  • Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .
  • Experience with data cleaning, deduplication, and normalization pipelines .
  • Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .
  • Prior experience integrating real-time analytics dashboards or monitoring tools.
  • What We Offer

  • Competitive freelance pay based on expertise and delivery.
  • Flexible, async-first remote collaboration.
  • Opportunity to shape an AI-first data platform from the ground up.
  • Potential for long-term partnership if the collaboration is successful.
  • Create a job alert for this search

    Engineer • nagpur, maharashtra, in

    Related jobs
    • Promoted
    AI-Powered Web Crawling Specialist (Freelance)

    AI-Powered Web Crawling Specialist (Freelance)

    Sixteen Alpha AINew Delhi, Republic Of India, IN
    The crawler will be integrated with an.AI or NLP / LLM-based components.JavaScript, infinite scrolling, or APIs.LLM-based data labeling, or automated content enrichment modules.Airflow, Prefect, or c...Show moreLast updated: 7 days ago
    • Promoted
    • New!
    Databricks Gen AI Engineer

    Databricks Gen AI Engineer

    SyrenNagpur, IN
    Model Serving, Vector Search, and embedding workflows.Clustering, Unity Catalog, Delta Lake).OpenAI / Azure OpenAI), and GenAI app patterns (RAG / Agents). Proficiency in SQL, Spark performance tuning...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Senior Full-Stack AI Engineer

    Senior Full-Stack AI Engineer

    BayInfotechNagpur, IN
    In order to proceed further, Please take the test.Senior Full-Stack AI Engineer – AI-Enabled Help Desk (GCP).Please share a working public URL. Own architecture and implementation of a.RAG-based AI ...Show moreLast updated: 14 hours ago
    • Promoted
    Web Crawling Engineer

    Web Crawling Engineer

    Forage AINagpur, IN
    The ideal candidate will have strong Python programming skills and experience in web scraping frameworks, browser automation tools, and handling anti-scraping mechanisms. Forage AI is a pioneering A...Show moreLast updated: 9 days ago
    • Promoted
    Web Analytics & Tracking Lead

    Web Analytics & Tracking Lead

    The Conqueror ChallengesIndia, India
    We are a growing team of passionate, performance-driven individuals on a mission to be the best at growing multiple international e-commerce businesses with great products.Over the past 8 years, we...Show moreLast updated: 27 days ago
    • Promoted
    Ai Web Scraping Engineer

    Ai Web Scraping Engineer

    S2T AI - AI-Powered InvestigationsRepublic Of India, IN
    We're seeking a forward-thinking.AI tools to accelerate development and streamline data extraction processes.Join our India team and work at the intersection of traditional scraping expertise and c...Show moreLast updated: 30+ days ago
    • Promoted
    Web Intelligence Engineer

    Web Intelligence Engineer

    Forage AIRepublic Of India, IN
    The ideal candidate will have strong Python programming skills and experience in web scraping frameworks, browser automation tools, and handling anti-scraping mechanisms. Forage AI is a pioneering A...Show moreLast updated: 9 days ago
    • Promoted
    Full Stack AI engineer

    Full Stack AI engineer

    AnswerThis (YC F25)Nagpur, IN
    Remote (Applications open worldwide).Semantic Search, Vector Databases, Prompt Engineering, GenAI Frameworks, React Agents, Graph Agents, Document Parsing, Python, Scalable APIs.AnswerThis is an AI...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Backend Engineer - AI-Powered Influencer Platform (REMOTE)

    Senior Backend Engineer - AI-Powered Influencer Platform (REMOTE)

    Fame Keeda | Influencer Marketing AgencyNagpur, IN
    Remote
    Senior Backend Engineer - AI-Powered Influencer Platform.Want to build AI-powered systems at scale? Fame Keeda is revolutionizing influencer marketing with intelligent discovery, real-time tracking...Show moreLast updated: 18 hours ago
    • Promoted
    Data Engineer

    Data Engineer

    RecroNagpur, IN
    Data Pipeline Engineering : Design, build, and maintain ingestion, transformation, and storage pipelines using Azure Data Factory, Synapse Analytics, and Data Lake. AI Data Enablement : Collaborate wi...Show moreLast updated: 30+ days ago
    • Promoted
    AI Web Scraping Engineer

    AI Web Scraping Engineer

    S2T AI - AI-Powered InvestigationsIndia, India
    We're seeking a forward-thinking.AI tools to accelerate development and streamline data extraction processes.Join our India team and work at the intersection of traditional scraping expertise and c...Show moreLast updated: 30+ days ago
    • Promoted
    Forward Deployed Engineer

    Forward Deployed Engineer

    Searchability®Nagpur, IN
    Forward Deployed Engineer - AI💻.Remote-based - relocation to Dubai📍.Searchability MENA is working with an innovative AI startup looking for a. This is a rare chance to get involved with a company ...Show moreLast updated: 3 days ago
    • Promoted
    Remote GenAI Engineer

    Remote GenAI Engineer

    EazyMLNagpur, IN
    Remote
    Founded by Bell Labs research veterans, and associated with breakthrough startups like Amelia, EazyML, specializes in Transparent Machine Learning. Early on EazyML founders saw the need for Transpa...Show moreLast updated: 30+ days ago
    • Promoted
    Freelance Deep Web Crawler Engineer (Ai-Integrated Data Pipeline)

    Freelance Deep Web Crawler Engineer (Ai-Integrated Data Pipeline)

    Sixteen Alpha AINew Delhi, Republic Of India, IN
    The crawler will be integrated with an.AI or NLP / LLM-based components.JavaScript, infinite scrolling, or APIs.LLM-based data labeling, or automated content enrichment modules.Airflow, Prefect, or c...Show moreLast updated: 8 days ago
    • Promoted
    Senior Web Scraping Engineer

    Senior Web Scraping Engineer

    ZomunkRepublic Of India, IN
    We're building a product that relies heavily on collecting structured data from a number of known websites.We need someone experienced who can own this part of the system end-to-end;.We're looking ...Show moreLast updated: 2 days ago
    • Promoted
    Data Engineer - Web Scraping

    Data Engineer - Web Scraping

    Alternative PathNagpur, IN
    Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm.In this role, you will collaborate with individuals across various company de...Show moreLast updated: 30+ days ago
    • Promoted
    Freelance Webflow Developer

    Freelance Webflow Developer

    Black Pianonagpur, maharashtra, in
    We are looking for a talented and detail-oriented.In this role, you will be responsible for translating high-fidelity designs into responsive, pixel-perfect websites using Webflow.You’ll collaborat...Show moreLast updated: 15 days ago
    • Promoted
    Data Crawl & Pipeline Engineer

    Data Crawl & Pipeline Engineer

    Forage AIRepublic Of India, IN
    The ideal candidate will have strong Python programming skills and experience in web scraping frameworks, browser automation tools, and handling anti-scraping mechanisms. Forage AI is a pioneering A...Show moreLast updated: 1 day ago