Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)Sixteen Alpha AI • sangli, maharashtra, in
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)

Sixteen Alpha AI • sangli, maharashtra, in
16 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

  • Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.
  • Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.
  • Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.
  • Integrate with vector databases , LLM-based data labeling, or automated content enrichment modules.
  • Optimize crawling logic for speed, reliability, and stealth across distributed environments.
  • Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

  • Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).
  • Strong understanding of browser automation, session management, and anti-detection mechanisms .
  • Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.
  • Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).
  • Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.
  • Ability to design robust data flows that connect crawling → AI inference → storage / visualization.
  • Nice to Have

  • Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models) .
  • Experience with data cleaning, deduplication, and normalization pipelines .
  • Familiarity with distributed crawling frameworks (Ray, Celery, Kafka) .
  • Prior experience integrating real-time analytics dashboards or monitoring tools.
  • What We Offer

  • Competitive freelance pay based on expertise and delivery.
  • Flexible, async-first remote collaboration.
  • Opportunity to shape an AI-first data platform from the ground up.
  • Potential for long-term partnership if the collaboration is successful.
  • Create a job alert for this search

    Engineer • sangli, maharashtra, in

    Related jobs
    AI Engineer - GPT / LangChain / RAG / Data Pipelines

    AI Engineer - GPT / LangChain / RAG / Data Pipelines

    Peak Trust Global Real Estate • sangli, maharashtra, in
    This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.Key Responsibilities (Technical). Build automated data collection workflows using tools suc...Show more
    Last updated: 4 hours ago • Promoted • New!
    Full Stack Engineer

    Full Stack Engineer

    Insight Global • sangli, maharashtra, in
    Duration : 6 month contract with potential to convert permanent.JS; primary codebase is frontend-heavy.Proficient with Git for source code management. Hands on experience with AWS Elastic Beans, EC2,...Show more
    Last updated: 30+ days ago • Promoted
    Full Stack Web Developer

    Full Stack Web Developer

    SME Solutions Advisory LLP • sangli, maharashtra, in
    MSMEs, SMEs, and Startups across India, offering end-to-end handholding to help businesses scale confidently.We specialise in Business Consulting, MSME Incubation Support, Fund-Raising Advisory (De...Show more
    Last updated: 4 hours ago • Promoted • New!
    Generative AI Engineer

    Generative AI Engineer

    Live Connections • sangli, maharashtra, in
    Required Notice Period - Immediate Joiners or Serving Notice or 30 days.Bachelor’s in CS / ML / AI or related field; Master’s or PhD preferred. ML / Data Science with a focus on generative AI, LLMs, or co...Show more
    Last updated: 16 days ago • Promoted
    Full-Stack Developer - 20414

    Full-Stack Developer - 20414

    Turing • Sangli, Maharashtra, India
    Role Overview : Turing is looking for experienced Full Stack Developers to build modern solutions that power AI products and evaluation workflows. React / Angular / Vue) to implement features, improve c...Show more
    Last updated: 30+ days ago • Promoted
    Python Developer

    Python Developer

    TekXera • sangli, maharashtra, in
    Senior Python Engineer – Service Implementation.India | Pakistan | Nigeria | Kenya | Egypt | Ghana | Bangladesh | Turkey | Mexico. Full-Time Contract (9 Months, Extendable).San Francisco–based AI re...Show more
    Last updated: 4 hours ago • Promoted • New!
    Python Web Scraping Engineer – Automation (3 to 10 yrs)

    Python Web Scraping Engineer – Automation (3 to 10 yrs)

    AIMLEAP • sangli, maharashtra, in
    Python Web Scraping Engineer – Advanced Automation (WFH).Bachelor’s degree in Computer Science, IT, or related field .IT / Software Services / Data & AI . Strong hands-on experience handling.Seleniu...Show more
    Last updated: 4 hours ago • Promoted • New!
    Full Stack Engineer

    Full Stack Engineer

    AideWiser SolTek • sangli, maharashtra, in
    AWS (EC2, Lambda, S3, RDS, DynamoDB, etc.Design, develop, and maintain backend services using.Net Core / MVC and frontend components using React. Build and scale backend systems on AWS cloud infrastru...Show more
    Last updated: 30+ days ago • Promoted
    Full Stack Engineer | FINJO I971

    Full Stack Engineer | FINJO I971

    Omni Recruit Private Limited • Sangli, Maharashtra, India
    Role Summary We are seeking a Full Stack Engineer with strong experience in Python (FastAPI), PostgreSQL, React, GitHub, and Docker. The ideal candidate should have prior experience in a product com...Show more
    Last updated: 2 hours ago • Promoted • New!
    Web Tech Developer

    Web Tech Developer

    Talentgigs • sangli, maharashtra, in
    To Be filled by Hiring Manager.Salary Bracket and variable pay if applicable 8 to 12 LPA (Fixed CTC).Work Location - Hybrid / Work from Office Work From Office - Coimbatore.Total Years of Experienc...Show more
    Last updated: 4 hours ago • Promoted • New!
    Freelance Data Engineer

    Freelance Data Engineer

    Leading MNC • sangli, maharashtra, in
    Looking for a Freelance Data Engineer to join a team of rockstar developers.The candidate should have a minimum of 8+ yrs. If you're looking for freelance / part time opportunity (along with your day...Show more
    Last updated: 13 days ago • Promoted
    Founding Full-Stack Developer (MERN)- 3+ Year Experience

    Founding Full-Stack Developer (MERN)- 3+ Year Experience

    HILL QUEEN TEA • sangli, maharashtra, in
    Founding Full-Stack Developer (MERN) - Backend Focused.This role is not for current students, interns, or freshers! Please avoid applying!. Only apply if you have more than 3+years of professional e...Show more
    Last updated: 4 hours ago • Promoted • New!
    Senior Web Developer (Full Stack)

    Senior Web Developer (Full Stack)

    Gem3s Technologies Pvt. Ltd. • sangli, maharashtra, in
    We are seeking a highly skilled Full Stack Developer who is proficient in both front-end and back-end development.The ideal candidate will have experience with all stages of software development an...Show more
    Last updated: 4 hours ago • Promoted • New!
    Shopify Developer

    Shopify Developer

    Work Store Limited • Sangli, Maharashtra, India
    Job Title : Shopify Developer Responsibilities : - Problem-solving : Analyze business requirements and develop creative Shopify solutions to meet project objectives. Timely delivery : Efficiently mana...Show more
    Last updated: 8 days ago • Promoted
    Full Stack Engineer

    Full Stack Engineer

    DigiFocal IT Solutions Pvt Ltd • sangli, maharashtra, in
    We’re Hiring : Full Stack Developer (Node.Are you a passionate Full Stack Developer who loves building scalable, high-performance applications? We’re looking for a talented engineer to join our grow...Show more
    Last updated: 4 hours ago • Promoted • New!
    AI Software developer

    AI Software developer

    Hello Energy • sangli, maharashtra, in
    We are looking for a Software developer with AI specialisation, that can automate our services with AI.You will be responsible within our Product & tech team to build AI-powered tooling and feature...Show more
    Last updated: 4 hours ago • Promoted • New!
    Web & Product Engineer

    Web & Product Engineer

    Kapable • sangli, maharashtra, in
    Kapable is a leadership transformation platform helping CXOs, founders, and senior professionals from top global companies become better leaders through our “Thinkable, Speakable, Workable” program...Show more
    Last updated: 4 hours ago • Promoted • New!
    Full Stack Engineer

    Full Stack Engineer

    Programmers.io • sangli, maharashtra, in
    Job Title : Senior Full Stack Developer (Laravel + Vue).We are seeking highly skilled Senior Full Stack Developers with 7–10 years of experience in Laravel and modern frontend frameworks (Vue.The ca...Show more
    Last updated: 14 days ago • Promoted