Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Sixteen Alpha AI • Bhubaneswar, Orissa, India
15 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources, including sites behind authentication, infinite-scroll pages, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

  • Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.
  • Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.
  • Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.
  • Integrate with vector databases, LLM-based data labeling, or automated content enrichment modules.
  • Optimize crawling logic for speed, reliability, and stealth across distributed environments.
  • Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

  • Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).
  • Strong understanding of browser automation, session management, and anti-detection mechanisms.
  • Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.
  • Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).
  • Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.
  • Ability to design robust data flows that connect crawling → AI inference → storage / visualization.

Nice to Have

  • Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models).
  • Experience with data cleaning, deduplication, and normalization pipelines.
  • Familiarity with distributed crawling frameworks (Ray, Celery, Kafka).
  • Prior experience integrating real-time analytics dashboards or monitoring tools.

What We Offer

  • Competitive freelance pay based on expertise and delivery.
  • Flexible, async-first remote collaboration.
  • Opportunity to shape an AI-first data platform from the ground up.
  • Potential for long-term partnership if the collaboration is successful.