Talent.com
Freelance Deep Web Crawler Engineer (AI-Integrated Data Pipeline)
Sixteen Alpha AI • bhubaneswar, orissa, in
15 days ago
Job description

About the Project

We’re developing a next-generation intelligent web crawling system capable of exploring deep and dynamic web data sources — including sites behind authentication, infinite scrolls, and JavaScript-heavy pages.

The crawler will be integrated with an AI-driven pipeline for automated data understanding, classification, and transformation.

We’re looking for a highly experienced engineer who has previously built large-scale, distributed crawling frameworks and integrated AI or NLP / LLM-based components for contextual data extraction.

Key Responsibilities

  • Design, develop, and deploy scalable deep web crawlers capable of bypassing common anti-bot mechanisms.
  • Implement AI-integrated pipelines for data processing, entity extraction, and semantic categorization.
  • Develop dynamic scraping systems for sites that rely on JavaScript, infinite scrolling, or APIs.
  • Integrate with vector databases, LLM-based data labeling, or automated content enrichment modules.
  • Optimize crawling logic for speed, reliability, and stealth across distributed environments.
  • Collaborate on data pipeline orchestration using tools like Airflow, Prefect, or custom async architectures.

Required Expertise

  • Proven experience building deep or dark web crawlers (Playwright, Scrapy, Puppeteer, or custom async frameworks).
  • Strong understanding of browser automation, session management, and anti-detection mechanisms.
  • Experience integrating AI / ML / NLP pipelines — e.g., text classification, entity recognition, or embedding-based similarity.
  • Skilled in asynchronous Python (asyncio, aiohttp, Playwright async API).
  • Familiar with database and pipeline systems — PostgreSQL, MongoDB, Elasticsearch, or similar.
  • Ability to design robust data flows that connect crawling → AI inference → storage / visualization.

Nice to Have

  • Knowledge of LLMs (OpenAI, Hugging Face, LangChain, or custom fine-tuned models).
  • Experience with data cleaning, deduplication, and normalization pipelines.
  • Familiarity with distributed crawling frameworks (Ray, Celery, Kafka).
  • Prior experience integrating real-time analytics dashboards or monitoring tools.

What We Offer

  • Competitive freelance pay based on expertise and delivery.
  • Flexible, async-first remote collaboration.
  • Opportunity to shape an AI-first data platform from the ground up.
  • Potential for long-term partnership if the collaboration is successful.