Talent.com
Data Crawling and Scraping Engineer

Data Crawling and Scraping Engineer

Forage AIRepublic Of India, IN
7 days ago
Job description

We are seeking a Web Crawling Engineer who will be responsible for building and maintaining web crawlers, extracting valuable insights from the web, and ensuring data quality. The ideal candidate will have strong Python programming skills and experience in web scraping frameworks, browser automation tools, and handling anti-scraping mechanisms.

Salary budget upto - Rs. 9 LPA

About Forage AI : Forage AI is a pioneering AI-powered data extraction and automation company that transforms complex, unstructured web and document data into clean, structured intelligence. Our platform combines web crawling, NLP, LLMs, and agentic AI to deliver highly accurate firmographic and enterprise insights across numerous domains. Trusted by global clients in finance, real estate, and healthcare, Forage AI enables businesses to automate workflows, reduce manual rework, and access high-quality data at scale.

Key Responsibilities :

  • Maintain and enhance existing web scraping and data crawling projects.
  • Develop and refine crawlers using Python-based tools and frameworks.
  • Utilize browser automation tools (e.G., Playwright, Selenium) to handle dynamic content.
  • Clean, validate, and integrate extracted data into downstream storage systems.
  • Implement and manage solutions for anti-bot measures (CAPTCHAs, IP rotation, etc.).
  • Optimize crawling efficiency and ensure compliance with web crawling best practices.
  • Collaborate with cross-functional teams to improve data acquisition strategies.

Required Skills & Qualifications :

  • Proficiency in Python and 2 years of work experience of web scraping frameworks (especially Scrapy).
  • Strong knowledge of browser automation tools such as Playwright or Selenium.
  • Solid understanding of HTML, CSS, and selector languages (XPath / CSS).
  • Experience in handling anti-scraping challenges and ensuring robust data extraction.
  • Familiarity with distributed scraping techniques and data pipelines.
  • Ability to troubleshoot and optimize web crawlers for performance and reliability.
  • Strong analytical and problem-solving skills with attention to detail.
  • Excellent communication and inter-personal skills.
  • Other Infrastructure Requirements

    Since this is a completely work-from-home position, you will also require the following -

  • High-speed internet connectivity for video calls and efficient work.
  • Capable business-grade computer (e.G., modern processor, 8 GB+ of RAM, and
  • no other obstacles to interrupted, efficient work).

  • Headphones with clear audio quality.
  • Stable power connection and backups in case of internet / power failure.
  • Create a job alert for this search

    Data Engineer • Republic Of India, IN

    Related jobs
    • Promoted
    AWS Data Engineer

    AWS Data Engineer

    Tata Consultancy ServicesIndia
    TCS is Hiring AWS Data Engineer Bangalore location.Strong hands-on experience in Python programming and PySpark.Experience using AWS services (RedShift, Glue, EMR, S3 & Lambda).Experience working w...Show moreLast updated: 15 days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Tata Consultancy ServicesNagpur, IN
    Job Title : Senior Data Engineer.Required Skillset : Python, Spark, Databricks, AWS (S3, Glue, AirFlow, Cloudwatch, Lambda). Master Databricks tools (job creation, cluster, notebook) and be able to qu...Show moreLast updated: 30+ days ago
    • Promoted
    GCP Data Engineer

    GCP Data Engineer

    AdastraNagpur, IN
    We are looking for a proactive and solution-oriented GCP Data Engineer to join our team.This role requires hands-on experience in Google Cloud Platform (GCP), especially with BigQuery and Airflow, ...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer

    Data Engineer

    DigitalzoneNagpur, IN
    As a Data Engineer, you will design, build, and optimize data pipelines and real-time systems that power AI-driven decisioning and analytics. Develop and maintain scalable ETL / ELT pipelines using Py...Show moreLast updated: 15 days ago
    • Promoted
    • New!
    Senior Web Scraping Engineer

    Senior Web Scraping Engineer

    ZomunkIndia
    We're building a product that relies heavily on collecting structured data from a number of known websites.We need someone experienced who can own this part of the system end-to-end; from writing s...Show moreLast updated: 20 hours ago
    • Promoted
    Senior AWS Data Engineer

    Senior AWS Data Engineer

    CYAN360India, India
    Position : Senior AWS Data Engineer.Work Timings : 2 : 30 PM to 11 : 30 PM IST.Need someone who can join immediately or in 15 days • • •. Design, develop, and deploy end-to-end data pipelines on AWS cloud in...Show moreLast updated: 30+ days ago
    • Promoted
    GCP Data Engineer

    GCP Data Engineer

    HCLTechNagpur, IN
    Looking for 5+ Years of experience.Storage Classes, Dataflow, Big query, Pyspark / Python, Airflow.Show moreLast updated: 15 days ago
    • Promoted
    • New!
    Web Scraping Engineer

    Web Scraping Engineer

    noonIndia
    Job title : Web Scraping Engineer.The ideal candidate will design and implement robust scrapers to collect, clean, and normalize product data (pricing, availability, reviews, images, etc.Develop and...Show moreLast updated: 20 hours ago
    • Promoted
    Data Engineer - Web Scraping

    Data Engineer - Web Scraping

    Alternative PathIndia, India
    Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm.In this role, you will collaborate with individuals across various company de...Show moreLast updated: 30+ days ago
    • Promoted
    GCP Data Engineer

    GCP Data Engineer

    LTIMindtreeNagpur, IN
    Greetings from LTIMindtree !!!.We are really impressed by your GCP experience.We are having multiple opportunities and projects which are based on GCP, Bigquery. I’d love to tell you a little more a...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    One Tapp ConsultingNagpur, IN
    Design and Development : Design, build, and maintain robust, scalable, and optimized ETL / ELT.Python and standard data warehousing principles. Data Platform Management : Implement and manage data proce...Show moreLast updated: 20 hours ago
    • Promoted
    Data Engineer

    Data Engineer

    RecroNagpur, IN
    Data Pipeline Engineering : Design, build, and maintain ingestion, transformation, and storage pipelines using Azure Data Factory, Synapse Analytics, and Data Lake. AI Data Enablement : Collaborate wi...Show moreLast updated: 30+ days ago
    • Promoted
    AI Web Scraping Engineer

    AI Web Scraping Engineer

    S2T AI - AI-Powered InvestigationsIndia, India
    We're seeking a forward-thinking.AI tools to accelerate development and streamline data extraction processes.Join our India team and work at the intersection of traditional scraping expertise and c...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Data Engineer

    Senior Data Engineer

    VeraxionNagpur, IN
    We are looking for a Senior Data Engineer who can design, build, and scale modern data platforms that power analytics, decision-making, and AI across the organization. If you love solving complex da...Show moreLast updated: 20 hours ago
    • Promoted
    Data Engineer (GCP)

    Data Engineer (GCP)

    HISH IT SERVICESNagpur, IN
    We have a new urgent GCP Data Engineer opportunity open to support a migration initiative from Teradata to Cerebro (BigQuery). This role requires a hands-on developer who can collaborate closely wit...Show moreLast updated: 6 days ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    Natlov Technologies Pvt LtdNagpur, IN
    Hiring : Cloud Data Engineer | Remote | Full-Time | Immediate Joiner.Ltd is looking for a Cloud Data Engineer with strong AWS expertise and working experience in Azure. In this role, you will design,...Show moreLast updated: 20 hours ago
    • Promoted
    • New!
    Senior Data Engineer (AWS / GCP)

    Senior Data Engineer (AWS / GCP)

    ArmakuniNagpur, IN
    We are looking for a Sr Data Engineer who is passionate about managing large volumes of data, eager to learn and take on challenges, and committed to delivering exceptional results.If you are a tea...Show moreLast updated: 20 hours ago
    • Promoted
    • New!
    AWS Data Engineer

    AWS Data Engineer

    Atyeti IncNagpur, IN
    Looking for Data Engineer who will be responsible for design, build and maintenance of data pipelines running on Airflow, Spark on the AWS Cloud platform. Build and maintain all facets of Data Pipel...Show moreLast updated: 20 hours ago