Talent.com
Data Extraction Engineer
Data Extraction Engineernoon • Haryāna, Republic Of India, IN
Data Extraction Engineer

Data Extraction Engineer

noon • Haryāna, Republic Of India, IN
1 day ago
Job description

Job title : Web Scraping Engineer

Location : Gurgaon

About the Role

We are looking for a Web Scraping Engineer with proven experience in building, maintaining, and scaling data extraction systems - especially from e-commerce platforms .

The ideal candidate will design and implement robust scrapers to collect, clean, and normalize product data (pricing, availability, reviews, images, etc.) that power our analytics and competitive pricing systems.

What you'll do :

  • Develop and maintain scalable scraping pipelines to extract structured data from complex, dynamic e-commerce sites .
  • Design and manage distributed scraping infrastructure (e.G., rotating proxies, request throttling).
  • Handle dynamic rendering, anti-bot measures, and API integrations where applicable.
  • Implement data quality validation, normalization, and deduplication routines
  • Optimize performance and reliability of scrapers with scheduling, retry logic, and monitoring
  • Collaborate with data engineers and product teams to define data needs and integration workflows.
  • Maintain compliance with relevant site terms of service, data usage policies, and legal constraints.

What you'll need :

  • 3+ years of professional experience in web scraping , data extraction , or automation.
  • Strong Python skills with libraries such as Scrapy , BeautifulSoup , Requests , or aiohttp.
  • Experience with proxy management and anti-bot evasion (e.G., rotating IPs, user agents, cookies, CAPTCHA handling).
  • Solid understanding of HTML , JavaScript , and network protocols (HTTP, REST) .
  • Familiarity with GCP or AWS for deploying scalable scraping systems.
  • Experience storing and processing data using MySQL , Elasticsearch , or S3.
  • Experience scraping product pages (ASINs, listings, offers) and / or using Product Advertising API is strongly preferred .
  • Experience with distributed scraping frameworks (e.G., Scrapyd, Airflow, Celery).
  • Knowledge of data pipelines or ETL design.
  • Experience with competitive pricing , catalog mapping , or e-commerce analytics.
  • Familiarity with Docker and CI / CD tools.
  • Who will excel?

  • We’re looking for people with high standards who understand that hard work matters.
  • You need to be relentlessly resourceful and operate with a deep bias for action.
  • We need people with the courage to be fiercely original.
  • noon is not for everyone;
  • readiness to adapt, pivot, and learn is essential.

    If the above values resonate with you, then noon might be the place for you!

    Create a job alert for this search

    Data Engineer • Haryāna, Republic Of India, IN

    Related jobs
    Data Engineer

    Data Engineer

    KKR • haryana, haryana, in
    KKR aims to generate attractive investment returns by following a patient and disciplined investment approach, employing world-class people, and supporting growth in its portfolio companies and com...Show more
    Last updated: 1 day ago • Promoted
    Lead Engineer Data [T500-20747]

    Lead Engineer Data [T500-20747]

    REA Cyber City • haryana, haryana, in
    In 1995, in a garage in Melbourne, Australia, REA Group was born from a simple question : “Can we change the way the world experiences property?”. Fast forward 30 years, REA Group is a market leader ...Show more
    Last updated: 1 day ago • Promoted
    Data Engineer

    Data Engineer

    EXL • haryana, haryana, in
    We are looking for a Python & PySpark developer and data engineer who can design and build solutions for one of our Fortune 500 Client programs in the realm of Financial Master & Reference Data Man...Show more
    Last updated: 1 day ago • Promoted
    Lead GCP Data Engineer

    Lead GCP Data Engineer

    Impetus • haryana, haryana, in
    Lead Data Engineer – GCP (BigQuery • Composer • Python • PySpark).You will lead the design, build and operation of large-scale data platforms on the Google Cloud Platform.You will manage a team of ...Show more
    Last updated: 1 day ago • Promoted
    Data Engineer

    Data Engineer

    CashKaro.com • haryana, haryana, in
    India’s #1 cashback platform, trusted by over 25 million users! We drive more sales for Amazon, Flipkart, Myntra, and Ajio than any other paid channels, including Google and Meta.Backed by legendar...Show more
    Last updated: 15 hours ago • Promoted • New!
    Data Engineer

    Data Engineer

    Azilen Technologies • haryana, haryana, in
    Manage and create design schema, SQL query tuning, and code review.Min 4+ years of professional experience in the field of data engineering with knowledge of the data platform and DWH development.D...Show more
    Last updated: 1 day ago • Promoted
    GCP Data Engineer

    GCP Data Engineer

    Impetus • haryana, haryana, in
    Design, build, and maintain large-scale data pipelines on BigQuery and other Google Cloud Platform (GCP) services.Use Python and PySpark / Spark to transform, clean, aggregate and prepare data for an...Show more
    Last updated: 1 day ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    KOGTA FINANCIAL (INDIA) LIMITED • haryana, haryana, in
    ETL & Data Warehouse Developer.As a key member of our data engineering team, you will be responsible for designing, developing, and optimizing ETL pipelines and scalable data warehouse solutions on...Show more
    Last updated: 1 day ago • Promoted
    GCP + Bigdata Engineer

    GCP + Bigdata Engineer

    Tata Consultancy Services • haryana, haryana, in
    TATA Consultancy Services is looking for Big Data Developer with Hands on experience with GCP.Required experience range : 6 to 8 years. Job locations : Chennai, Gurgaon.Demonstrated leadership in desi...Show more
    Last updated: 1 day ago • Promoted
    Lead Data Engineer

    Lead Data Engineer

    MakeMyTrip • haryana, haryana, in
    At MakeMyTrip (MMT), technology is at the heart of everything we do.As a leading player in the travel industry, we leverage cutting-edge solutions like AI, machine learning, and cloud infrastructur...Show more
    Last updated: 1 day ago • Promoted
    Data Engineer

    Data Engineer

    GMG • haryana, haryana, in
    GMG is a global well-being company retailing, distributing and manufacturing a portfolio of leading international and home-grown brands across sport, food and health sectors.Its vision is to inspir...Show more
    Last updated: 1 day ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Atomic North • haryana, haryana, in
    Azure Databricks, PySpark, Python, SQL, Azure Data Factory, Data Modelling, Data Migration, Data Warehousing.India (Remote / Hybrid options available). Azure Data Factory, Azure Databricks, and Data...Show more
    Last updated: 1 day ago • Promoted
    Data Engineer

    Data Engineer

    Movate • haryana, haryana, in
    Develop data mapping specifications and transformation rules.Perform data cleansing, validation, and reconciliation activities. Create and execute data conversion scripts and processes.Document data...Show more
    Last updated: 1 day ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    GSPANN Technologies, Inc • haryana, haryana, in
    Headquartered in California, U.GSPANN provides consulting and IT services to global clients.We help clients transform how they deliver business value by helping them optimize their IT capabilities,...Show more
    Last updated: 1 day ago • Promoted
    Data Engineer

    Data Engineer

    Terra Technology Circle Consulting Private Limited • haryana, haryana, in
    We are seeking a highly skilled and motivated.In this role, you will design, build, and optimize scalable data pipelines and architectures to support analytics, machine learning, and business intel...Show more
    Last updated: 1 day ago • Promoted
    Data Engineer

    Data Engineer

    Confidential • haryana, haryana, in
    A leading global consulting firm based in Gurugram, India is looking for a Data Engineer.You will work alongside consulting teams to demonstrate AI and data tools, empowering them to leverage techn...Show more
    Last updated: 1 day ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Pacific Data Integrators • haryana, haryana, in
    Shift time : Open to work in EST shift (5PM to 2AM IST).Lead the design, development, and implementation of complex data integration solutions using Informatica Intelligent Data Management Cloud (ID...Show more
    Last updated: 1 day ago • Promoted
    Data Engineer

    Data Engineer

    Sirius AI • haryana, haryana, in
    Sirius AI is a US headquartered AI Consulting services and products company with operations in India.Sirius AI focuses on Financial Services enterprises and solutions / services delivered across mu...Show more
    Last updated: 1 day ago • Promoted