Talent.com
This job offer is not available in your country.
Data Engineer - Web Scrapping & Enrichment

Data Engineer - Web Scrapping & Enrichment

Gala IntelligenceThane, IN
30+ days ago
Job description

We're looking for an entrepreneurial, passionate, and driven Data Engineer to join Startup Gala Intelligence backed by Navneet Tech Venture situated in Ahmedabad . As we're building our technology platform from scratch, you'll have the unique opportunity to shape our technology vision, architecture, and engineering culture right from the ground up. You’ll directly contribute to foundational development and establish best practices, while eventually building and contributing to our engineering team.

This role is ideal for someone eager to own the entire tech stack, who thrives on early-stage challenges, and loves building innovative, scalable solutions from day zero.

What You’ll Do

  • Web Scraping & Crawling : Build and maintain automated scrapers to extract structured and unstructured data from websites, APIs, and public datasets.
  • Scalable Scraping Systems : Develop multi-threaded, distributed crawlers capable of handling high-volume data collection without interruptions.
  • Data Parsing & Cleaning : Normalize scraped data, remove noise, and ensure consistency before passing to data pipelines.
  • Anti-bot & Evasion Tactics : Implement proxy rotation, captcha solving, and request throttling techniques to handle scraping restrictions.
  • Integration with Pipelines : Deliver clean, structured datasets into NoSQL stores and ETL pipelines for further enrichment and graph-based storage.
  • Data Quality & Validation : Ensure data accuracy, deduplicate records, and maintain a trust scoring system for data confidence.
  • Documentation & Maintenance : Keep scrapers updated when websites change, and document scraping logic for reproducibility.

Who You Are

Technical Skills :

  • 2+ years of experience in web scraping , crawling, or data collection.
  • Strong proficiency in Python (libraries like BeautifulSoup, Scrapy, Selenium, Playwright, Requests).
  • Familiarity with NoSQL databases (MongoDB, DynamoDB) and data serialization formats (JSON, CSV, Parquet).
  • Experience in handling large-scale scraping with proxy management and rate-limiting.
  • Basic knowledge of ETL processes and integration with data pipelines.
  • Exposure to graph databases (Neo4j) is a plus.
  • Soft Skills :

  • Detail-oriented, ensuring accuracy and reliability of collected data.
  • Strong problem-solving skills, particularly in adapting scrapers to evolving web structures.
  • Curious mindset with a drive to discover new data sources.
  • Comfortable working in a fast-paced, early-stage startup environment.
  • Who We Are & Our Culture

    Gala Intelligence , backed by Navneet Tech Ventures , is a tech-driven startup dedicated to solving one of the most pressing business challenges - fraud detection and prevention. We're building cutting-edge, real-time products designed to empower consumers and businesses to stay ahead of fraudsters, leveraging innovative technology and deep domain expertise.

    Our culture and values :

    We’re united by a single, critical mission - stopping fraud before it impacts businesses. Curiosity, innovation, and proactive action define our approach. We value transparency, collaboration, and individual ownership, creating an environment where talented people can do their best work.

  • Problem-Driven Innovation : We're deeply committed to solving real challenges that genuinely matter for our customers.
  • Rapid Action & Ownership : We encourage autonomy and accountability—own your projects, move quickly, and shape the future of Gala Intelligence.
  • Collaborative Excellence : Cross-team collaboration ensures alignment, sparks innovation, and drives us forward together.
  • Continuous Learning : Fraud evolves rapidly, and so do we. Continuous improvement, experimentation, and learning are core to our success.
  • If you're excited by the opportunity to leverage technology in the fight against fraud, and you're ready to build something impactful from day one, we want to hear from you!

    Create a job alert for this search

    Data Engineer • Thane, IN

    Related jobs
    • Promoted
    Data Engineer

    Data Engineer

    Innodata Inc.Thane, IN
    CI / CD practices, Databricks (Spark), Python, Github and SQL.The ideal candidate should have hands-on expertise in building and automating data pipelines, managing multi-environment deployments, and...Show moreLast updated: 26 days ago
    • Promoted
    • New!
    Data Engineer AWS

    Data Engineer AWS

    Anicalls (Pty) Ltdmumbai, India
    Strong computer science fundamentals such as algorithms, data structures, multithreading, object-oriented development, distributed applications, client-server architecture.Design and implement Mach...Show moreLast updated: 5 hours ago
    • Promoted
    • New!
    GCP Data Engineer

    GCP Data Engineer

    TalentOlamumbai, India
    Primary skill - Google BigQuery.GCP Senior Data Engineer Job description as below : .Solid understanding of GCP ETL framework. Solid knowledge about develop robust, scalable, reusable and efficient ET...Show moreLast updated: 5 hours ago
    • Promoted
    Backend Developer / Data Scraping / Kubernetes

    Backend Developer / Data Scraping / Kubernetes

    Daply | Scaling Digital PublishingKalyan-Dombivli, IN
    IN ORDER TO BE CONSIDERED FOR THIS ROLE, YOU MUST SUBMIT AN APPLICATION HERE : .Web Scraping / Captcha Control.Show moreLast updated: 8 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Kline + CompanyMumbai, IN
    Here at Kline our data capabilities have grown exponentially over the last four years.Having gone through a rapid digitization process and becoming a cloud-native corporation, we are looking for to...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer

    Data Engineer

    Canopus Infosystems - A CMMI Level 3 Companymumbai city, maharashtra, in
    Python expertise and hands-on experience in handling large datasets, data cleaning, analysis, and visualization.The ideal candidate should be capable of building data pipelines, performing web scra...Show moreLast updated: 18 days ago
    • Promoted
    AI Web Scraping Engineer

    AI Web Scraping Engineer

    S2T AI - AI-Powered InvestigationsMumbai, IN
    We're seeking a forward-thinking.AI tools to accelerate development and streamline data extraction processes.Join our India team and work at the intersection of traditional scraping expertise and c...Show moreLast updated: 1 day ago
    • Promoted
    Senior Web Scraping Engineer

    Senior Web Scraping Engineer

    S2T AI - AI-Powered InvestigationsThane, IN
    We are on the lookout for a highly competent, self-motivated.Gather and process raw data at scale (including writing scripts, web scraping, calling APIs). Able to work independently to complete assi...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Kumaran SystemsThane, IN
    Strong hands-on experience in Databricks.Proven expertise in building and managing data ingestion pipelines.Exposure to Databricks data ingestion jobs along with incident management frameworks.Expe...Show moreLast updated: 8 days ago
    • Promoted
    • New!
    Data Engineer (AWS) Mumbai

    Data Engineer (AWS) Mumbai

    Lumiqmumbai, India
    We are seeking a highly skilled Data Engineer to join our dynamic team.As a Data Engineer at LUMIQ, you will play a.You will work at the intersection of cloud technologies, data engineering, and th...Show moreLast updated: 5 hours ago
    • Promoted
    • New!
    Data - AWS engineer

    Data - AWS engineer

    Anicalls (Pty) Ltdmumbai, India
    Equal or better knowledge of Redshift.Good knowledge of ETL & Data Warehousing concepts with hands-on experience.Good knowledge of AWS services like EC2, S3, Lambda, Glue, Athena, Kinesis data stre...Show moreLast updated: 5 hours ago
    • Promoted
    Data Engineer - Web Scraping

    Data Engineer - Web Scraping

    Alternative PathThane, IN
    Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm.In this role, you will collaborate with individuals across various company de...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Bahwan CyberTekKalyan-Dombivli, IN
    Job Title : Data Engineer – Google Cloud Platform (GCP).We are seeking a skilled and motivated Data Engineer with hands-on experience in building scalable data pipelines and cloud-native data soluti...Show moreLast updated: 18 days ago
    • Promoted
    Data Engineer

    Data Engineer

    HISH IT SERVICESKalyan-Dombivli, IN
    Location : Remote(Banglore,Chennai,Pune).Pay : 14LPA - 18 LPA(Based on Experience).Timings : A couple of hours overlap with EST, as the client is Canada-based (till 12AM IST).Start Date : 20th Octob...Show moreLast updated: 7 days ago
    • Promoted
    Data Engineer

    Data Engineer

    EveriseThane, IN
    Join us on our mission to elevate customer experiences for people around the world.As a member of the Everise family, you will be part of a global experience company that believes in being people-f...Show moreLast updated: 7 days ago
    • Promoted
    • New!
    GCP_Data Engineer

    GCP_Data Engineer

    Fractalmumbai, India
    It's fun to work in a company where people truly BELIEVE in what they are doing!.Design and develop data-ingestion frameworks, real-time processing solutions, and data processing and transformation...Show moreLast updated: 5 hours ago
    • Promoted
    • New!
    Senior Websphere Engineer

    Senior Websphere Engineer

    Anicalls (Pty) Ltdmumbai, India
    Proficient in modern Java Language Frameworks and its associated unit testing frameworks : Java / JEE (Core Java, Spring Boot, Spring MVC, Spring Cloud, etc. At least one JavaScript Technologies : React...Show moreLast updated: 5 hours ago
    • Promoted
    Data Engineer - Web Scrapping & Enrichment

    Data Engineer - Web Scrapping & Enrichment

    Gala IntelligenceKalyan-Dombivli, IN
    We're looking for an entrepreneurial, passionate, and driven.Navneet Tech Venture situated in.As we're building our technology platform from scratch, you'll have the unique opportunity to shape our...Show moreLast updated: 30+ days ago
    • Promoted
    AWS Data Engineer

    AWS Data Engineer

    Holcim Global Digital HubNavi Mumbai, Maharashtra, India
    Holcim is the leading partner for sustainable construction, creating value across the built environment from infrastructure and industry to buildings. We offer high-value end-to-end Building Materia...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    AWS Data Engineer

    AWS Data Engineer

    Talent Worxmumbai, India
    We are seeking experienced AWS Data Engineers to design, implement, and maintain robust data pipelines and analytics solutions using AWS services. The ideal candidate will have a strong background i...Show moreLast updated: 5 hours ago