Talent.com
This job offer is not available in your country.
Data Engineer - Web Scrapping & Enrichment

Data Engineer - Web Scrapping & Enrichment

Gala IntelligenceBangalore, IN
30+ days ago
Job description

We're looking for an entrepreneurial, passionate, and driven Data Engineer to join Startup Gala Intelligence backed by Navneet Tech Venture situated in Ahmedabad . As we're building our technology platform from scratch, you'll have the unique opportunity to shape our technology vision, architecture, and engineering culture right from the ground up. You’ll directly contribute to foundational development and establish best practices, while eventually building and contributing to our engineering team.

This role is ideal for someone eager to own the entire tech stack, who thrives on early-stage challenges, and loves building innovative, scalable solutions from day zero.

What You’ll Do

  • Web Scraping & Crawling : Build and maintain automated scrapers to extract structured and unstructured data from websites, APIs, and public datasets.
  • Scalable Scraping Systems : Develop multi-threaded, distributed crawlers capable of handling high-volume data collection without interruptions.
  • Data Parsing & Cleaning : Normalize scraped data, remove noise, and ensure consistency before passing to data pipelines.
  • Anti-bot & Evasion Tactics : Implement proxy rotation, captcha solving, and request throttling techniques to handle scraping restrictions.
  • Integration with Pipelines : Deliver clean, structured datasets into NoSQL stores and ETL pipelines for further enrichment and graph-based storage.
  • Data Quality & Validation : Ensure data accuracy, deduplicate records, and maintain a trust scoring system for data confidence.
  • Documentation & Maintenance : Keep scrapers updated when websites change, and document scraping logic for reproducibility.

Who You Are

Technical Skills :

  • 2+ years of experience in web scraping , crawling, or data collection.
  • Strong proficiency in Python (libraries like BeautifulSoup, Scrapy, Selenium, Playwright, Requests).
  • Familiarity with NoSQL databases (MongoDB, DynamoDB) and data serialization formats (JSON, CSV, Parquet).
  • Experience in handling large-scale scraping with proxy management and rate-limiting.
  • Basic knowledge of ETL processes and integration with data pipelines.
  • Exposure to graph databases (Neo4j) is a plus.
  • Soft Skills :

  • Detail-oriented, ensuring accuracy and reliability of collected data.
  • Strong problem-solving skills, particularly in adapting scrapers to evolving web structures.
  • Curious mindset with a drive to discover new data sources.
  • Comfortable working in a fast-paced, early-stage startup environment.
  • Who We Are & Our Culture

    Gala Intelligence , backed by Navneet Tech Ventures , is a tech-driven startup dedicated to solving one of the most pressing business challenges - fraud detection and prevention. We're building cutting-edge, real-time products designed to empower consumers and businesses to stay ahead of fraudsters, leveraging innovative technology and deep domain expertise.

    Our culture and values :

    We’re united by a single, critical mission - stopping fraud before it impacts businesses. Curiosity, innovation, and proactive action define our approach. We value transparency, collaboration, and individual ownership, creating an environment where talented people can do their best work.

  • Problem-Driven Innovation : We're deeply committed to solving real challenges that genuinely matter for our customers.
  • Rapid Action & Ownership : We encourage autonomy and accountability—own your projects, move quickly, and shape the future of Gala Intelligence.
  • Collaborative Excellence : Cross-team collaboration ensures alignment, sparks innovation, and drives us forward together.
  • Continuous Learning : Fraud evolves rapidly, and so do we. Continuous improvement, experimentation, and learning are core to our success.
  • If you're excited by the opportunity to leverage technology in the fight against fraud, and you're ready to build something impactful from day one, we want to hear from you!

    Create a job alert for this search

    Data Engineer • Bangalore, IN

    Related jobs
    • Promoted
    GCP Data Engineer

    GCP Data Engineer

    BrillioBengaluru, Karnataka, India
    GCP (primary), AWS (secondary acceptable).Infrastructure as Code (good to have).Develop and maintain scalable, cloud-native data processing pipelines on GCP. BigQuery, DataFlow, Pub / Sub, Cloud Stora...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Web Scraping Engineer

    Senior Web Scraping Engineer

    S2T AI - AI-Powered InvestigationsBengaluru, IN
    We are on the lookout for a highly competent, self-motivated.Gather and process raw data at scale (including writing scripts, web scraping, calling APIs). Able to work independently to complete assi...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Innodata Inc.Bengaluru, IN
    CI / CD practices, Databricks (Spark), Python, Github and SQL.The ideal candidate should have hands-on expertise in building and automating data pipelines, managing multi-environment deployments, and...Show moreLast updated: 26 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Canopus Infosystems - A CMMI Level 3 Companybangalore, karnataka, in
    Python expertise and hands-on experience in handling large datasets, data cleaning, analysis, and visualization.The ideal candidate should be capable of building data pipelines, performing web scra...Show moreLast updated: 18 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Bahwan CyberTekhosur, tamil nadu, in
    Job Title : Data Engineer – Google Cloud Platform (GCP).We are seeking a skilled and motivated Data Engineer with hands-on experience in building scalable data pipelines and cloud-native data soluti...Show moreLast updated: 17 days ago
    • Promoted
    Data Engineer

    Data Engineer

    RevXhosur, tamil nadu, in
    RevX helps app businesses acquire and reengage users via programmatic to retain, monetize, and accelerate revenue.We're all about taking your app businesses to a new growth level.We rely on data sc...Show moreLast updated: 8 days ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    Kline + Companyhosur, tamil nadu, in
    Here at Kline our data capabilities have grown exponentially over the last four years.Having gone through a rapid digitization process and becoming a cloud-native corporation, we are looking for to...Show moreLast updated: 14 hours ago
    • Promoted
    Data Engineer - Web Scraping

    Data Engineer - Web Scraping

    Alternative PathBangalore, IN
    Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm.In this role, you will collaborate with individuals across various company de...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    AI Web Scraping Engineer

    AI Web Scraping Engineer

    S2T AI - AI-Powered InvestigationsBengaluru, IN
    We're seeking a forward-thinking.AI tools to accelerate development and streamline data extraction processes.Join our India team and work at the intersection of traditional scraping expertise and c...Show moreLast updated: 18 hours ago
    • Promoted
    Data Engineer | Azure + Databricks

    Data Engineer | Azure + Databricks

    XebiaBengaluru, Karnataka, India
    Job Opportunity – Data Engineer | Azure + Databricks 🔹.Bangalore (Whitefield) | Hybrid (3 days office / week).Immediate to 2 weeks – only apply if you can join early. The ideal candidate will have st...Show moreLast updated: 8 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Kumaran Systemsbangalore, karnataka, in
    Strong hands-on experience in Databricks.Proven expertise in building and managing data ingestion pipelines.Exposure to Databricks data ingestion jobs along with incident management frameworks.Expe...Show moreLast updated: 7 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Insight Globalhosur, tamil nadu, in
    Notice Period : This is a ASAP hire - only apply if you can start 2 weeks after offer is extended!.Priority scheduling for candidates who message on LinkedIn with resume. Design, develop, and maintai...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer - Web Scrapping & Enrichment

    Data Engineer - Web Scrapping & Enrichment

    Gala IntelligenceBengaluru, IN
    We're looking for an entrepreneurial, passionate, and driven.Navneet Tech Venture situated in.As we're building our technology platform from scratch, you'll have the unique opportunity to shape our...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    SynechronBangalore Urban, Karnataka, India
    We have immediate opportunity for.Data Engineer (Python, Spark / Scala).Data Engineer (Python, Spark / Scala).Notice Period : Immediate joiner only. At Synechron, we believe in the power of digital to tr...Show moreLast updated: 30+ days ago
    • Promoted
    Backend Developer / Data Scraping / Kubernetes

    Backend Developer / Data Scraping / Kubernetes

    Daply | Scaling Digital Publishinghosur, tamil nadu, in
    IN ORDER TO BE CONSIDERED FOR THIS ROLE, YOU MUST SUBMIT AN APPLICATION HERE : .Web Scraping / Captcha Control.Show moreLast updated: 7 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Kanerika Inchosur, tamil nadu, in
    Following are high level responsibilities that you will play but not limited to : .Analyze the Data Model and do GAP analysis with Business Requirements and Power BI. Design and Model Power BI schema....Show moreLast updated: 19 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Everisehosur, tamil nadu, in
    Join us on our mission to elevate customer experiences for people around the world.As a member of the Everise family, you will be part of a global experience company that believes in being people-f...Show moreLast updated: 6 days ago
    • Promoted
    Data Engineer

    Data Engineer

    ACL DigitalBengaluru, Karnataka, India
    Design, develop, and optimize Spark-based data pipelines on Databricks for large-scale data processing.Design, develop, and optimize AWS pipeline as applicable. Implement and manage GitHub asset bun...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer III - Innovation - Admin

    Data Engineer III - Innovation - Admin

    WithumBengaluru, Karnataka, India
    Withum is a place where talent thrives - where who you are matters.It’s a place of endless opportunities for growth.A place where entrepreneurial energy plus inclusive teamwork equals exponential r...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    IntraEdgebangalore, karnataka, in
    Snowflake, AWS (Lambda, Glue), DBT, and SQL.The ideal candidate will be responsible for enabling seamless data integration, transformation, and analytics to support business intelligence and advanc...Show moreLast updated: 30+ days ago