Talent.com
Data Engineer (Webscraping)

Solytics Partners, Delhi, India
12 days ago
Job description

Company Profile :

Solytics Partners is a global analytics firm, recognized with multiple industry awards for innovation and excellence. Our team comprises experts with deep knowledge in risk, analytics, AI / ML, AML / FCC, and fraud. By converging this expertise with cutting-edge technologies like AI, Machine Learning, Generative AI, and Large Language Models (LLMs), we deliver powerful automated platforms and incisive point solutions. Our offerings enable clients to streamline and future-proof their risk, AML, and analytics processes, comply seamlessly with global regulations, and safeguard financial systems. Whether it’s solving complex challenges or driving operational efficiency, Solytics Partners is committed to empowering organizations with transformative tools to stay ahead in an evolving regulatory landscape.

Job Title : Data Engineer (Web Scraping)

Experience : 5 – 10 years of relevant experience

Location & Timings : Pune (work from office), 11:00 AM – 8:00 PM

Education Qualification : Master's or bachelor's degree in computer science, IT, or another relevant discipline from a reputed institute.

Role Type : Permanent / Full Time

Job Description : We are seeking an experienced Data Engineering & Automation Lead to design, automate, and optimize large-scale data processing and web scraping pipelines. The role involves leading a team to build and maintain high-performance ETL workflows using Apache Airflow, Apache Spark, and AWS services, while integrating AI / NLP solutions powered by OpenAI GPT and other GenAI models for intelligent data extraction and analytics.

Responsibilities :

  • Design, automate, and maintain ETL and data processing pipelines using Apache Airflow and Apache Spark.
  • Build, monitor, and optimize web scraping and data extraction workflows for global compliance and risk data sources.
  • Lead and manage web scraping and data engineering teams, ensuring delivery excellence, code quality, and scalability.
  • Create, design, and document automation workflows and secure data-sharing systems using AWS (Lambda, S3, API Gateway, SQS).
  • Implement AI and NLP integrations using OpenAI GPT and GenAI models for intelligent data extraction, tagging, and analytical automation.
  • Analyze large-scale datasets to identify quality gaps, improve accuracy, and optimize indexing and retrieval performance.
  • Collaborate with Backend, DevOps, and Frontend teams for data modeling, monitoring, and visualization.
  • Work closely with clients to gather and translate business requirements into scalable automation and analytics solutions.
  • Author HLD / LLD documentation, mentor junior engineers, and continuously improve automation processes and data workflows.

Required Skills :

  • Programming : Python, SQL, JavaScript
  • Data Engineering & Automation : Apache Airflow, Apache Spark, Web Scraping (Scrapy, Selenium), Pandas, NumPy
  • Databases & Storage : Elasticsearch, MongoDB, MySQL
  • Cloud & Backend : AWS (Lambda, S3, EC2, CloudWatch, SQS, SNS, EKS), Docker, Django, Flask
  • AI / ML & NLP : OpenAI GPT APIs, NER, Sentiment Analysis, Embeddings, Information Extraction
  • Monitoring & Tools : Grafana, Git, Postman, Jupyter, VS Code

Good to Have :

  • Strong understanding of Large Language Models (LLMs) and Generative AI for building intelligent data extraction and analytics agents.
  • Familiarity with risk and compliance domains, including Sanctions, PEP (Politically Exposed Persons), and AMS (Adverse Media Screening) data and processes.

Soft Skills :

  • Leadership & Team Mentoring
  • Problem-Solving & Analytical Thinking
  • Clear Technical Communication
  • Cross-functional Collaboration