Data Engineer (Webscraping)

Solytics Partners, Pushkar, IN
2 hours ago
Job description

Company Profile :

Solytics Partners is a global analytics firm, recognized with multiple industry awards for innovation and excellence. Our team comprises experts with deep knowledge in risk, analytics, AI / ML, AML / FCC, and fraud. By converging this expertise with cutting-edge technologies like AI, Machine Learning, Generative AI, and Large Language Models (LLMs), we deliver powerful automated platforms and incisive point solutions. Our offerings enable clients to streamline and future-proof their risk, AML, and analytics processes, comply seamlessly with global regulations, and safeguard financial systems. Whether it’s solving complex challenges or driving operational efficiency, Solytics Partners is committed to empowering organizations with transformative tools to stay ahead in an evolving regulatory landscape.

Job Title : Data Engineer (Web Scraping)

Experience : 5 – 10 years of relevant experience

Location & Timings : Pune (work from office), 11:00 AM – 8:00 PM

Education Qualification : Master's or bachelor's degree in computer science, IT, or another relevant discipline from a reputed institute.

Role Type : Permanent / Full Time

Job Description : We are seeking an experienced Data Engineering & Automation Lead to design, automate, and optimize large-scale data processing and web scraping pipelines. The role involves leading a team to build and maintain high-performance ETL workflows using Apache Airflow, Apache Spark, and AWS services, while integrating AI / NLP solutions powered by OpenAI GPT and other GenAI models for intelligent data extraction and analytics.

Responsibilities :

  • Design, automate, and maintain ETL and data processing pipelines using Apache Airflow and Apache Spark.
  • Build, monitor, and optimize web scraping and data extraction workflows for global compliance and risk data sources.
  • Lead and manage web scraping and data engineering teams, ensuring delivery excellence, code quality, and scalability.
  • Create, design, and document automation workflows and secure data-sharing systems using AWS (Lambda, S3, API Gateway, SQS).
  • Implement AI and NLP integrations using OpenAI GPT and GenAI models for intelligent data extraction, tagging, and analytical automation.
  • Analyze large-scale datasets to identify quality gaps, improve accuracy, and optimize indexing and retrieval performance.
  • Collaborate with Backend, DevOps, and Frontend teams for data modeling, monitoring, and visualization.
  • Work closely with clients to gather and translate business requirements into scalable automation and analytics solutions.
  • Author HLD / LLD documentation, mentor junior engineers, and continuously improve automation processes and data workflows.

Required Skills :

  • Programming : Python, SQL, JavaScript
  • Data Engineering & Automation : Apache Airflow, Apache Spark, Web Scraping (Scrapy, Selenium), Pandas, NumPy
  • Databases & Storage : Elasticsearch, MongoDB, MySQL
  • Cloud & Backend : AWS (Lambda, S3, EC2, CloudWatch, SQS, SNS, EKS), Docker, Django, Flask
  • AI / ML & NLP : OpenAI GPT APIs, NER, Sentiment Analysis, Embeddings, Information Extraction
  • Monitoring & Tools : Grafana, Git, Postman, Jupyter, VS Code

Good to Have :

  • Strong understanding of Large Language Models (LLMs) and Generative AI for building intelligent data extraction and analytics agents.
  • Familiarity with risk and compliance domains, including Sanctions, PEP (Politically Exposed Persons), and AMS (Adverse Media Screening) data and processes.

Soft Skills :

  • Leadership & Team Mentoring
  • Problem-Solving & Analytical Thinking
  • Clear Technical Communication
  • Cross-functional Collaboration