Talent.com
Data Engineer - ETL / PySpark

Data Engineer - ETL / PySpark

Talent SocioMumbai
30+ days ago
Job description

We are building a next-generation Customer Data Platform (CDP) powered by the Databricks Lakehouse architecture and Lakehouse Engine framework. We're looking for a skilled Data Engineer with 4-9 years of experience to help us build metadata-driven pipelines, enable real-time data processing, and support marketing campaign orchestration capabilities at scale.

Responsibilities :

  • Configure and extend the Lakehouse Engine framework for batch and streaming pipelines.
  • Implement the medallion architecture (Bronze -> Silver -> Gold) using Delta Lake.
  • Develop metadata-driven ingestion patterns from various customer data sources.
  • Build reusable transformers for PII handling, data standardization, and data quality enforcement.
  • Build Spark Structured Streaming pipelines for customer behavior and event tracking.
  • Set up Debezium + Kafka for Change Data Capture (CDC) from CRM systems.
  • Design and develop identity resolution logic across both streaming and batch datasets.
  • Use Unity Catalog for managing RBAC, data lineage, and auditability.
  • Integrate Great Expectations or similar tools for continuous data quality monitoring.
  • Set up CI / CD pipelines for deploying Databricks notebooks, jobs, and DLT pipelines.

Requirements :

  • 4-9 years of hands-on experience in data engineering.
  • Expertise in Databricks Lakehouse platform, Delta Lake, and Unity Catalog.
  • Advanced PySpark skills, including Structured Streaming.
  • Experience implementing Kafka + Debezium CDC pipelines.
  • Strong in SQL transformations, data modeling, and analytical querying.
  • Familiarity with metadata-driven architecture and parameterized pipelines.
  • Understanding of data governance : PII masking, access control, and lineage tracking.
  • Proficiency in working with AWS, MongoDB, and PostgreSQL.
  • Experience working on Customer 360 or Martech CDP platforms.
  • Familiarity with Martech tools like Segment, Braze, or other CDPs.
  • Exposure to ML pipelines for segmentation, scoring, or personalization.
  • Knowledge of CI / CD for data workflows using GitHub Actions, Terraform, or Databricks CLI.
  • (ref : hirist.tech)

    Create a job alert for this search

    Data Engineer • Mumbai

    Related jobs
    • Promoted
    Data Engineer

    Data Engineer

    StraiveMumbai, Maharashtra, India
    Data Engineer will be responsible for designing, developing, and maintaining data pipelines and architectures that support the organization's analytics and reporting needs.The ideal candidate will ...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer – ETL & Pipeline Development (5 to 10 yrs)

    Data Engineer – ETL & Pipeline Development (5 to 10 yrs)

    AIMLEAPKalyan-Dombivli, IN
    Data Engineer – ETL & Pipeline Development.Remote (Work from Home) / Bangalore / India.Tech / MCA / Computer Science / IT. IT / Data / AI / LegalTech / Enterprise Solutions.ETL workflows — not just ...Show moreLast updated: 1 day ago
    • Promoted
    AWS Data Engineer

    AWS Data Engineer

    TerraGiGMumbai, IN
    Design, development, and implementation of performant ETL pipelines using python API (pySpark) of Apache Spark on AWS EMR. Writing reusable, testable, and efficient code.Integration of data storage ...Show moreLast updated: 17 days ago
    • Promoted
    • New!
    ETL Developer

    ETL Developer

    TagKalyan-Dombivli, IN
    We are seeking a highly skilled.This role is a key part of our.Business Intelligence team, responsible for enabling robust data flows that power enterprise dashboards, analytics, and machine learni...Show moreLast updated: 1 hour ago
    • Promoted
    Data Engineer

    Data Engineer

    Response Informaticsthane, maharashtra, in
    AWS services : Must be proficient in building scalable data pipelines and managing cloud-native ETL workflows.Snowflake : Moderate understanding of Snowflake architecture. CICD - Terraform or CloudFo...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Straivedombivli, maharashtra, in
    The ideal candidate is a strong software engineer with hands-on experience in Spark (3.You'll be responsible for designing and implementing ETL / ELT solutions, collaborating with teams to deliver da...Show moreLast updated: 30+ days ago
    • Promoted
    AWS Data Engineer

    AWS Data Engineer

    Tata Consultancy Servicesnavi mumbai, maharashtra, in
    Aws data engineer having strong experience of Python.Technical / Behavioral Competency.Proficient in Python, with experience in deploying Python packages and OOP, Experience in ingesting data from di...Show moreLast updated: 14 days ago
    • Promoted
    • New!
    ETL Data Engineer + SQL - (Immediate joiners only)

    ETL Data Engineer + SQL - (Immediate joiners only)

    Innovya TechnologiesMumbai, IN
    Innovya Technologies is a dynamic and growing software consulting firm that drives business automation with cutting-edge solutions. We help businesses quickly realize value from their technology and...Show moreLast updated: 11 hours ago
    • Promoted
    Data Engineer

    Data Engineer

    IntraEdgeThane, IN
    We are seeking a highly skilled Data Engineer with strong experience in Python, PySpark, Snowflake, and AWS Glue to join our growing data team. You will be responsible for building scalable and reli...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Dexian Indiadombivli, maharashtra, in
    Designing and building optimized data pipelines using cutting-edge technologies in a cloud environment to drive analytical insights. Constructing infrastructure for efficient ETL processes from vari...Show moreLast updated: 6 days ago
    • Promoted
    Data & Analytics Engineer

    Data & Analytics Engineer

    APPIT Software IncMumbai, IN
    Data Engineer : Snowflake -Mandatory – Hands -on Experience.ETL Tool -Informatica [IVS version],BDT.GCP : Big query – Mandatory -handson experience. Data Modelling & Data Warehouse -Mandatory -Hands...Show moreLast updated: 4 days ago
    • Promoted
    ETL Developer

    ETL Developer

    Pinnacle Group, Inc.Kalyan-Dombivli, IN
    PTR Global is a leader in providing innovative workforce solutions, dedicated to optimizing talent acquisition and management processes. Our commitment to excellence has earned us the trust of busin...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Data Engineer

    Sr Data Engineer

    Mitchell Martin Inc.Thane, IN
    Job Title : Senior Data Engineer.We are looking for a Senior Data Engineer to design, build, and optimize data pipelines and systems that power our analytics, reporting, and data-driven decision-mak...Show moreLast updated: 19 days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    SGS & Conavi mumbai, maharashtra, in
    Position Title : Senior Data Engineer.Experience Required : 8 to 12 Years.We are looking for a highly skilled and experienced Data Engineer with strong expertise in. The ideal candidate will play a ke...Show moreLast updated: 6 days ago
    • Promoted
    ETL Developer

    ETL Developer

    Programmers.ionavi mumbai, maharashtra, in
    Shift Timings : General Shift (12 : 00 PM IST till 9 : 00 PM or 9 : 30 PM IST).Location : PAN India-Remote Until Office Resume, Work from Home. Experience required : 5 to 8 years.Design, develop, and maintai...Show moreLast updated: 3 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Tata Consultancy Servicesdombivli, maharashtra, in
    Required Technical Skill Set -.Create and maintain optimal data pipeline architecture,.Assemble large, complex data sets that meet functional / non-functional business requirements.Identify, design...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    SourcebaeThane, IN
    Senior Python Data Engineer (6-10 Yrs) | PySpark, Databricks, ADF | Hybrid Model 🌟.Note : - Below 5 Years of candidate should not apply for the same not even 4. Join a fast-paced, innovation-driven ...Show moreLast updated: 30+ days ago
    • Promoted
    GCP Data Engineer

    GCP Data Engineer

    VML Enterprise Solutionsmumbai, maharashtra, in
    GCP experience in recent 3 projects.Hands-on experience with Google Cloud Platform (GCP) services (BigQuery, Cloud Run,.Proficiency in SQL for data manipulation and analysis.Familiarity with infras...Show moreLast updated: 6 days ago