Talent.com
This job offer is not available in your country.
Data Engineer - ETL / PySpark

Data Engineer - ETL / PySpark

Talent SocioMumbai
2 days ago
Job description

We are building a next-generation Customer Data Platform (CDP) powered by the Databricks Lakehouse architecture and Lakehouse Engine framework. We're looking for a skilled Data Engineer with 4-9 years of experience to help us build metadata-driven pipelines, enable real-time data processing, and support marketing campaign orchestration capabilities at scale.

Responsibilities :

  • Configure and extend the Lakehouse Engine framework for batch and streaming pipelines.
  • Implement the medallion architecture (Bronze -> Silver -> Gold) using Delta Lake.
  • Develop metadata-driven ingestion patterns from various customer data sources.
  • Build reusable transformers for PII handling, data standardization, and data quality enforcement.
  • Build Spark Structured Streaming pipelines for customer behavior and event tracking.
  • Set up Debezium + Kafka for Change Data Capture (CDC) from CRM systems.
  • Design and develop identity resolution logic across both streaming and batch datasets.
  • Use Unity Catalog for managing RBAC, data lineage, and auditability.
  • Integrate Great Expectations or similar tools for continuous data quality monitoring.
  • Set up CI / CD pipelines for deploying Databricks notebooks, jobs, and DLT pipelines.

Requirements :

  • 4-9 years of hands-on experience in data engineering.
  • Expertise in Databricks Lakehouse platform, Delta Lake, and Unity Catalog.
  • Advanced PySpark skills, including Structured Streaming.
  • Experience implementing Kafka + Debezium CDC pipelines.
  • Strong in SQL transformations, data modeling, and analytical querying.
  • Familiarity with metadata-driven architecture and parameterized pipelines.
  • Understanding of data governance : PII masking, access control, and lineage tracking.
  • Proficiency in working with AWS, MongoDB, and PostgreSQL.
  • Experience working on Customer 360 or Martech CDP platforms.
  • Familiarity with Martech tools like Segment, Braze, or other CDPs.
  • Exposure to ML pipelines for segmentation, scoring, or personalization.
  • Knowledge of CI / CD for data workflows using GitHub Actions, Terraform, or Databricks CLI.
  • (ref : hirist.tech)

    Create a job alert for this search

    Data Engineer • Mumbai

    Related jobs
    • Promoted
    Data Engineer

    Data Engineer

    Otomeyt AIKalyan-Dombivli, IN
    We are seeking a highly skilled 5+.The ideal candidate will have strong technical expertise in Azure, Data Engineering tools, and advanced ETL design along with excellent communication and problem-...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    DeltacubesMumbai, IN
    Build and maintain scalable ETL / ELT pipelines.Work with Snowflake and BigQuery for data storage.Implement orchestration with Airflow or Prefect. Integrate data workflows with Python.Optimize data pi...Show moreLast updated: 12 days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Otomeyt AIKalyan-Dombivli, IN
    We are seeking a highly skilled 7+.The ideal candidate will have strong technical expertise in Azure, Data Engineering tools, and advanced ETL design along with excellent communication and problem-...Show moreLast updated: 15 days ago
    • Promoted
    ETL LEAD with IBM DataStage experienced

    ETL LEAD with IBM DataStage experienced

    IntraEdgeThane, IN
    ETL Developer – DataStage, AWS, Snowflake.We are looking for a talented and motivated ETL Developer / Senior Developer.You will work on building scalable and efficient data pipelines using.IBM Data...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    ACL Digitalthane, maharashtra, in
    Design, develop, and optimize Spark-based data pipelines on Databricks for large-scale data processing.Design, develop, and optimize AWS pipeline as applicable. Implement and manage GitHub asset bun...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    INFEC Servicesthane, maharashtra, in
    Design, develop, and optimize data pipelines and ETL processes on GCP or Azure.Work with structured and unstructured data, integrating sources such as databases, APIs, and streaming platforms.Imple...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer - ETL / Python

    Data Engineer - ETL / Python

    Xanika InfotechMumbai
    Job Summary : We are seeking an experienced Data Engineer to join our team in Mumbai.The ideal candidate will have a strong background in data engi...Show moreLast updated: 5 days ago
    • Promoted
    Data Engineer - ETL / Python / Apache Airflow

    Data Engineer - ETL / Python / Apache Airflow

    K & R EnterprisesMumbai
    Roles and Responsibilities : - Develop, Monitor, and Maintain data pipeline.Create and maintain optimal data pipeline architecture - Assemble large, complex ...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer - ETL

    Data Engineer - ETL

    Targeticon Digital Services Pvt. Ltd.Mumbai
    Job Description : Data Engineer (NBFC / BFSI) Experience : 3+ years in BFSI domain Key Responsibilities : - Design, develop, and maintai...Show moreLast updated: 26 days ago
    • Promoted
    Lead Data Engineer

    Lead Data Engineer

    Eucloid Data SolutionsThane, IN
    Eucloid is looking for a Lead Data Engineer to join our Data Platform team supporting various business applications.The ideal candidate will support development of data infrastructure on Databricks...Show moreLast updated: 5 days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Straivethane, maharashtra, in
    The ideal candidate is a strong software engineer with hands-on experience in Spark (3.You'll be responsible for designing and implementing ETL / ELT solutions, collaborating with teams to deliver da...Show moreLast updated: 1 day ago
    • Promoted
    ETL Developer

    ETL Developer

    Pinnacle Group, Inc.Mumbai, IN
    PTR Global is a leader in providing innovative workforce solutions, dedicated to optimizing talent acquisition and management processes. Our commitment to excellence has earned us the trust of busin...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Innodata Inc.navi mumbai, maharashtra, in
    CI / CD practices, Databricks (Spark), Python, Github and SQL.The ideal candidate should have hands-on expertise in building and automating data pipelines, managing multi-environment deployments, and...Show moreLast updated: 22 days ago
    • Promoted
    Data Engineer Team Lead

    Data Engineer Team Lead

    SGImumbai, maharashtra, in
    To be discussed based on your skills and experience.Strong hands-on data engineering experience with a proven ability to design, build, and optimize scalable data pipelines in .Deep technical exper...Show moreLast updated: 3 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Aceolutiondombivli, maharashtra, in
    We are looking for a freelancer to engage with us for 20-40 hours per week.Kindly find the JD below for your reference.Design, develop, and maintain scalable data pipelines and workflows.Work exten...Show moreLast updated: 5 days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Tredence Inc.dombivli, maharashtra, in
    Design, build, and maintain scalable data pipelines using DBT and Airflow.Develop and optimize SQL queries and data models in Snowflake. Implement ETL / ELT workflows, ensuring data quality, performan...Show moreLast updated: 28 days ago
    • Promoted
    Data Engineer

    Data Engineer

    IntraEdgenavi mumbai, maharashtra, in
    Snowflake, AWS (Lambda, Glue), DBT, and SQL.The ideal candidate will be responsible for enabling seamless data integration, transformation, and analytics to support business intelligence and advanc...Show moreLast updated: 23 days ago
    • Promoted
    Data Engineer

    Data Engineer

    R SystemsThane, IN
    Offshore candidates accepted (Singapore Based Company).Please don't apply if less than 4 years exp in Data Engineer).ETL pipelines, real-time streaming, and data transformations.RDDs, transformatio...Show moreLast updated: 23 days ago