Talent.com
Data Engineer - Scala Spark

NielsenIQ • Chennai, India
9 days ago
Job description

Role Summary:

Design, build, and optimize large-scale ETL and data-processing pipelines handling GB–TB volumes. Operate within the Databricks ecosystem and drive migration of selected workloads to high-performance engines such as Polars and DuckDB. Maintain strong engineering rigor across CI/CD, testing, and code-quality enforcement. Apply analytical thinking to solve data reliability, performance, and scalability problems. AI familiarity is advantageous.

Core Responsibilities:

  • Develop and maintain distributed data pipelines using Scala, Spark, Delta, and Databricks.
  • Engineer robust ETL workflows tuned for high-volume ingestion, transformation, and publishing.
  • Profile pipelines, remove bottlenecks, and optimize compute, storage, and job orchestration.
  • Lead migration of suitable workloads to Polars, DuckDB, or equivalent high-performance engines.
  • Implement CI/CD workflows with automated builds, tests, deployments, and environment gating.
  • Enforce coding standards through code coverage targets, unit/integration tests, and SonarQube rules.
  • Ensure pipeline observability: logging, data quality checks, lineage, and failure diagnostics.
  • Apply analytical reasoning to triage complex data issues and deliver root-cause clarity.
  • Contribute to AI-aligned initiatives when required: RAG design, fine-tuning workflows, agentic patterns.
  • Collaborate with product, analytics, and platform teams to operationalize data solutions.

Required Skills and Experience:

  • 3+ years in data engineering with strong command of Scala and Spark.
  • Proven background in ETL design, distributed processing, and high-volume data systems.
  • Hands-on experience with Databricks (jobs, clusters, notebooks, Delta Lake).
  • Proficiency in workflow optimization, performance tuning, and memory management.
  • Experience with Polars, DuckDB, or similar columnar/accelerated engines.
  • CI/CD discipline using Git-based pipelines; strong testing and code-quality practices.
  • Familiarity with SonarQube, coverage metrics, and static analysis.
  • Strong analytical and debugging capability across data, pipelines, and infra.
  • Exposure to AI concepts: embeddings, vector stores, retrieval-augmented generation, fine-tuning, agentic architectures.

Preferred:

  • Experience with Azure cloud environments.
  • Experience in metadata-driven or config-driven pipeline frameworks.