Talent.com
Data Engineer - Scala Spark

NielsenIQ, Chennai, Tamil Nadu, India
7 days ago
Job description

Role Summary:

Design, build, and optimize large-scale ETL and data-processing pipelines handling GB–TB volumes. Operate within the Databricks ecosystem and drive migration of selected workloads to high-performance engines such as Polars and DuckDB. Maintain strong engineering rigor across CI/CD, testing, and code-quality enforcement. Apply analytical thinking to solve data reliability, performance, and scalability problems. AI familiarity is advantageous.

Core Responsibilities:

  • Develop and maintain distributed data pipelines using Scala, Spark, Delta, and Databricks.
  • Engineer robust ETL workflows tuned for high-volume ingestion, transformation, and publishing.
  • Profile pipelines, remove bottlenecks, and optimize compute, storage, and job orchestration.
  • Lead migration of suitable workloads to Polars, DuckDB, or equivalent high-performance engines.
  • Implement CI/CD workflows with automated builds, tests, deployments, and environment gating.
  • Enforce coding standards through code coverage targets, unit/integration tests, and SonarQube rules.
  • Ensure pipeline observability: logging, data quality checks, lineage, and failure diagnostics.
  • Apply analytical reasoning to triage complex data issues and deliver root-cause clarity.
  • Contribute to AI-aligned initiatives when required: RAG design, fine-tuning workflows, agentic patterns.
  • Collaborate with product, analytics, and platform teams to operationalize data solutions.

Required Skills and Experience:

  • 3+ years in data engineering with strong command of Scala and Spark.
  • Proven background in ETL design, distributed processing, and high-volume data systems.
  • Hands-on experience with Databricks (jobs, clusters, notebooks, Delta Lake).
  • Proficiency in workflow optimization, performance tuning, and memory management.
  • Experience with Polars, DuckDB, or similar columnar/accelerated engines.
  • CI/CD discipline using Git-based pipelines; strong testing and code-quality practices.
  • Familiarity with SonarQube, coverage metrics, and static analysis.
  • Strong analytical and debugging capability across data, pipelines, and infra.
  • Exposure to AI concepts: embeddings, vector stores, retrieval-augmented generation, fine-tuning, agentic architectures.

Preferred:

  • Experience with Azure cloud environments.
  • Experience in metadata-driven or config-driven pipeline frameworks.