Spark Scala Engineer • Delhi, India

Role Summary:
Design, build, and optimize large-scale ETL and data-processing pipelines handling GB–TB volumes. Operate within the Databricks ecosystem and drive migration of selected workloads to high-performance engines such as Polars and DuckDB. Maintain strong engineering rigor across CI/CD, testing, and code-quality enforcement. Apply analytical thinking to solve data reliability, performance, and scalability problems. AI familiarity is advantageous.
Core Responsibilities:
Develop and maintain distributed data pipelines using Scala, Spark, Delta, and Databricks.
Engineer robust ETL workflows tuned for high-volume ingestion, transformation, and publishing.
Profile pipelines, remove bottlenecks, and optimize compute, storage, and job orchestration.
Lead migration of suitable workloads to Polars, DuckDB, or equivalent high-performance engines.
Implement CI/CD workflows with automated builds, tests, deployments, and environment gating.
Enforce coding standards through code-coverage targets, unit/integration tests, and SonarQube rules.
Ensure pipeline observability: logging, data-quality checks, lineage, and failure diagnostics.
Apply analytical reasoning to triage complex data issues and deliver root-cause clarity.
Contribute to AI-aligned initiatives when required: retrieval-augmented generation (RAG) design, fine-tuning workflows, and agentic patterns.
Collaborate with product, analytics, and platform teams to operationalize data solutions.
Required Skills and Experience:
3+ years in data engineering with strong command of Scala and Spark.
Proven background in ETL design, distributed processing, and high-volume data systems.
Hands-on experience with Databricks (jobs, clusters, notebooks, Delta Lake).
Proficiency in workflow optimization, performance tuning, and memory management.
Experience with Polars, DuckDB, or similar columnar/accelerated engines.
CI/CD discipline using Git-based pipelines; strong testing and code-quality practices.
Familiarity with SonarQube, coverage metrics, and static analysis.
Strong analytical and debugging capability across data, pipelines, and infra.
Exposure to AI concepts: embeddings, vector stores, retrieval-augmented generation, fine-tuning, and agentic architectures.
Preferred:
Experience with Azure cloud environments.
Experience in metadata-driven or config-driven pipeline frameworks.