Design, develop, and maintain robust and scalable data pipelines using Apache Spark and Scala on the Databricks platform.
Implement ETL (Extract, Transform, Load) processes for various data sources, ensuring data quality, integrity, and efficiency.
Optimize Spark applications for performance and cost-efficiency within the Databricks environment.
Work with Delta Lake for building reliable data lakes and data warehouses, ensuring ACID transactions and data versioning.
Collaborate with data scientists, analysts, and other engineering teams to understand data requirements and deliver solutions.
Implement data governance and security best practices within Databricks.
Troubleshoot and resolve data-related issues, ensuring data availability and reliability.
Stay updated with the latest advancements in Spark, Scala, Databricks, and related big data technologies.
Required Skills and Experience :
Proven experience as a Data Engineer with a strong focus on big data technologies.
Expertise in Scala programming language for data processing and Spark application development.
In-depth knowledge and hands-on experience with Apache Spark, including Spark SQL, Spark Streaming, and Spark Core.
Proficiency in using Databricks platform features, including notebooks, jobs, workflows, and Unity Catalog.
Experience with Delta Lake and its capabilities for building data lakes.
Strong understanding of data warehousing concepts, data modeling, and relational databases.
Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and their data services.
Experience with version control systems like Git.
Excellent problem-solving and analytical skills.
Ability to work independently and as part of a team.
Preferred Qualifications (Optional) :
Experience with other big data technologies like Kafka, Flink, or Hadoop ecosystem components.
Knowledge of data visualization tools.
Understanding of DevOps principles and CI / CD pipelines for data engineering.
Relevant certifications in Spark or Databrick
(ref : hirist.tech)
Create a job alert for this search
Data Engineer • Pune
Related jobs
Senior Cloudera Developer - Apache Spark
Deutsche Telekom • Pune
Your Role : We are looking for a Senior Cloudera Developer (Data Engineer) with extensive experience in Spark, the Hadoop ecosystem, and GC...Show more
Last updated: 29 days ago • Promoted
Senior Data Engineer
RapidBrains • Pune, IN
Job Title : Senior Data Engineer.We are looking for a Senior Data Engineer with deep expertise in Azure Data Engineering to design, build, and optimize large-scale data pipelines.The ideal candidate...Show more
Last updated: 14 days ago • Promoted
Data Engineer
IntraEdge • Pune, IN
We are seeking a highly skilled Data Engineer with strong experience in Python, PySpark, Snowflake, and AWS Glue to join our growing data team.
You will be responsible for building scalable and reli...Show more
Last updated: 30+ days ago • Promoted
Data Engineer / Data Developer (GCP)
Tekshiras Software Services Private Limited • Pune, Maharashtra, India
Role : Data Engineer / Data Developer (GCP) Experience : 2–6 Years Location : Banglore and Pune Required Skills & Experience - Strong hands-on experience as a Data Engineer / Data Developer - Exper...Show more
Last updated: 13 hours ago • Promoted • New!
PySpark Data Engineer
EXTRAGIG • Pune, IN
Contract Assistant – Data Engineer Support (Remote, EST Hours).PySpark Data Engineer with daily activities.This is a remote contract role.
Execute creative software and data solutions, including des...Show more
Last updated: 30+ days ago • Promoted
Data Engineer
Trinity Technology Solutions LLC • Pune, Maharashtra, India
Location : Pune preferred / remote.Programming Python, PySpark / Spark.Operating Systems Linux (RedHat).Databases Hive, MSSQL, MySQL, PostgreSQL.
Big Data Hadoop ecosystem (HDFS, YARN, Sqoop).Version Co...Show more
Last updated: 7 days ago • Promoted
Data Engineer
Synechron • Pune, India
We have immediate opportunity for PL / SQL Developer.Notice Period : Immediate to 30 Days.At Synechron, we believe in the power of digital to transform businesses for the better.Our global consulting ...Show more
Last updated: 30+ days ago • Promoted
Cloud Data Engineer- Spark & Databricks
Confidential • Chennai, Hyderabad / Secunderabad, Telangana, Pune
We are looking for a highly skilled .The ideal candidate will have extensive experience working with cloud platforms such as AWS, Azure, and GCP, and a deep understanding of data engineering, ETL p...Show more
Last updated: 30+ days ago • Promoted
Data Engineer (GCP)
HISH IT SERVICES • Pune, IN
We have a new urgent GCP Data Engineer opportunity open to support a migration initiative from Teradata to Cerebro (BigQuery).
This role requires a hands-on developer who can collaborate closely wit...Show more
Last updated: 15 days ago • Promoted
Data Engineer - Spark / Hadoop
TalenTree • Pune
Key Responsibilities : - Build and optimise data ingestion, transformation, and integration pipelines across multiple sources - clinical trials, EHR / EMR, laboratory ...Show more
Last updated: 18 days ago • Promoted
Senior Spark Data Engineer
Confidential • Pune, India
Join us as a Senior Spark Data Engineer at Barclays, where you'll take part in the evolution of our digital landscape, driving innovation and excellence.
You'll harness cutting-edge technology to re...Show more
Last updated: 27 days ago • Promoted
Data Engineer
Vriba Solutions • Pune, IN
Design, develop & maintain ETL / ELT pipelines.Ingest & transform data from APIs, DBs, files, streams.Build real-time & batch processing solutions.
Data validation, quality & cleansing.Translate busin...Show more
Last updated: 30+ days ago • Promoted
Sr. Data Engineer
Persistent Systems • Pune, Maharashtra, India
About Position : As a Senior Scala Engineer, you’ll be part of a team of smart, highly skilled technologists,who are passionate about learning and supporting cutting-edge technologies such as Spark...Show more
Last updated: 1 day ago • Promoted
5676 - Data Engineer
EXL • Pune, Maharashtra, India
Location - Pune, Bangalore, Noida, Gurgaon, Hyderabad.The ideal candidate will have strong expertise in Snowflake, Hadoop ecosystem, PySpark, and SQL, and will play a key role in enabling data-driv...Show more
Last updated: 23 days ago • Promoted
Data Engineer- Databricks
InfoCepts • Pune, Maharashtra, India
Position : Data Engineer- Databricks Purpose of the Position : Develop, support and steer end-to-end business intelligence using Databricks.
Location : Nagpur / Pune / Chennai / Bangalore Key Responsib...Show more
Last updated: 23 days ago • Promoted
Senior Data Engineer
SPIRO • Pune, India
Position : Senior Data Engineer.We are seeking an experienced Senior Data Engineer with a strong background in PySpark, Spark, and Big Data technologies.
The ideal candidate will have over five year...Show more
Last updated: 30+ days ago • Promoted
Data Engineer
Persistent Systems • Pune, Maharashtra, India
We are seeking Data Engineer with hands-on experience in Spark, Pyspark, AWS, Java, Python etc.Location : All Persistent Locations.
Job Type : Full Time Employment.Design, build, and maintain data pip...Show more
Last updated: 30+ days ago • Promoted
Databricks Data Engineer
Ascendion • Pune, Maharashtra, India
Job Title : Senior Data Engineer.Location : Gurgaon / Pune / Bangalore.Skills : PySpark, SQL, Databricks, AWS.The ideal candidates should have hands-on expertise in building.
Databricks and AWS, along w...Show more