What you'll do
We are looking for a strong Senior Data Engineer with deep experience in Java-based data platforms and hands-on expertise with GCP, GCS, Iceberg, and Parquet. The role involves building efficient data pipelines, improving storage and query performance, and enabling a scalable data lake architecture. Experience with Trino or Apache Spark is a plus.
Java data engineering is a must due to our tech stack; we will not consider PySpark candidates.
Key Responsibilities
1. Data Engineering and Development
- Design and develop scalable data ingestion and transformation frameworks using Java.
- Build and maintain Iceberg tables stored on GCS using Parquet format.
- Continuously improve pipeline performance through better partitioning, compression, data layouts, and efficient Java code.
2. Cloud Engineering (Google Cloud Platform)
2. Cloud Engineering (Google Cloud Platform)
- Develop and optimize data solutions using GCP storage and compute services.
- Tune GCS usage, IAM configuration, and lifecycle rules for reliability and cost.
- Implement data residency and security for high-performance, low-latency workloads.
3. Data Lake Operations
- Manage Iceberg metadata, schema evolution, commit operations, and manifest handling.
- Improve read and write performance through partitioning strategies, clustering, file sizing, and metadata compaction.
- Troubleshoot concurrent write issues and optimize execution paths.
4. Integration and Query Layer
- Work with Trino or Spark to run efficient queries on Iceberg datasets.
- Improve Trino catalog performance through caching, connector tuning, and configuration changes.
- Integrate Java-based applications with data lake endpoints and reduce application query latencies.
5. Testing and Quality
- Build comprehensive automated tests for schema validation, data correctness, and regression detection.
- Validate data performance under different loads and benchmark improvements.
6. DevOps and Observability
- Implement CI/CD pipelines for data services.
- Develop monitoring for Iceberg metadata operations, GCS performance, Trino query speeds, and storage metrics.
- Identify bottlenecks and drive continuous performance improvements across the platform.
Why we should decide on you
- 5 years of experience.
- Prior experience migrating financial / regulatory datasets.
- Experience with Regulatory Reporting or similar enterprise workloads.
- Familiarity with large-scale performance benchmarking and cost modelling.
Required Skills
- Strong Java development background.
- Deep hands-on experience with GCP and GCS.
- Practical experience with Apache Iceberg, including table design and performance tuning.
- Strong knowledge of the Parquet format, compression options, and file optimization techniques.
- Good understanding of distributed systems and data consistency.
- Experience building scalable, high-performance data platforms.
Nice to Have
- Experience with Trino for federated querying.
- Experience with Apache Spark for distributed data processing.
- SQL tuning experience.
- Knowledge of Oracle-to-Iceberg migration patterns.
Soft Skills
- Strong analytical and debugging capability.
- Clear communication and the ability to work with cross-functional teams.
- Ownership mindset and the drive to deliver performance improvements without supervision.
Education
- Bachelor's or Master's degree in Computer Science, Engineering, or equivalent.
Why you should decide on us
- Let's grow together - join a market-leading SaaS company; our agile character and culture of innovation enable you to design our future.
- We provide you with the opportunity to take on responsibility and participate in international projects.
- In addition to our buddy program, we offer numerous individual and wide-ranging training opportunities during which you can explore technical and functional areas.
- Our internal mobility initiative encourages colleagues to transfer cross-functionally to gain experience and promotes knowledge sharing.
- We are proud of our positive working atmosphere, characterized by a supportive team across various locations and countries and transparent communication across all levels.
- Together we're better - meet your colleagues at our numerous team events.
- To get a first impression, we only need your CV and look forward to meeting you in a (personal / virtual) interview!
Recognizing the benefits of working in diverse teams, we are committed to equal employment opportunities regardless of gender, age, nationality, ethnic or social origin, disability, and sexual identity.
Are you interested? Apply now!
R&DN202502
R&DN202503
About us
Regnology is a leading technology firm on a mission to bring efficiency and stability to the financial markets. With an exclusive focus on regulatory reporting, and with more than 35,000 financial institutions, over 100 regulators, international organizations, and tax authorities relying on our solutions to process their regulatory reporting data, we are uniquely positioned to bring greater data quality, automation, and cost savings to all market participants. With a global team of over 1,200 employees, our clients can swiftly implement and derive value from our solutions and stay ahead of regulatory changes. Established in 2021 through the merger of BearingPoint RegTech and Vizor Software, Regnology is rapidly growing into a leading global regulatory reporting powerhouse.
Visit our website
Want to know more about Regnology? Find our news and business events on LinkedIn. To learn more about life and people at Regnology, check out our Instagram page.
Key Skills
Apache Hive, S3, Hadoop, Redshift, Spark, AWS, Apache Pig, NoSQL, Big Data, Data Warehouse, Kafka, Scala
Employment Type: Full-Time
Experience: years
Vacancy: 1