Job Title : Lead Data Engineer
Location : Pune, Gurgaon, Bangalore, Chennai
Primary Skills Pyspark, SQL, ETL, AWS,Glue, Unix, DW concept
Experience : 8 years
- Good Understanding of Cloud, Hadoop and DW concepts.
- Hands on experience in Pyspark and strong knowledge on Dataframes, RDD and SparkSQL
- Hands on experience in Pyspark performance optimization techniques.
- Hands on Experience in developing, testing, and maintaining applications on AWS Cloud.
- Strong hold on AWS Data Analytics Technology Stack (Glue, S3, Lambda, Lake formation, Athena, Event Bridge)
- Design and implement scalable and efficient data transformation / storage solutions with open table formats such as DELTA, Iceberg, Hudi.
- Experience in using DBT (Data Build Tool) with snowflake / Athena / Glue for ELT pipeline development.
- Experience in Writing advanced SQL and PL SQL programs.
- Hands On Experience for building reusable components using Snowflake and AWS Tools / Technology
- DevOps & CI / CD for Data – Experience using GitLab Actions or similar tools for version control, CI / CD, and infrastructure-as-code for data pipelines.
Good to Have
Exposure to data governance or lineage tools such as Immuta and Alation is added advantage.Experience in using Orchestration tools such as Apache Airflow or Snowflake Tasks is added advantage.Knowledge on Ab-initio ETL tool is a plus