Job Title: Data Engineer III
Location: Delhi (Hybrid)
Department: Data Engineering
Reports To: Engineering Manager – Data Platform
About the Role
The Data Engineer III will lead the design and optimization of Baazi’s large-scale data platform. You’ll architect end-to-end data solutions, mentor junior engineers, and drive innovation across our AWS-based data ecosystem to enable faster, smarter insights across products.
Key Responsibilities
- Architect, build, and optimize large-scale data pipelines and lakehouse systems using Iceberg or Hudi.
- Design and implement advanced ETL/ELT frameworks on AWS (Glue, EMR, Redshift, Lambda).
- Lead development of reusable, modular data pipeline components using PySpark, Python, and SQL.
- Oversee orchestration workflows via Airflow and ensure reliability, scalability, and fault tolerance.
- Collaborate with product, analytics, and engineering teams to define and maintain unified data models.
- Conduct performance tuning, cost optimization, and security hardening of AWS data infrastructure.
- Mentor Data Engineer I / II team members, review code, and enforce best practices.
- Champion data quality frameworks, cataloging standards, and metadata-driven design.
Required Skills & Experience
- 4–8 years of experience in data engineering, including at least 4 years in PySpark.
- Deep expertise in the AWS ecosystem: Glue, EMR, S3, Lambda, Redshift, CloudWatch.
- Proven hands-on experience with Apache Iceberg or Hudi (Iceberg preferred).
- Strong programming skills in Python and PySpark.
- Excellent command of SQL and performance optimization.
- Experience with Airflow for complex pipeline orchestration.
- Exposure to Kubernetes or containerized deployment environments (good to have).
- Strong understanding of distributed systems, data modeling, and data governance.
- Ability to translate business needs into scalable technical solutions.