Description :
Job Title : Data Engineer (Python, PySpark, BigQuery)
Location : Mumbai (Onsite)
Experience : 7+ Years
Employment Type : Full-Time
Job Summary :
We are looking for a highly experienced Data Engineer with deep expertise in Python, PySpark, and BigQuery to join our data engineering team in Mumbai.
The ideal candidate will be responsible for building and maintaining large-scale data pipelines and data infrastructure to support real-time and batch analytics across the organization.
Key Responsibilities :
- Design, develop, and maintain efficient and scalable ETL / ELT data pipelines using Python and PySpark
- Manage and optimize large datasets in Google BigQuery, ensuring high performance and cost efficiency
- Work closely with Data Scientists, Analysts, and Product Teams to understand data needs and deliver timely solutions
- Perform data modeling, transformation, and integration from multiple data sources
- Ensure data quality, validation, and integrity across all systems and workflows
- Build and maintain job orchestration and scheduling using tools like Apache Airflow
- Write efficient SQL queries and scripts for data extraction and reporting
- Maintain clear documentation of data workflows, architecture, and business logic
Must-Have Skills :
7+ years of hands-on experience in Data EngineeringStrong programming skills in Python, with expertise in writing modular, production-ready codeProficiency in PySpark for distributed data processingSolid experience working with Google BigQuery, including query optimization and data managementStrong SQL skills and experience with relational and columnar databasesExperience with job orchestration tools like Apache Airflow (or similar)Excellent problem-solving and analytical skillsFamiliarity with version control tools like Git(ref : hirist.tech)