Position : DE_SQL_Python_Pyspark_Datawarehousing_Datamodelling
Experience : 5-10 Years
Location : Bangalore
Type : Full Time
Certifications : in Azure or AWS or Databricks
About the Role :
We are hiring sharp, hands-on Data Engineers to build scalable data solutions and drive performance across both traditional and cloud-based data platforms. If you love writing clean code, solving tough data problems, and designing robust data architectures, this role is for you.
What you will do :
- Design and implement scalable data pipelines for batch and near real-time use cases in cloud environments
- Write optimized, complex SQL queries for data transformation and analysis in cloud data warehouses
- Develop efficient Python and PySpark scripts for large-scale data processing and ETL workflows
- Create and maintain data models in Databricks , ensuring data quality and consistency
- Optimize queries and scripts over large-scale datasets (TBs) with a focus on performance and cost-efficiency
- Implement data governance and security best practices in cloud environments
- Collaborate across teams to translate business requirements into robust technical solutions
'Must have’ knowledge, skills and experiences
5+ years of hands-on experience in Data EngineeringStrong command over SQL , Python , and PySpark for data manipulation and analysisDeep experience with data warehousing concepts and implementation in cloud environments ( Azure / AWS )Proficiency in data modeling techniques for cloud-based systems ( Databricks, Snowflake )Solid understanding of ETL / ELT processes and best practices in cloud architecturesExperience with dimensional modeling, star schemas, and data mart designStrong analytical thinking and problem-solving skills‘Good to have’ knowledge, skills and experiences
Familiarity with data lake architectures and delta lake conceptsFamiliarity with Snowflake / Databricks Data WarehouseKnowledge of data warehouse migration strategies to cloudExperience with real-time data streaming technologies (e.G., Apache Kafka, Azure Event Hubs)Exposure to data quality and data governance tools and methodologiesUnderstanding of data virtualization and federated query enginesUnderstanding of performance optimization techniques for cloud-based data warehousesApache Airflow – Workflow orchestration and schedulingCertifications in Azure or AWS or Databricks