Job Summary :
BCSS is seeking a Databricks Data Engineer to support its enterprise-wide Sustainability initiative. The engineer will be responsible for building data pipelines and models to support product-level carbon footprint analysis. This role involves integrating structured engineering, manufacturing, and supplier data into a unified model using Databricks on AWS.
Key Responsibilities :
- Develop and optimize ETL / ELT pipelines on Databricks using PySpark and SQL to support carbon footprint analytics.
- Build data models that combine engineering (EBOM), manufacturing (MBOM), supplier, and factory operations data to generate emissions metrics.
- Integrate and transform data from :
- MBOM from SAP, including multilevel BOM explosion logic
- EBOM from Oracle-based systems (no explosion required)
- Supplier environmental data (e.g., material-level emissions)
- Factory data (e.g., energy consumption, material usage)
- Collaborate with sustainability analysts, engineering teams, and supply chain stakeholders to translate carbon calculation logic into data transformations.
- Ensure high data quality, lineage, and governance using Delta Lake, Unity Catalog, and standard best practices.
- Leverage AWS services such as S3, Glue, and Athena for orchestration and storage.
- Document data logic and workflows for traceability and compliance with ESG standards.
Technical Requirements :
Hands-on experience with Databricks (Delta Lake, Unity Catalog, Jobs, Workflows)Strong skills in PySpark and SQLExperience with SAP MBOM structures, especially multilevel BOM explosionUnderstanding of Oracle-based EBOM systems and ability to integrate structured dataFamiliarity with AWS data ecosystem (S3, Glue, Lambda, Athena)Strong knowledge of data modeling, pipeline optimization, and performance tuningPreferred Qualifications :
Experience in carbon accounting, lifecycle analysis, or sustainability-focused data projectsUnderstanding of manufacturing data and supply chain operationsExposure to SAP ECC / S / 4HANA and Oracle-based PLM or engineering systemsExperience with version control, CI / CD (e.g., Git, Databricks Repos)