Key Responsibilities
- Design, develop, and maintain scalable data pipelines and architectures on cloud and on-premises platforms.
- Implement solutions across Azure Databricks, PySpark, Synapse, Snowflake, AWS, Google BigQuery, and other modern data platforms.
- Develop ETL/ELT workflows using tools such as Informatica IICS, DataStage, Ab Initio, and SAP MDG.
- Integrate and manage master data using Informatica MDM, Reltio MDM, and Stibo.
- Apply AI/ML, Gen AI, and MLOps techniques to enhance data-driven insights and automation.
- Optimize data warehouse performance, ensure data quality, and implement best practices in data management.
- Collaborate with cross-functional teams to translate business requirements into technical solutions.
- Troubleshoot and resolve complex technical issues related to big data and cloud platforms.
- Document architecture, data flows, and processes for knowledge sharing and compliance.
Skills Required
Azure Databricks, PySpark, Azure Synapse, Python, Big Data