Azure Data Engineer
Primary Skills : SQL(Azure), Data Vault, Azure Data Factory, Blob Storage, Azure Synapse, Pipelines, Delta Lake,Databricks
Secondary Skills : Python, Apache Spark, MS Fabric (Trained Knowledge or experience), Power BI
- Proficiency in SQL and Python. SQL being far more important.
- Experience with Apache Spark for data processing (Python version)
- Understanding of Delta files and Lakehouse architecture
- Data warehouse basic knowledge or experience
- Hands-on skills (even gotten only by training) are essential for effectively using Microsoft Fabric.
- Practical experience in setting up end-to-end analytics, managing Lakehouse and Medallion Architecture, using Apache Spark and Delta Lake tables, handling data ingestion with Dataflows Gen2, creating pipelines with Data Factory, and setting up data warehouses is crucial.
- Understand the capabilities of Microsoft Fabric for complete analytics solutions, including data ingestion, transformation, storage, and visualization.
- Familiarity with features such as Direct Lake access for Power BI reports, is important.
- Utilize Apache Spark for large-scale data processing and work with Delta Lake tables for advanced data analytics.
- Ingest data using Dataflows Gen2 and create pipelines with Data Factory capabilities for multi-step data ingestion and transformation tasks.
- Set up and query data warehouses in Microsoft Fabric, integrating them with other analytics components.
- Learn how to secure a Microsoft Fabric data warehouse and administer the platform effectively.