Greetings from Teknikoz
Experience : 9+Years
Roles and Responsibilities
Advanced PySpark (Expert Level - Required)
- Mastery of DataFrame APIs and Spark SQL
- Performance tuning : broadcast joins, caching strategies, shuffle reduction
- Handling skewed data and optimizing wide transformations
- Writing scalable, testable PySpark code with modular design
Databricks Expertise (Expert Level - Required)
Cluster sizing, autoscaling, and spot instance optimizationJob orchestration using Workflows with retries, alerts, and dependenciesDelta Lake :OPTIMIZE with Z-OrderingMERGE operations and schema evolutionTime travel and versioningUse of Unity Catalog for data governance and access controlMonitoring via Databricks metrics and audit logsPostgreSQL (Required)
Query profiling and execution plan analysisIndexing strategies for analytical workloadsTable partitioning and parallel query execution