Key Deliverables :
- Design and implement scalable, secure enterprise-grade data platforms
- Build real-time and batch data pipelines to support ML and analytics use cases
- Optimize cloud-based data infrastructure for performance, scalability, and compliance
- Enforce robust data governance and CI / CD practices for high-quality data delivery
Role Responsibilities :
Integrate diverse data sources using distributed systems architectureCollaborate on real-time streaming solutions using Flink, Kafka, or KinesisImplement DevOps DataOps workflows using Terraform, Airflow, and KubernetesWork with stakeholders to ensure data systems align with business and regulatory needsSkills Required
Apache Spark, Kafka, Python, Data Modeling