Experience: 7-12 years
Notice period: Immediate joiners only
Mandatory Skills:
Proven experience leading data engineering teams, including distributed teams across multiple geographies and time zones.
Effective at managing cross-team collaboration with architects, product managers, and operations.
Proficiency in Scala and Python.
Apache Spark, batch and streaming (must-have).
Deep knowledge of HDFS internals and migration strategies.
Experience with Apache Iceberg (or similar table formats like Delta Lake / Apache Hudi) for schema evolution, ACID transactions, and time travel.
Experience running Spark and/or Flink jobs on Kubernetes (e.g., Spark-on-K8s operator, Flink-on-K8s).
Experience with distributed object storage such as Ceph, AWS S3, or similar.
Building ingestion, transformation, and enrichment pipelines for large-scale datasets.
Infrastructure-as-Code (Terraform, Helm) for provisioning data infrastructure.
Strong communication skills.
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, and advanced analytics services. Fusing technical vision with business acumen, we enable positive business outcomes for enterprise companies undergoing business transformation by solving their most pressing technical challenges. A key differentiator for Grid Dynamics is our 7+ years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud and DevOps, application modernization, and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.
Senior Data Engineer • Hyderabad, India