We are looking for an experienced GIS Data Engineer (6+ years) who can architect and implement large-scale geospatial data pipelines. The ideal candidate will bring solid expertise in Spark / Glue / EMR, geospatial formats, and AWS data services to support ingestion, transformation, and optimization of complex GIS datasets.
Key Responsibilities :
- Design and implement end-to-end data pipelines for GIS datasets.
- Build ingestion & validation frameworks to handle schema drift, standardization, quarantine, and data quality auditing.
- Transform geospatial data into GeoParquet and load it into Aurora / PostGIS using DMS or equivalent approaches.
- Automate schema management, data lineage, and dataset discovery with AWS Glue Catalog.
- Optimize pipelines for scalability, performance, cost efficiency, and reusability.
- Collaborate closely with geospatial analysts and data science teams to deliver production-ready GIS solutions.
- Develop IaC templates using Terraform / YAML / JSON for consistent deployments.
- Ensure best practices in security, governance, and monitoring for data pipelines.
Required Skills & Expertise :
- 6+ years of proven experience in Data Engineering, ETL / ELT pipeline development.
- Strong programming skills in Python or Scala, with hands-on Spark development.
- Deep knowledge of AWS services: S3, Glue, EMR, Athena, Aurora, DMS, Lambda.
- Experience working with geospatial datasets and formats (Shapefile, GeoJSON, Parquet, GeoParquet).
- Strong understanding of data lake / lakehouse architectures.
- Hands-on with Infrastructure as Code (Terraform).
- Proficiency with schema evolution, governance, metadata management, and data audits.
Preferred Background :
- Experience in Geospatial Data Engineering, handling large-scale spatial datasets.
- Familiarity with PostGIS spatial queries, geometry operations, and indexing strategies.
- Exposure to DevOps, CI / CD pipelines, and monitoring solutions (e.g., CloudWatch).
- Strong problem-solving mindset with ability to work in fast-paced, agile environments.
Why Join Us?
- Cutting-edge exposure to geospatial data engineering at scale.
- Collaborative, innovation-driven, and growth-focused work culture.
- Flexible work location across major Indian cities (Delhi, Gurgaon, Bangalore, Pune, Hyderabad).
- Competitive compensation & benefits package.
- Opportunity to work with global clients and large-scale datasets.
(ref: hirist.tech)