Position : Snowflake / Data Vault 2.0 Developer
Location : Remote –EST hours
Length : 4+ months
Notes :
Project involves implementing Data Vault 2.0 and building from the ground up.
- Work includes specific pipeline development in Snowflake with Streamlit.
- Not a traditional star schema – approach is based on Snowflake methodology and Data Vault 2.0.
- Replication approach required across environments (development, production, data structure).
- Must be able to integrate and code along the way to ensure AI functionality at the end.
- Extra steps and integrations involved – requires someone experienced in Snowflake and Data Vault 2.0.
We are seeking a Snowflake-focused Data Engineer with deep expertise in IoT pipelines, ML integration, and containerized applications on Snowpark Container Services (SPCS) . This is not a duplicate of EDM’s function : the role embeds applied engineering inside our department, ensuring we can move faster on ML, containerized applications, and complex pipeline orchestration.
Key Responsibilities
1. Snowflake IoT Data Pipelines (Batch + Streaming)
Design, implement, and optimize IoT data ingestion pipelines (Snowflake Streams, Snowpipe, Tasks, OpenFlow orchestration).Support SCD Type 1, Type 2, and Type 3 pipelines for incremental data processing.Handle both raw sensor telemetry and third-party API integration.2. Machine Learning Operations (Snowpark + Cortex AI)
Deploy, monitor, and optimize ML models (e.g., fouling detection, fuel prediction, itinerary optimization) directly in Snowflake.Assisting with Cortex AI services for advanced analytics and semantic enrichment.3. Containerized Applications on Snowflake SPCS
Build and deploy CI / CD native containerized applications in Snowflake SPCS (e.g., custom APIs, advanced analytics services, routing optimizers).Manage application lifecycle : packaging, scaling, monitoring, and troubleshooting.Integrate containerized services with Snowflake data assets and downstream applications.4. Collaboration & EDM Alignment
Work with EDM on procedural alignment but extend scope into ML + IoT streaming. Source data access alignment.Manage permissions, roles, and cross-account access provisioning in collaboration with EDM and Blue Cloud vendors.Document processes and provide reproducible, automated deployment standards. Troubleshooting previous issue for ML or Cortex based applications among end users.