This role is for one of Weekday’s clients
Min Experience: 5 years
Location: Remote (India)
Job Type: Full-time
Requirements
We’re looking for a Senior Data Engineer to support enterprise-grade AI solution deployments. This role will work closely with the AI / ML, Strategy & Cloud teams to build scalable data infrastructure, pipelines, and governance mechanisms—especially within the Azure OpenAI ecosystem.
Key Responsibilities
- Collaborate with AI Engineers to define data needs (structured, unstructured, real-time, batch)
- Ingest, clean & transform datasets into AI-ready formats
- Build ETL / ELT pipelines using Azure-native tools
- Manage embedding / vector databases (Pinecone, FAISS, Azure Cognitive Search) for RAG models
- Enable data ingestion from APIs, DBs, SharePoint & other sources
- Work with Cloud / DevOps teams to operationalize AI data pipelines
- Optimize infrastructure costs & performance for AI workloads
- Ensure data governance, security & compliance (e.g., GDPR)
Required Skills
- 5+ years in Data Engineering (enterprise environments)
- Strong expertise in Azure Data Factory, Synapse, Data Lake, Event Hubs, Databricks
- Proficient in Python, SQL, PySpark
- Experience building AI / ML data pipelines (NLP, embeddings, unstructured data)
- Knowledge of data modeling, pipeline orchestration, CI / CD
- Familiarity with embedding / vector stores for GenAI
Good to Have
- Azure Data Engineer Associate certification
- Experience with RAG-based AI architectures
- Bachelor’s / Master’s in CS, Data Engineering, or related field