What You'll Build
Core Responsibilities
Data Architecture & Infrastructure (40%)
- Design and implement a multi-database architecture (MongoDB, Redis, Milvus, Neo4j, BigQuery)
- Build scalable data pipelines for real-time conversation processing and personalization
- Architect ETL/ELT workflows for data migration from legacy systems
- Implement data partitioning, sharding, and optimization strategies for high-throughput systems
- Create data governance frameworks ensuring quality, security, and compliance

Vector & Graph Database Systems (25%)
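The partitioning and sharding bullet in the Data Architecture list above is easiest to picture with a concrete routing rule. A minimal sketch of hash-based shard routing in pure Python (the shard count and key format are hypothetical, not the production scheme):

```python
import hashlib

NUM_SHARDS = 8  # hypothetical fixed shard count


def shard_for(key: str, num_shards: int = NUM_SHARDS) -> int:
    """Route a record to a shard by hashing its key.

    MD5 is used only for its uniform bit distribution here,
    not for any security property.
    """
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards


# Every record with the same customer key lands on the same shard,
# which keeps per-customer queries on a single node.
assert shard_for("customer:42") == shard_for("customer:42")
assert 0 <= shard_for("customer:42") < NUM_SHARDS
```

A modulo scheme like this reshuffles most keys when the shard count changes; consistent hashing is the usual refinement when shards are added and removed at runtime.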
- Design and optimize Milvus vector collections for semantic search (1024-dim embeddings)
- Build graph schemas in Neo4j for customer journey mapping and persona relationships
- Implement HNSW indexing strategies and similarity search optimization
- Create hybrid search systems combining vector, full-text, and graph queries
- Monitor and tune database performance (query latency, throughput, resource utilization)
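Underneath the HNSW indexing and similarity-search bullets above sits one primitive: ranking stored vectors by cosine similarity to a query. A brute-force sketch in pure Python (toy 3-dim vectors; an HNSW index exists precisely to avoid this O(n) scan over millions of 1024-dim embeddings):

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)


def top_k(query: list[float], collection: dict, k: int = 2) -> list[str]:
    """Exact nearest neighbours by cosine similarity.

    Costs O(n * d) per query; an HNSW index trades a little recall
    for sub-linear search time at collection scale.
    """
    scored = [(cosine(query, vec), doc_id) for doc_id, vec in collection.items()]
    scored.sort(reverse=True)
    return [doc_id for _, doc_id in scored[:k]]


# Toy 3-dim "embeddings" (production vectors are 1024-dim).
docs = {"a": [1.0, 0.0, 0.0], "b": [0.9, 0.1, 0.0], "c": [0.0, 1.0, 0.0]}
print(top_k([1.0, 0.0, 0.0], docs))  # → ['a', 'b']
```

Hybrid search, also mentioned above, layers full-text and graph filters on top of this ranking rather than replacing it.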
ML Data Infrastructure (20%)
- Build data collection pipelines for LLM fine-tuning (conversation logs, tool executions)
- Create feature stores for GNN training (customer interactions, engagement signals)
- Implement data versioning and lineage tracking for ML experiments
- Design A/B testing data infrastructure with CUPED variance reduction
- Build real-time feature computation pipelines for contextual bandits

Analytics & Monitoring (15%)
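The A/B testing bullet in the ML Data Infrastructure list above names CUPED; the core adjustment is small enough to sketch in pure Python (toy data, not the production pipeline):

```python
from statistics import fmean, variance


def cuped_adjust(y: list[float], x: list[float]) -> list[float]:
    """CUPED adjustment: subtract the component of metric y that a
    pre-experiment covariate x already explains, shrinking variance
    while leaving the mean (and the treatment effect) unchanged."""
    x_bar, y_bar = fmean(x), fmean(y)
    cov_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    var_x = sum((xi - x_bar) ** 2 for xi in x)
    theta = cov_xy / var_x
    return [yi - theta * (xi - x_bar) for yi, xi in zip(y, x)]


# Toy data: the experiment metric y correlates with pre-period metric x.
x = [10.0, 12.0, 9.0, 15.0, 11.0, 14.0]
y = [11.0, 13.5, 9.5, 16.0, 12.0, 15.0]
adjusted = cuped_adjust(y, x)
assert variance(adjusted) < variance(y)  # variance strictly drops
```

The data-infrastructure work implied by the bullet is joining each experiment record to its pre-period covariate so this adjustment can run at analysis time.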
- Design BigQuery schemas for marketing analytics and performance tracking
- Create materialized views and aggregation pipelines for real-time dashboards
- Implement data quality monitoring and anomaly detection
- Build observability infrastructure (Prometheus metrics, Grafana dashboards)
- Develop cost optimization strategies for cloud data warehousing

Technical Stack You'll Work With
Databases & Storage
- MongoDB (conversation state, active sessions)
- Redis (caching, rate limiting, real-time data)
- Milvus (vector embeddings, semantic search)
- Neo4j (customer journey graphs, persona networks)
- BigQuery (analytics warehouse, historical data)

Data Processing & Orchestration
- Apache Airflow or Prefect (workflow orchestration)
- Pandas, Polars (data transformation)
- Apache Spark (optional, for large-scale processing)
- dbt (data transformation and modeling)

ML/AI Data Pipeline
- vLLM (LLM inference serving)
- MLflow (model registry, experiment tracking)
- Sentence Transformers (embedding generation)
- PyTorch, TensorFlow (ML model training)

Cloud & Infrastructure
- Google Cloud Platform (BigQuery, Cloud Storage, Compute)
- Docker & Kubernetes (containerization, orchestration)
- Terraform (infrastructure as code)
- GitHub Actions or GitLab CI (CI/CD pipelines)

Programming & Tools
- Python 3.10+ (primary language)
- SQL (complex queries, query optimization)
- Shell scripting (Bash/Zsh)
- Git (version control)

Requirements
Must-Have Skills
- 5+ years of data engineering experience with production systems
- Expert-level SQL and database design skills
- Strong Python programming (async/await, type hints, testing)
- Experience with at least 3 different database technologies (SQL, NoSQL, vector, graph)
- Proven track record building high-scale data pipelines (>1M records/day)
- Deep understanding of data modeling (dimensional, normalized, denormalized)
- Experience with cloud data warehouses (BigQuery, Redshift, or Snowflake)
- Strong knowledge of data quality, validation, and governance
- Excellent debugging and optimization skills

Highly Desirable
- Experience with vector databases (Milvus, Pinecone, Weaviate, Qdrant)
- Experience with graph databases (Neo4j, ArangoDB, Neptune)
- Knowledge of embedding models and semantic search
- Experience with ML data pipelines (feature stores, model training data)
- Understanding of A/B testing and experimental design
- Experience with real-time streaming (Kafka, Pub/Sub, Kinesis)
- Knowledge of LLMs and conversational AI systems
- Experience with data migration projects (especially large-scale)
- Background in marketing technology or customer data platforms

Nice-to-Have
- Experience with PyTorch Geometric or graph neural networks
- Knowledge of marketing analytics (attribution, segmentation, personalization)
- Familiarity with LangChain, LangGraph, or agent frameworks
- Experience with cost optimization in cloud environments
- Contributions to open-source data engineering projects
- Experience with data compliance (GDPR, CCPA)

Key Projects You'll Own
Phase 1: Foundation
- Migrate 10M+ conversation vectors from Pinecone to Milvus
- Design and implement MongoDB schemas for real-time agent state
- Set up Neo4j graph database with customer journey models
- Create BigQuery data warehouse with partitioned tables

Phase 2: Optimization
- Build automated data quality monitoring system
- Implement caching strategies (Redis) for 10x latency reduction
- Optimize vector search queries (target:
- Create real-time analytics dashboards (Grafana)

Phase 3: ML Infrastructure
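The Phase 2 caching bullet above follows the classic cache-aside pattern. A minimal sketch with an in-process dict standing in for Redis (class and function names are hypothetical; a real implementation would use redis-py and a server-side TTL via SETEX):

```python
import time


class TTLCache:
    """Dict-backed stand-in for Redis illustrating cache-aside:
    check the cache first, fall back to the source of truth,
    then populate the cache with a time-to-live."""

    def __init__(self, ttl_seconds: float) -> None:
        self.ttl = ttl_seconds
        self._store: dict[str, tuple[float, object]] = {}

    def get(self, key: str):
        entry = self._store.get(key)
        if entry is None:
            return None
        expires_at, value = entry
        if time.monotonic() >= expires_at:  # lazy expiry, like a Redis TTL
            del self._store[key]
            return None
        return value

    def set(self, key: str, value) -> None:
        self._store[key] = (time.monotonic() + self.ttl, value)


def get_profile(customer_id: str, cache: TTLCache, db: dict) -> dict:
    """Cache-aside read: one slow primary-store hit, then cached reads."""
    cached = cache.get(customer_id)
    if cached is not None:
        return cached
    profile = db[customer_id]  # stands in for the "slow" primary store
    cache.set(customer_id, profile)
    return profile
```

The latency win comes from the second and later reads never touching the primary store until the TTL lapses; the trade-off is serving data up to one TTL stale.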
- Build LLM fine-tuning data pipeline
- Implement feature store for GNN training
- Create A/B testing data infrastructure
- Design multi-armed bandit state management

Work Environment
- Collaborative team: Work with ML engineers, backend developers, and data scientists
- Modern stack: Latest technologies and tools
- Impact: Your work directly affects millions of marketing interactions
- Autonomy: Own your projects end-to-end
- Growth: Clear path to Senior/Lead/Principal roles