We're hiring a Data Scientist / Data Engineer to help us turn raw data into reliable datasets, insights, and models that drive real decisions. This role blends strong data engineering (pipelines, quality, orchestration) with hands-on data science (analysis, experimentation, forecasting, ML when needed). You'll work closely with product and engineering teams to build data products that are accurate, scalable, and actionable.
What you'll do
Design and build end-to-end data pipelines (batch and, if applicable, streaming).
Collect, clean, transform, and model data into well-structured datasets for analytics and ML.
Develop and maintain a data warehouse / lake model (dimensional modeling, data marts, curated layers).
Implement data quality checks, observability, lineage, and monitoring.
Perform exploratory analysis and deliver insights via dashboards, notebooks, and stakeholder-ready summaries.
Build and deploy ML models when needed (forecasting, churn/segmentation, anomaly detection, recommendations).
Support experimentation and A/B testing (metric definitions, evaluation, statistical validity).
Collaborate with backend teams to define event schemas, tracking plans, and data contracts.
Optimize performance and cost across storage, compute, and queries.
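To make the "data quality checks" responsibility concrete, here is a minimal standard-library sketch of the kind of validation involved; the column names, rules, and `check_quality` function are illustrative assumptions, not part of our stack.

```python
# Hypothetical data quality check: completeness, uniqueness, and validity
# rules over rows represented as dicts. Field names are illustrative only.

def check_quality(rows, key="order_id", amount="amount"):
    """Return a list of human-readable data quality violations."""
    issues = []
    seen = set()
    for i, row in enumerate(rows):
        # Completeness: the business key must be present.
        if row.get(key) is None:
            issues.append(f"row {i}: missing {key}")
            continue
        # Uniqueness: the business key must not repeat.
        if row[key] in seen:
            issues.append(f"row {i}: duplicate {key}={row[key]}")
        seen.add(row[key])
        # Validity: amounts must be non-negative numbers.
        value = row.get(amount)
        if not isinstance(value, (int, float)) or value < 0:
            issues.append(f"row {i}: invalid {amount}={value!r}")
    return issues

rows = [
    {"order_id": 1, "amount": 10.0},
    {"order_id": 1, "amount": -5.0},    # duplicate key and negative amount
    {"order_id": None, "amount": 3.0},  # missing key
]
print(check_quality(rows))
```

In practice checks like these would run inside the orchestrator (Airflow / Dagster / Prefect) and feed the observability and monitoring layers.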
Must-have skills
Strong SQL and solid programming skills (Python preferred).
Experience building pipelines using tools like Airflow / Dagster / Prefect (or equivalent).
Strong knowledge of data modeling (star schema, slowly changing dimensions, event modeling).
Experience with at least one of: PostgreSQL / MySQL / BigQuery / Snowflake / Redshift.
Proven ability to validate data correctness and implement data quality frameworks.
Comfortable communicating insights and technical trade-offs to non-technical stakeholders.
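For candidates less familiar with the "slowly changing dimensions" requirement, this is a minimal in-memory sketch of Type-2 SCD logic; the table shape, field names, and `scd2_apply` helper are assumptions for illustration, not a real warehouse API.

```python
from datetime import date

# Type-2 SCD sketch: when a tracked attribute changes, close out the
# current dimension row and append a new versioned row.

def scd2_apply(dim_rows, update, today=None):
    """dim_rows: dicts with customer_id, city, valid_from, valid_to, is_current."""
    today = today or date.today()
    for row in dim_rows:
        if row["customer_id"] == update["customer_id"] and row["is_current"]:
            if row["city"] == update["city"]:
                return dim_rows  # no change: nothing to version
            row["valid_to"] = today      # close out the old version
            row["is_current"] = False
            break
    dim_rows.append({
        "customer_id": update["customer_id"],
        "city": update["city"],
        "valid_from": today,
        "valid_to": None,
        "is_current": True,
    })
    return dim_rows

dim = [{"customer_id": 7, "city": "Delhi",
        "valid_from": date(2024, 1, 1), "valid_to": None, "is_current": True}]
scd2_apply(dim, {"customer_id": 7, "city": "Mumbai"}, today=date(2025, 1, 1))
```

After the update, the Delhi row is closed (`valid_to` set, `is_current` False) and a new current Mumbai row exists, preserving full history.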
Nice-to-have skills
Streaming: Kafka / Kinesis / PubSub, real-time processing (Spark Streaming / Flink).
Big data: Spark, distributed compute, partitioning strategies.
Lakehouse: Iceberg / Delta / Hudi, object storage (S3 / GCS / Azure Blob).
MLOps: MLflow, model monitoring, feature stores, deployment pipelines.
BI: Superset / Power BI / Looker / Metabase, semantic layers.
Cloud : AWS / Azure / GCP (IAM, networking basics, managed data services).
Experience with privacy / security compliance (PII handling, retention policies, access controls).
What we value
Ownership: you build reliable systems, not just one-off scripts.
Curiosity: you ask the "why" behind metrics and propose better approaches.
Practicality: you can balance speed vs. correctness and deliver iteratively.
Strong collaboration with engineers, product, and leadership.
Data Scientist • Delhi, India