Talent.com
Data Engineer
Data EngineerAceolution • baddi, himachal pradesh, in
No longer accepting applications
Data Engineer

Data Engineer

Aceolution • baddi, himachal pradesh, in
30+ days ago
Job description

Job Title : Data Engineer – Python Expert(Freelance Role)

Location : Remote / Hybrid

Employment Type : Contract / Freelance

Role Summary

We are looking for a seasoned Senior Data Engineer to architect, build, and own the data pipelines that power our large language model (LLM) development. As a senior Individual Contributor (IC), you will be the team's expert on data ingestion, processing, and quality for all AI training.

Your primary mission is to build scalable, automated systems that transform massive, raw datasets into pristine, model-ready formats. While your focus will be on data engineering, your expertise will be valued in collaborating on model training runs and experiments. You're the perfect fit if you are a Python expert who thrives on solving large-scale data challenges and enjoys working at the intersection of data engineering and machine learning.

Key Responsibilities

Architect & Build : Design, develop, and own robust, scalable, and automated ETL / ELT pipelines in Python for ingesting and processing terabyte-scale text datasets.

Data Quality : Implement rigorous data cleaning, deduplication, filtering, and normalization strategies. Define and enforce data quality standards to ensure the highest integrity for model training.

Data Transformation : Efficiently structure and format diverse datasets (JSON, Parquet, etc.) for consumption by LLM training frameworks.

Collaboration : Work closely with our team of AI researchers and ML engineers to understand data requirements, define metrics, and support the model training lifecycle.

Optimization : Continuously optimize data processing workflows for speed, cost, and reliability.

ML Support (Secondary) : Occasionally assist in launching, monitoring, and debugging data-related issues during model training runs.

Required Qualifications

8+ years of professional experience in data engineering, data processing, or backend software engineering.

Expert-level proficiency in Python and its data ecosystem (e.g., Pandas, NumPy, Dask, Polars).

Proven experience building and maintaining large-scale data pipelines.

Deep understanding of data structures, data modeling, and software engineering best practices (Git, CI / CD, testing).

Experience handling and parsing diverse data formats (JSON, CSV, XML, Parquet) at scale.

Excellent problem-solving skills and a meticulous attention to detail.

Strong communication and collaboration skills, with experience working in a team environment.

Preferred Qualifications (Nice-to-Haves)

Hands-on experience with the data preprocessing pipeline for an LLM (e.g., LLaMA, BERT, GPT-family).

Strong experience with big data frameworks like Apache Spark or Ray.

Experience with Hugging Face libraries (Transformers, Datasets, Tokenizers).

Familiarity with ML frameworks like PyTorch or TensorFlow.

Proficiency with cloud platforms (AWS, GCP, Azure) and their data / storage services.

Why Join Us

  • Opportunity to lead cutting-edge AI and ML projects.
  • Collaborative and innovative team culture.
  • Competitive compensation with continuous learning opportunities.

📩 If you are interested, please share your updated CV to sharmila@aceolution.com along with your expected rate per hour.

Create a job alert for this search

Data Engineer • baddi, himachal pradesh, in

Related jobs
Aws Data Engineer (Remote)

Aws Data Engineer (Remote)

Mindcraft Labs • Baddi, Republic Of India, IN
Remote
This role focuses on building and maintaining data pipelines and analytics infrastructure on AWS.You will work daily with S3, Glue, Redshift, Athena, Lake Formation, Airflow, SNS / SQS, and Postgres ...Show more
Last updated: 2 hours ago • Promoted • New!
Azure Databricks Engineer

Azure Databricks Engineer

Philodesign Technologies Inc • baddi, India
Job Opening : Azure Databricks Engineer (7+ Years Experience) | Budget : ₹1 LPM | Remote / Hybrid.We are seeking a highly skilled. This role is ideal for professionals who have deep expertise in buildin...Show more
Last updated: 10 hours ago • Promoted • New!
GCP Big Data Engineer (Full-time at a Fortune 500 tech MNC )

GCP Big Data Engineer (Full-time at a Fortune 500 tech MNC )

HARP • baddi, India
We are looking for an experienced and motivated.The ideal candidate will have 5 years of relevant experience in data engineering, with a strong focus on. This role requires strong technical expertis...Show more
Last updated: 30+ days ago • Promoted
AWS AI / ML Engineer (Remote)

AWS AI / ML Engineer (Remote)

Mindcraft Labs • baddi, India
Remote
This is a hands-on engineering role focused on building and maintaining AI and ML services on AWS.You will help turn ideas and prototypes into robust, production-ready APIs and ML flows using Amazo...Show more
Last updated: 4 hours ago • Promoted • New!
Data Engineer

Data Engineer

System Soft Technologies • baddi, India
Location : Remote (3–4-hour time zone overlaps with EST if off shore).Experience with next flow is required, as the consultant will make targeted enhancements to existing workflows and pipelines.Whi...Show more
Last updated: 3 days ago • Promoted
Data Engineer

Data Engineer

Staffingine LLC • baddi, India
The Data Engineer will be responsible for designing, developing, and optimizing scalable data pipelines and cloud-based data solutions. This role requires strong Python programming skills, expertise...Show more
Last updated: 4 hours ago • Promoted • New!
AI Data Engineer - 17852

AI Data Engineer - 17852

Turing • baddi, India
We’re looking for experienced AI data engineers skilled in Python to collaborate with one of the world’s top Large Language Model (LLM) companies. Your work will directly help improve how AI models ...Show more
Last updated: 21 days ago • Promoted
Senior Data Engineer

Senior Data Engineer

Arenema • baddi, India
India (remote – Bangalore / Karnataka area preferred).Full-time contractor / employee.You will be a core member of the team building a data platform that maps economic, advertising and real-estate ac...Show more
Last updated: 4 hours ago • Promoted • New!
Gcp Big Data Engineer

Gcp Big Data Engineer

HARP • Baddi, Republic Of India, IN
We are looking for an experienced and motivated.The ideal candidate will have 5 years of relevant experience in data engineering, with a strong focus on. This role requires strong technical expertis...Show more
Last updated: 30+ days ago • Promoted
Data Engineer(Intern)

Data Engineer(Intern)

Tech Phoenix • Baddi, Republic Of India, IN
Tech Phoenix is a data science and AI startup that delivers cutting-edge solutions to clients across industries.We take on challenging projects ranging from advanced analytics and machine learning ...Show more
Last updated: 1 day ago • Promoted
AI Engineer (Data Pipelines & RAG)

AI Engineer (Data Pipelines & RAG)

BeGig • baddi, India
Job Role- AI Engineer (Data Pipelines & RAG).Work Mode- Remote(6 days working).We are looking for a hands-on AI / Data Engineer (4–7 years) to build and scale data pipelines powering GenAI and agenti...Show more
Last updated: 10 days ago • Promoted
Ai Data Engineer - 17852

Ai Data Engineer - 17852

Turing • Baddi, Republic Of India, IN
We’re looking for experienced AI data engineers skilled in Python to collaborate with one of the world’s top Large Language Model (LLM) companies. Your work will directly help improve how AI models ...Show more
Last updated: 22 days ago • Promoted
Ai Engineer

Ai Engineer

BeGig • Baddi, Republic Of India, IN
Job Role- AI Engineer (Data Pipelines & RAG).Work Mode- Remote(6 days working).We are looking for a hands-on AI / Data Engineer (4–7 years) to build and scale data pipelines powering GenAI and agenti...Show more
Last updated: 10 days ago • Promoted
AI Engineer

AI Engineer

NyxaLabs • baddi, India
We're seeking an exceptional AI Engineer with deep expertise in TensorFlow model training to design and build next-generation AI systems. This role focuses on developing sophisticated machine learni...Show more
Last updated: 10 hours ago • Promoted • New!
AWS Data Engineer (Remote)

AWS Data Engineer (Remote)

Mindcraft Labs • baddi, India
Remote
This role focuses on building and maintaining data pipelines and analytics infrastructure on AWS.You will work daily with S3, Glue, Redshift, Athena, Lake Formation, Airflow, SNS / SQS, and Postgres ...Show more
Last updated: 4 hours ago • Promoted • New!
Senior Snowflake + DBT Engineer (8+ Years)

Senior Snowflake + DBT Engineer (8+ Years)

MindBrain • baddi, India
Job Opportunity : Snowflake + DBT Engineer.We are seeking a highly skilled Snowflake + DBT Engineer to design, build, and optimize scalable cloud-based data platforms. The ideal candidate will have s...Show more
Last updated: 4 hours ago • Promoted • New!
Snowflake Data Engineer

Snowflake Data Engineer

Live Connections • baddi, India
Role - Snowflake Data Engineer.Required Notice Period - Immediate Joiner.To apply, connect with Abhishek via.Show more
Last updated: 12 days ago • Promoted
Data Center Engineer

Data Center Engineer

Estarta Solutions • baddi, India
Job Title : Datacenter Engineer.Estarta is seeking a skilled Datacenter Engineer to support Cisco’s Customer Delivery Engineering function. The role focuses on delivering high-quality technical solut...Show more
Last updated: 23 days ago • Promoted