Talent.com
Dockerfile Data Validation Engineer - 52945
Dockerfile Data Validation Engineer - 52945Turing • Ghaziabad, IN
Dockerfile Data Validation Engineer - 52945

Dockerfile Data Validation Engineer - 52945

Turing • Ghaziabad, IN
3 days ago
Job description

About Turing :

Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways : first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.

About the Role :

We are seeking an engineer responsible for designing, implementing, and maintaining data-validation workflows inside Docker-based build pipelines . This role involves creating and managing Dockerfile labels, metadata standards, and validation scripts that ensure datasets, schemas, and model artifacts meet quality and compliance requirements before deployment.

You will work closely with data engineering, machine learning, and DevOps teams to build reliable, reproducible, and fully validated containerized data pipelines .

What does day-to-day look like :

  • Develop and optimize Dockerfiles with built-in data-validation steps.
  • Implement LABEL metadata for dataset versions, schemas, and lineage.
  • Create validation scripts (Python / Bash) for schema checks, data integrity, and quality control.
  • Integrate validation steps into CI / CD pipelines and enforce fail-on-bad-data checks.
  • Document standards for Dockerfile labeling, validation logic, and data governance .

Required Skills :

  • Experienced DevOps engineers.
  • Strong experience with Docker & Dockerfiles.
  • Proficiency in Python or Bash for validation scripting.
  • Knowledge of data formats, schemas, and validation tools.
  • Familiarity with CI / CD systems and container registries.
  • Nice to Have :

  • Previous participation in LLM research or evaluation projects.
  • Experience building or testing developer tools or automation agents.
  • Experience with MLOps workflows, data versioning, or Great Expectations.
  • Knowledge of Kubernetes or container security tools.
  • Perks of Freelancing With Turing :

  • Work in a fully remote environment.
  • Opportunity to work on cutting-edge AI projects with leading LLM companies.
  • Offer Details :

  • Commitments Required : At least 4 hours per day and minimum 20 hours per week with overlap of 4 hours with PST. (We have 3 options of time commitment : 20 hrs / week, 30 hrs / week or 40 hrs / week)
  • Employment type : Contractor assignment (no medical / paid leave)
  • Duration of contract : 2-4 weeks; [expected start date is next week]
  • Evaluation Process (approximately 75 mins) :

  • Interviews (30-60 min technical discussion in QODE)
  • Know amazing talent? Refer them at turing.com / referrals , and earn money from your network.

    Create a job alert for this search

    Validation Engineer • Ghaziabad, IN

    Related jobs
    Data Engineer

    Data Engineer

    Vriba Solutions • Ghaziabad, IN
    AWS, Snowflake, Kafka, Airflow, GitHub, PySpark, Python.Design, develop, and maintain scalable ETL / ELT pipelines.Ingest data from various sources (APIs, databases, files, etc.Implement both real-ti...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    Tata Consultancy Services • Delhi, India, India
    TCS PAN INDIA hiring for Microsoft Azure Data Engineer on 22nd Nov(Saturday) through Virtual Mode of Interview !!!!!.Role : Microsoft Azure Data Engineer. Strong hands on with Azure Data Factory (ADF...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    Randstad Enterprise • Ghaziabad, IN
    Shift Timing : 2 : 00 Pm - 11 : 00 Pm.Experience : 2- 4 years relevant Experience only ( this is a Junior position with us ). GCP - 2 years minimum working Experience.Worked with global stakeholders.Ran...Show more
    Last updated: 30+ days ago • Promoted
    Senior Validation Engineer

    Senior Validation Engineer

    Ignitarium • Delhi, India
    Position : Senior Engineer (Post / Pre Silicon Validation Engineer ) Experience : 3 to 9 years Location : Bangalore.Knowledge of one or more Protocols : PCIe, LPDDR, SPI, USB, AXI 2.Knowledge of ARM ...Show more
    Last updated: 15 days ago • Promoted
    Freelance Data Quality Engineer

    Freelance Data Quality Engineer

    Leading MNC • Ghaziabad, IN
    Freelance Data Quality Engineer.The candidate should have a minimum of 8+ yrs.If you're looking for freelance / part time opportunity (along with your day job) & a chance to work with the top 0.You ...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    LTIMindtree • Delhi, India, India
    We are hiring for the position of.GCP (Mandatory), BigQuery, SQL.This role is specifically for candidates with.GCP Data Engineering background. Please apply only if you meet the required criteria.In...Show more
    Last updated: 28 days ago • Promoted
    Data Engineer- ETL development

    Data Engineer- ETL development

    Globus Systems • Ghaziabad, IN
    Company : AA GLOBUSDIGITAL INDIA PRIVATE LIMITED.AA GLOBUSDIGITAL INDIA PRIVATE LIMITED, is a wholly owned subsidiary of Globus Systems Inc US,. Globus Systems was founded by industry executives who ...Show more
    Last updated: 17 days ago • Promoted
    Data Engineer

    Data Engineer

    ShimentoX Technologies • Ghaziabad, IN
    Data Engineer (Strong with Building data connectors).Key Skills : Python, Data Connectors, Metadata, API Integration-Rest / GraphQL. Must have proven background in building data connectors.Experience s...Show more
    Last updated: 1 day ago • Promoted
    Data Engineer

    Data Engineer

    Staffingine LLC • Ghaziabad, IN
    The Data Engineer will be responsible for designing, developing, and optimizing scalable data pipelines and cloud-based data solutions. This role requires strong Python programming skills, expertise...Show more
    Last updated: 7 days ago • Promoted
    Azure Data Engineer

    Azure Data Engineer

    Tata Consultancy Services • Delhi, India, India
    Interview Date : Weekend Virtual Drive 15-Nov-25.Extensive coding experience (3+ years) including Python / Pyspark (1 years +). Experience developing in Azure with key data technologies (e.ADLS, ADF, A...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    IntraEdge • Ghaziabad, IN
    We are seeking a highly skilled Data Engineer with strong experience in Python, PySpark, Snowflake, and AWS Glue to join our growing data team. You will be responsible for building scalable and reli...Show more
    Last updated: 30+ days ago • Promoted
    Model Validation

    Model Validation

    Tata Consultancy Services • Ghaziabad, IN
    Year of Experience- 6 to 15 Years.Location-Bangalore, Hyderabad, Chennai, Pune, Kolkata.As a Senior Quantitative Analytics Associate, you will be responsible for leading independent validations and...Show more
    Last updated: 16 days ago • Promoted
    Data Quality & Test Engineer - Contract

    Data Quality & Test Engineer - Contract

    Gravity Infosolutions, Inc. • Ghaziabad, IN
    Comprehensive testing support covering unit testing, integration testing, system testing, and User Acceptance Testing (UAT) for data pipelines and the overall product. Design, implement, and execute...Show more
    Last updated: 1 day ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Ironbook AI • Ghaziabad, IN
    The ideal candidate will have strong experience with cloud platforms, modern ETL / ELT tools, and deep technical skills in Python, SQL, and distributed data frameworks. Design, develop, and maintain s...Show more
    Last updated: 16 hours ago • Promoted • New!
    GCP Senior Data Engineer

    GCP Senior Data Engineer

    Tata Consultancy Services • Greater Delhi Area, India
    TCS is hiring for GCP Senior Data Engineer.Location : Hyderabad,Bangalore,Chennai,Delhi,Pune,Kolkata.Data Integration, orchestration mechanism, ability to design BI solutions using Cloud Store, Big ...Show more
    Last updated: 5 days ago • Promoted
    Data Engineer

    Data Engineer

    Sikich India • Ghaziabad, IN
    Sikich India is seeking an experienced Data Engineer to join our Data & AI practice.You will design, build, and optimize end-to-end data solutions using Microsoft’s data platforms, including Micros...Show more
    Last updated: 30+ days ago • Promoted
    GCP Data Engineer

    GCP Data Engineer

    Adastra • Ghaziabad, IN
    We are looking for a proactive and solution-oriented GCP Data Engineer to join our team.This role requires hands-on experience in Google Cloud Platform (GCP), especially with BigQuery and Airflow, ...Show more
    Last updated: 17 days ago • Promoted
    Lead Data Engineer

    Lead Data Engineer

    Guidanz Inc • Ghaziabad, IN
    BI Connector is the industry leading solution for integrating Oracle Fusion Cloud data into modern BI platforms like Power BI, Tableau, and Data Warehouse, without complex ETL.Our Data Architecture...Show more
    Last updated: 1 day ago • Promoted