Senior Data Engineer (Python Coder)
MyRemoteTeam Inc • Mangalore, IN
2 days ago
Job description

About Us

MyRemoteTeam, Inc. is a fast-growing distributed workforce enabler that helps companies scale with top global talent. We provide businesses with world-class software engineers, operations support, and infrastructure to help them grow faster and better.

Position: Senior Data Engineer (Python Coder)

Location: India (Remote)

Work Commitment: 40 hrs/week (full-time)

Contract Duration: 3-6 months

Client: Wipro (Google)

BGV (background verification): Yes

Role: Senior Data Engineer (Python Coder)

Experience: Minimum 8 years

Role Summary

We are looking for a seasoned Senior Data Engineer to architect, build, and own the data pipelines that power our large language model (LLM) development. As a senior Individual Contributor (IC), you will be the team's expert on data ingestion, processing, and quality for all AI training.

Your primary mission is to build scalable, automated systems that transform massive, raw datasets into pristine, model-ready formats. While your focus will be on data engineering, your expertise will be valued in collaborating on model training runs and experiments. You're the perfect fit if you are a Python expert who thrives on solving large-scale data challenges and enjoys working at the intersection of data engineering and machine learning.

Key Responsibilities

Architect & Build: Design, develop, and own robust, scalable, and automated ETL/ELT pipelines in Python for ingesting and processing terabyte-scale text datasets.

Data Quality: Implement rigorous data cleaning, deduplication, filtering, and normalization strategies. Define and enforce data quality standards to ensure the highest integrity for model training.

Data Transformation: Efficiently structure and format diverse datasets (JSON, Parquet, etc.) for consumption by LLM training frameworks.

Collaboration: Work closely with our team of AI researchers and ML engineers to understand data requirements, define metrics, and support the model training lifecycle.

Optimization: Continuously optimize data processing workflows for speed, cost, and reliability.

ML Support (Secondary): Occasionally assist in launching, monitoring, and debugging data-related issues during model training runs.

Required Qualifications

  • 8+ years of professional experience in data engineering, data processing, or backend software engineering.
  • Expert-level proficiency in Python and its data ecosystem (e.g., Pandas, NumPy, Dask, Polars).
  • Proven experience building and maintaining large-scale data pipelines.
  • Deep understanding of data structures, data modeling, and software engineering best practices (Git, CI/CD, testing).
  • Experience handling and parsing diverse data formats (JSON, CSV, XML, Parquet) at scale.
  • Excellent problem-solving skills and meticulous attention to detail.
  • Strong communication and collaboration skills, with experience working in a team environment.

Preferred Qualifications (Nice-to-Haves)

  • Hands-on experience with the data preprocessing pipeline for an LLM (e.g., LLaMA, BERT, GPT-family).
  • Strong experience with big data frameworks like Apache Spark or Ray.
  • Experience with Hugging Face libraries (Transformers, Datasets, Tokenizers).
  • Familiarity with ML frameworks like PyTorch or TensorFlow.
  • Proficiency with cloud platforms (AWS, GCP, Azure) and their data/storage services.