Talent.com
Principal Data Pipeline Engineer

Principal Data Pipeline Engineer

SAIVA AIRepublic Of India, IN
30+ days ago
Job description

We are building the future of healthcare analytics. Join us to design, build, and scale robust data pipelines that power nationwide analytics and support our machine learning systems. Our goal : pipelines that are reliable, observable, and continuously improving in production.

This is a fully remote role, open to candidates based in Europe or India, with periodic team gatherings in Mountain View, California.

What You’ll Do

  • Design, build, and maintain scalable ETL pipelines using Python (Pandas, PySpark) and SQL, orchestrated with Airflow (MWAA).
  • Develop and maintain the SAIVA Data Lake / Lakehouse on AWS, ensuring quality, governance, scalability, and accessibility.
  • Run and optimize distributed data processing jobs with Spark on AWS EMR and / or EKS.
  • Implement batch and streaming ingestion frameworks (APIs, databases, files, event streams).
  • Enforce validation and quality checks to ensure reliable analytics and ML readiness.
  • Monitor and troubleshoot pipelines with CloudWatch, integrating observability tools like Grafana, Prometheus, or Datadog.
  • Automate infrastructure provisioning with Terraform, following AWS best practices.
  • Manage SQL Server, PostgreSQL, and Snowflake integrations into the Lakehouse.
  • Participate in an on-call rotation to support pipeline health and resolve incidents quickly.
  • Write production-grade code, and contribute to design / code reviews and engineering best practices.

What We’re Looking For

  • 5+ years in data engineering, ETL pipeline development, or data platform roles (flexible for exceptional candidates).
  • Experience designing and operating data lakes or Lakehouse architectures on AWS (S3, Glue, Lake Formation, Delta Lake, Iceberg).
  • Strong SQL skills with PostgreSQL, SQL Server, and at least one AWS cloud warehouse (Snowflake or Redshift).
  • Proficiency in Python (Pandas, PySpark);
  • Scala or Java a plus.

  • Hands-on with Spark on AWS EMR and / or EKS for distributed processing.
  • Strong background in Airflow (MWAA) for workflow orchestration.
  • Expertise with AWS services : S3, Glue, Lambda, Athena, Step Functions, ECS, CloudWatch.
  • Proficiency with Terraform for IaC;
  • familiarity with Docker, ECS, and CI / CD pipelines.

  • Experience building monitoring, validation, and alerting into pipelines with CloudWatch, Grafana, Prometheus, or Datadog.
  • Strong communication skills and ability to collaborate with data scientists, analysts, and product teams.
  • A track record of delivering production-ready, scalable AWS pipelines, not just prototypes.
  • Create a job alert for this search

    Data Pipeline Engineer • Republic Of India, IN

    Related jobs
    • Promoted
    • New!
    Principal Data Engineer

    Principal Data Engineer

    CodeMyMobileIndia, India
    Experience Required - 7 to 10 Years.Are you a Data Engineer who cares about clean engineering, autonomy, and solving real data challenges? If this sounds like you, we’d love to connect!.Email your ...Show moreLast updated: 20 hours ago
    • Promoted
    Data Engineer

    Data Engineer

    DigitalzoneNagpur, IN
    As a Data Engineer, you will design, build, and optimize data pipelines and real-time systems that power AI-driven decisioning and analytics. Develop and maintain scalable ETL / ELT pipelines using Py...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer

    Data Engineer

    Response Informaticsnagpur, maharashtra, in
    AWS services : Must be proficient in building scalable data pipelines and managing cloud-native ETL workflows.Snowflake : Moderate understanding of Snowflake architecture. CICD - Terraform or CloudFo...Show moreLast updated: 30+ days ago
    • Promoted
    Backend and Data Pipeline Engineer

    Backend and Data Pipeline Engineer

    JRD SystemsNagpur, IN
    Job Role : Backend and Data Pipeline Engineer - Python.We’re investing in technology to develop new products that help our customers drive their growth and transformation agenda.These include new da...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer(Azure / AWS, Python / Pyspark, SQL)

    Data Engineer(Azure / AWS, Python / Pyspark, SQL)

    Sail Analyticsnagpur, maharashtra, in
    Architect, develop, test and maintain scalable data warehouses and data pipelines.Expertise in SQL, PySpark / Python and Azure(ADB, ADF) or AWS(Glue, Lambda, Redshift). Bachelor's degree or equivalent...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer (AWS / Databricks)

    Senior Data Engineer (AWS / Databricks)

    Accoladenagpur, maharashtra, in
    The multifamily real estate industry is undergoing a massive transformation, and Accolade is at the forefront.We are building the industry's first AI-native Operations Centralization Platform, desi...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    RecroNagpur, IN
    Data Pipeline Engineering : Design, build, and maintain ingestion, transformation, and storage pipelines using Azure Data Factory, Synapse Analytics, and Data Lake. AI Data Enablement : Collaborate wi...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Canopus Infosystems - A CMMI Level 3 CompanyNagpur, IN
    Python expertise and hands-on experience in handling large datasets, data cleaning, analysis, and visualization.The ideal candidate should be capable of building data pipelines, performing web scra...Show moreLast updated: 30+ days ago
    • Promoted
    AWS Data Engineer

    AWS Data Engineer

    TerraGiGNagpur, IN
    Design, development, and implementation of performant ETL pipelines using python API (pySpark) of Apache Spark on AWS EMR. Writing reusable, testable, and efficient code.Integration of data storage ...Show moreLast updated: 12 days ago
    • Promoted
    Python Data Engineer

    Python Data Engineer

    iVoyantNagpur, IN
    One of our clients is looking for an experienced Python Data Engineer to join their team.Strong Python Experience (web services, background jobs. we use Fast API).Data processing and reporting usin...Show moreLast updated: 12 days ago
    • Promoted
    Python Data Engineer

    Python Data Engineer

    Dexian Indianagpur, maharashtra, in
    Designing and building optimized data pipelines using cutting-edge technologies in a cloud environment to drive analytical insights. Constructing infrastructure for efficient ETL processes from vari...Show moreLast updated: 1 day ago
    • Promoted
    Senior Data Engineer – ETL & Pipeline Development (7 to 12 yrs)

    Senior Data Engineer – ETL & Pipeline Development (7 to 12 yrs)

    AIMLEAPNagpur, IN
    Senior Data Engineer – ETL & Pipeline Development.Remote (Work from Home) / Bangalore / India.Tech / MCA / Computer Science / IT. IT / Data / AI / LegalTech / Enterprise Solutions.Pandas, Airflow, o...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer

    Data Engineer

    Veraxionnagpur, maharashtra, in
    Python, Spark, DBT, and AWS-native services.Agile environment to deliver scalable, secure, and high-performance data solutions. Python, DBT, and AWS services (Data Ops Live).Deliver end-to-end data ...Show moreLast updated: 12 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Dexian Indianagpur, maharashtra, in
    Designing and building optimized data pipelines using cutting-edge technologies in a cloud environment to drive analytical insights. Constructing infrastructure for efficient ETL processes from vari...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer II

    Data Engineer II

    ClearDemandNagpur, IN
    Building on the foundation of the SDE-I role, the DE- II position takes on a greater level of responsibility and leadership. You'll play a crucial role in driving the evolution and efficiency of our...Show moreLast updated: 1 day ago
    • Promoted
    AWS Data Engineer

    AWS Data Engineer

    Tata Consultancy Servicesnagpur, maharashtra, in
    Aws data engineer having strong experience of Python.Technical / Behavioral Competency.Proficient in Python, with experience in deploying Python packages and OOP, Experience in ingesting data from di...Show moreLast updated: 9 days ago
    • Promoted
    Azure Data Engineer

    Azure Data Engineer

    LTIMindtreenagpur, maharashtra, in
    Role Senior Data Engineer 8 years of experience.Build reusable utilities templates and automation pipelines.Design scalable data engineering frameworks standards and best practices.Provide architec...Show moreLast updated: 30+ days ago
    • Promoted
    Freelance Data Engineer

    Freelance Data Engineer

    upGradNagpur, IN
    We are seeking a highly skilled and motivated.The ideal candidate will be responsible for designing, developing, and optimizing large-scale data pipelines and data warehouse solutions, utilizing a ...Show moreLast updated: 21 days ago