Talent.com
Senior Data Pipeline Engineer

ClearDemand
Republic of India, IN
1 day ago
Job description

Job Summary

  • Building on the foundation of the SDE-I role, the DE-II position takes on a greater level of responsibility and leadership.
  • You'll play a crucial role in driving the evolution and efficiency of our data collection and analytics platform, capable of handling terabyte-scale data and billions of data points.

Key Responsibilities

  • Lead the design, development, and optimization of large-scale data pipelines and infrastructures using technologies like Apache Airflow, Spark, Kafka, and more.
  • Architect and implement distributed data processing solutions to handle terabyte-scale datasets and billions of records efficiently across multi-region cloud infrastructure (AWS, GCP, DigitalOcean).
  • Develop and maintain real-time data processing solutions for high-volume data collection operations using technologies like Spark Streaming and Kafka.
  • Optimize data storage strategies using technologies such as Amazon S3, HDFS, and Parquet / Avro file formats for efficient querying and cost management.
  • Build and maintain high-quality ETL pipelines, ensuring robust data collection and transformation processes with a focus on scalability and fault tolerance.
  • Collaborate with data analysts, researchers, and cross-functional teams to define and maintain data quality metrics, implement robust data validation, and enforce security best practices.
  • Mentor junior engineers (SDE-I) and foster a collaborative, growth-oriented environment.
  • Participate in technical discussions, contribute to architectural decisions, and proactively identify improvements for scalability, performance, and cost-efficiency.
  • Ensure application performance monitoring (APM) is in place, utilizing tools like Datadog, New Relic, or similar to proactively monitor and optimize system performance, detect bottlenecks, and ensure system health.
  • Implement effective data partitioning strategies and indexing for performance optimization in distributed databases such as DynamoDB, Cassandra, or HBase.
  • Stay current with advancements in data engineering, orchestration tools, and emerging cloud technologies, continually enhancing the platform’s capabilities.

Qualifications & Experience

  • 4-5+ years of hands-on experience with Apache Airflow and other orchestration tools for managing large-scale workflows and data pipelines.
  • Expertise in AWS technologies (Athena, AWS Glue, DynamoDB), Apache Spark, PySpark, SQL, and NoSQL databases.
  • Experience in designing and managing distributed data processing systems that scale to terabyte-scale, billion-record datasets using cloud platforms like AWS, GCP, or DigitalOcean.
  • Proficiency in web crawling tools and technologies, including Node.js, HTTP protocols, Puppeteer, Playwright, and Chromium, for large-scale data extraction.
  • Experience with monitoring and observability tools such as Grafana, Prometheus, and Elasticsearch, and familiarity with monitoring and optimizing resource utilization in distributed systems.
  • Strong understanding of infrastructure as code using Terraform, automated CI/CD pipelines with Jenkins, and event-driven architecture with Kafka.
  • Experience with data lake architectures and optimizing storage using formats such as Parquet, Avro, or ORC.
  • Strong background in optimizing query performance and data processing frameworks (Spark, Flink, or Hadoop) for efficient data processing at scale.
  • Knowledge of containerization (Docker, Kubernetes) and orchestration for distributed system deployments.
  • Deep experience in designing resilient data systems with a focus on fault tolerance, data replication, and disaster recovery strategies in distributed environments.
  • Strong data engineering skills, including ETL pipeline development, stream processing, and distributed systems.
  • Excellent problem-solving abilities, with a collaborative mindset and strong communication skills.