Talent.com
Data Pipeline Engineer

Data Pipeline Engineer

EXLHaryāna, Republic Of India, IN
15 hours ago
Job description

Job Description

We are looking for a Python & PySpark developer and data engineer who can design and build solutions for one of our Fortune 500 Client programs in the realm of Financial Master & Reference Data Management. This is high visibility, fast-paced key initiative will integrate data across internal and external sources, provide analytical insights, and integrate with the customer’s critical systems.

Key Responsibilities

  • Ability to design, build and unit test applications on Spark framework on Python.
  • Build Python and PySpark based applications based on data in both Relational (Oracle, SQL Server) and NoSQL database e.G. Redis, Valkey
  • Build ingestion pipelines to load data into in-memory caching database.
  • Build data pipelines to load and export data to / from the Master Data Management or Reference Data Management Platform using PySpark
  • Hands on experience doing at least one MDM implementation using TIBCO, Informatica, GoldenSource or custom-built MDM / RDM platforms
  • Experience with designing / building Data APIs and its interaction with data consumers
  • Optimize performance of the built Spark applications using configurations around Spark Context, Spark-SQL and Data Frame.
  • Build Python programs using module libs e.G. pandas, requests, json, flask, pickle
  • Experience in processing large amounts of structured data, including integrating data from multiple sources.
  • Ability to design & build real-time applications using REST API, JSON, XML
  • Ability to build solutions on AWS services using Glue ETL, Lambda functions on Python
  • Create and maintain integration and regression testing framework on Jenkins integrated with BitBucket and / or GIT repositories
  • Participate in the agile development process, and document and communicate issues and bugs relative to data standards in scrum meetings
  • Work collaboratively with onsite and offshore team.
  • Develop & review technical documentation for artifacts delivered.
  • Ability to solve complex data-driven scenarios and triage towards defects and production issues
  • Participate in code release and production deployment.
  • Challenge and inspire team members to achieve business results in a fast paced and quickly changing environment

Requirements

  • BE / B.Tech / B.Sc. in Computer Science / Statistics, Econometrics from an accredited college or university.
  • Minimum 4 years of extensive experience in design, build and deployment of PySpark-based applications.
  • Hands-on experience in Redis, Valkey, OpenSearch database
  • In-depth knowledge of Python core programming language – lists, dictionaries, tuples
  • Expertise on Python libraries e.G. pandas, requests, json, flask, pickle
  • Good understanding on Informatica PowerCenter for building data pipelines
  • Understanding of Master Data Management processes & domain is preferred.
  • Hands-on experience writing complex SQL queries, exporting, and importing large amounts of data using utilities.
  • Ability to build abstracted, modularized reusable code components.
  • Hands-on experience in generating / parsing XML, JSON documents, and REST API request / responses
  • Hands-on experience in Redis, Valkey, OpenSearch database
  • Able to quickly adapt and learn.
  • Able to jump into an ambiguous situation and take the lead on resolution.
  • Able to communicate and coordinate across various teams.
  • Are comfortable tackling new challenges and new ways of working
  • Are ready to move from traditional methods and adapt into agile ones
  • Comfortable challenging your peers and leadership team.
  • Can prove yourself quickly and decisively.
  • Excellent communication skills and Good Customer Centricity.
  • Strong Target & High Solution Orientation.
  • Create a job alert for this search

    Data Pipeline Engineer • Haryāna, Republic Of India, IN

    Related jobs
    • Promoted
    • New!
    Lead GCP Data Engineer

    Lead GCP Data Engineer

    Impetusharyana, haryana, in
    Lead Data Engineer – GCP (BigQuery • Composer • Python • PySpark).You will lead the design, build and operation of large-scale data platforms on the Google Cloud Platform.You will manage a team of ...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    EXLharyana, haryana, in
    We are looking for a Python & PySpark developer and data engineer who can design and build solutions for one of our Fortune 500 Client programs in the realm of Financial Master & Reference Data Man...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    KKRharyana, haryana, in
    KKR aims to generate attractive investment returns by following a patient and disciplined investment approach, employing world-class people, and supporting growth in its portfolio companies and com...Show moreLast updated: 9 hours ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    Azilen Technologiesharyana, haryana, in
    Manage and create design schema, SQL query tuning, and code review.Min 4+ years of professional experience in the field of data engineering with knowledge of the data platform and DWH development.D...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Senior Data Engineer

    Senior Data Engineer

    KOGTA FINANCIAL (INDIA) LIMITEDharyana, haryana, in
    ETL & Data Warehouse Developer.As a key member of our data engineering team, you will be responsible for designing, developing, and optimizing ETL pipelines and scalable data warehouse solutions on...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    GCP Data Engineer

    GCP Data Engineer

    Impetusharyana, haryana, in
    Design, build, and maintain large-scale data pipelines on BigQuery and other Google Cloud Platform (GCP) services.Use Python and PySpark / Spark to transform, clean, aggregate and prepare data for an...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Lead Data Engineer

    Lead Data Engineer

    MakeMyTripharyana, haryana, in
    At MakeMyTrip (MMT), technology is at the heart of everything we do.As a leading player in the travel industry, we leverage cutting-edge solutions like AI, machine learning, and cloud infrastructur...Show moreLast updated: 9 hours ago
    • Promoted
    • New!
    Data Pipeline Engineer

    Data Pipeline Engineer

    GMGHaryāna, Republic Of India, IN
    GMG is a global well-being company retailing, distributing and manufacturing a portfolio of leading international and home-grown brands across sport, food and health sectors.Its vision is to inspir...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Senior Data Engineer

    Senior Data Engineer

    ManpowerGroup Indiaharyana, haryana, in
    ADF, ETL and SSIS, Python and SQL including data warehousing concepts and data warehousing principles.Proficiency in ETL tools and SQL for data extraction, transformation, and loading.Experience pe...Show moreLast updated: 5 hours ago
    • Promoted
    • New!
    Principal Data Pipeline Engineer

    Principal Data Pipeline Engineer

    Antal InternationalHaryāna, Republic Of India, IN
    Udaipur 3 months then Gurugram, India.You’ll lead the development of high-performance data pipelines, ensure data integrity, and guide the engineering team with best practices and cutting-edge tech...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    AI Data Engineer - 17852

    AI Data Engineer - 17852

    Turingharyana, haryana, in
    We’re looking for experienced AI data engineers skilled in Python to collaborate with one of the world’s top Large Language Model (LLM) companies. Your work will directly help improve how AI models ...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Data Pipeline Engineer

    Data Pipeline Engineer

    Sirius AIHaryāna, Republic Of India, IN
    Sirius AI is a US headquartered AI Consulting services and products company with operations in India.Sirius AI focuses on Financial Services enterprises and solutions / services delivered across mu...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    IGT Solutionsharyana, haryana, in
    We are seeking a highly skilled.Databricks, PySpark, ETL development, and SQL.The ideal candidate will be responsible for designing, developing, and optimizing scalable data pipelines and analytics...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Senior Data Pipeline Engineer

    Senior Data Pipeline Engineer

    ImpetusHaryāna, Republic Of India, IN
    Lead Data Engineer – GCP (BigQuery - Composer - Python - PySpark).You will lead the design, build and operation of large-scale data platforms on the Google Cloud Platform.You will manage a te...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    Terra Technology Circle Consulting Private Limitedharyana, haryana, in
    We are seeking a highly skilled and motivated.In this role, you will design, build, and optimize scalable data pipelines and architectures to support analytics, machine learning, and business intel...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Principal Data Pipeline Engineer

    Principal Data Pipeline Engineer

    Pacific Data IntegratorsHaryāna, Republic Of India, IN
    Shift time : Open to work in EST shift (5PM to 2AM IST).Lead the design, development, and implementation of complex data integration solutions using Informatica Intelligent Data Management Cloud (ID...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Sr. Data Engineer II

    Sr. Data Engineer II

    Antal Internationalharyana, haryana, in
    Udaipur 3 months then Gurugram, India.You’ll lead the development of high-performance data pipelines, ensure data integrity, and guide the engineering team with best practices and cutting-edge tech...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    NABharyana, haryana, in
    Proficient in executing ETL processes between data environments.Proficient at coding in SQL (additional knowledge in Python and R is highly regarded). Proficient at creating analytics related docume...Show moreLast updated: 17 hours ago