Big Data Engineer - Spark / HDFS

CGI Information Systems and Management Consultants • Hyderabad
23 hours ago
Job description

Description :

Experience : Level 3 : 6-8 years of experience

Location : Hyderabad

Skills : Python, Spark, HDFS, MongoDB

About the Role :

We are seeking a highly skilled Data Engineer to join our team to design, build, and optimize scalable data pipelines and platforms.

The ideal candidate will have hands-on experience with Python, Spark, HDFS, and MongoDB, and a proven ability to work with large-scale datasets in a distributed environment.

Responsibilities :

  • Design, develop, and maintain end-to-end data pipelines for batch and real-time processing.
  • Work with Apache Spark to process and transform large datasets efficiently.
  • Manage and optimize HDFS storage, ensuring data availability, reliability, and performance.
  • Develop scripts and data orchestration workflows using Python.
  • Build and maintain NoSQL data solutions using MongoDB, including data modeling and performance tuning.
  • Collaborate with Data Scientists, Analysts, and Platform Engineering teams to deliver high-quality data solutions.
  • Implement data quality, validation, and monitoring frameworks to ensure accuracy and consistency.
  • Participate in design reviews, code reviews, and performance optimization initiatives.
  • Contribute to the continuous improvement of data engineering standards and best practices.

Required Skills & Qualifications :

  • Bachelor's or Master's degree in Computer Science, Information Technology, Data Engineering, or a related field.
  • 3+ years of hands-on experience in Data Engineering or a related domain.
  • Strong proficiency in Python programming for data processing and automation.
  • Expertise in Apache Spark (PySpark preferred) for large-scale data processing.
  • Solid experience with HDFS (Hadoop Distributed File System) and distributed data architecture.
  • Hands-on experience with MongoDB including schema design, queries, and performance optimization.
  • Good understanding of ETL concepts, data warehousing, and data modeling.
  • Proficient in working with Linux / Unix environments and shell scripting.
  • Experience with version control tools like Git.

Good to Have (Optional) :

  • Experience with workflow orchestration tools (Airflow, Luigi, Oozie, etc.)
  • Knowledge of cloud platforms (AWS, Azure, GCP) and cloud-native data services
  • Exposure to CI / CD and DevOps practices for data engineering
  • Experience with streaming systems (Kafka, Flink, etc.)

(ref : hirist.tech)
