Description :
Experience : Level 3 : 6-8 years of experience
Location : Hyderabad
Skill : Python, Spark, HDFS, MongoDB
About the Role :
We are seeking a highly skilled Data Engineer to join our team to design, build, and optimize scalable data pipelines and platforms.
The ideal candidate will have hands-on experience with Python, Spark, HDFS, and MongoDB, and a proven ability to work with large-scale datasets in a distributed environment.
Responsibilities :
- Design, develop, and maintain end-to-end data pipelines for batch and real-time processing.
- Work with Apache Spark to process and transform large datasets efficiently.
- Manage and optimize HDFS storage, ensuring data availability, reliability, and performance.
- Develop scripts and data orchestration workflows using Python.
- Build and maintain NoSQL data solutions using MongoDB, including data modeling and performance tuning.
- Collaborate with Data Scientists, Analysts, and Platform Engineering teams to deliver high-quality data solutions.
- Implement data quality, validation, and monitoring frameworks to ensure accuracy and consistency.
- Participate in design reviews, code reviews, and performance optimization initiatives.
- Contribute to the continuous improvement of data engineering standards and best practices.
Required Skills & Qualifications :
- Bachelor's or Master's degree in Computer Science, Information Technology, Data Engineering, or a related field.
- 3+ years of hands-on experience in Data Engineering or a related domain.
- Strong proficiency in Python programming for data processing and automation.
- Expertise in Apache Spark (PySpark preferred) for large-scale data processing.
- Solid experience with HDFS (Hadoop Distributed File System) and distributed data architecture.
- Hands-on experience with MongoDB, including schema design, queries, and performance optimization.
- Good understanding of ETL concepts, data warehousing, and data modeling.
- Proficiency in Linux / Unix environments and shell scripting.
- Experience with version control tools such as Git.
Good to Have (Optional) :
- Experience with workflow orchestration tools (Airflow, Luigi, Oozie, etc.)
- Knowledge of cloud platforms (AWS, Azure, GCP) and cloud-native data services
- Exposure to CI / CD and DevOps practices for data engineering
- Experience with streaming systems (Kafka, Flink, etc.)
(ref : hirist.tech)