Overview :
We are looking for a highly skilled Python Data Engineer to join our team in an on-premise data engineering environment. The ideal candidate will have experience with ETL tools, data processing technologies, data orchestration, and relational databases. You should also be proficient in Python scripting for data engineering tasks and have hands-on experience with Spark, PySpark, and related data technologies. Cloud tools are nice to have, but this position primarily focuses on on-premise data infrastructure.
This is an excellent opportunity to work on exciting projects that involve building scalable data pipelines, implementing real-time data streaming, and optimizing data processing tasks using Python.
Key Responsibilities :
- ETL Development & Optimization : Design, develop, and optimize ETL pipelines using open-source or cloud ETL tools (e.g., Apache NiFi, Talend, Pentaho, Airflow, AWS Glue).
- Python Scripting for Data Engineering : Write Python scripts to automate data extraction, transformation, and loading (ETL) processes. Ensure that the code is optimized for performance and scalability (a brief illustrative sketch follows this list).
- Big Data Processing : Work with Apache Spark and PySpark to process large datasets in a distributed computing environment. Optimize Spark jobs for performance and resource efficiency (see the PySpark sketch below the list).
- Job Orchestration : Use Apache Airflow or other orchestration tools to schedule, monitor, and automate data pipeline workflows (see the Airflow sketch below).
- Data Streaming : Design and implement real-time data streaming solutions using technologies like Apache Kafka or AWS Kinesis for high-throughput, low-latency data processing (see the Kafka sketch below).
- File Formats & Table Formats : Work with open-source file and table formats such as Apache Parquet, Apache Avro, and Delta Lake, as well as other structured and unstructured data formats, for efficient data storage and access (see the Parquet sketch below).
- Database Management : Work with relational databases (e.g., PostgreSQL, MySQL, SQL Server) for data storage, management, and optimization. Understand database concepts such as normalization, indexing, and query optimization.
- SQL Expertise : Write and optimize complex SQL queries for data extraction, transformation, and aggregation across large datasets. Ensure queries are efficient and scalable.
- BI & Data Warehouse Knowledge : Exposure to BI tools and data warehousing concepts is a plus, as it helps ensure data is structured in a way that supports analytics and reporting.
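
For illustration, below is a minimal sketch of the kind of Python ETL scripting this role involves. It assumes pandas and SQLAlchemy; the file path, table name, and connection string are placeholders rather than project specifics.

```python
# Minimal ETL sketch: extract a CSV, clean it, and load it into PostgreSQL.
# All names (orders.csv, analytics.orders, the DSN) are placeholder assumptions.
import pandas as pd
from sqlalchemy import create_engine

def run_etl(csv_path: str = "orders.csv",
            dsn: str = "postgresql+psycopg2://user:password@localhost:5432/warehouse") -> None:
    engine = create_engine(dsn)
    # Extract: read the raw file in chunks so large inputs stay memory-friendly.
    for chunk in pd.read_csv(csv_path, chunksize=100_000):
        # Transform: normalise column names, drop incomplete rows, fix types.
        chunk.columns = [c.strip().lower() for c in chunk.columns]
        chunk = chunk.dropna(subset=["order_id", "amount"])
        chunk["amount"] = chunk["amount"].astype(float)
        # Load: append the cleaned chunk into the target table.
        chunk.to_sql("orders", engine, schema="analytics",
                     if_exists="append", index=False)

if __name__ == "__main__":
    run_etl()
```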
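
A similarly minimal PySpark sketch, assuming a Parquet input at a placeholder path and illustrative column names, showing a distributed aggregation written back out as partitioned Parquet:

```python
# Minimal PySpark sketch: read Parquet, aggregate, write partitioned output.
# Input/output paths and column names are placeholder assumptions.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = (SparkSession.builder
         .appName("daily_sales_rollup")
         .getOrCreate())

events = spark.read.parquet("/data/raw/events")   # assumed input path

daily = (events
         .filter(F.col("event_type") == "purchase")
         .groupBy("event_date", "product_id")
         .agg(F.sum("amount").alias("revenue"),
              F.count("*").alias("orders")))

# Repartition before writing so output files stay a reasonable size.
(daily.repartition("event_date")
      .write.mode("overwrite")
      .partitionBy("event_date")
      .parquet("/data/curated/daily_sales"))

spark.stop()
```

Partitioning the output by a low-cardinality column such as the event date is one common way to keep downstream reads and Spark job resource usage efficient.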
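
A bare-bones Airflow sketch, assuming Airflow 2.x; the DAG id, schedule, and task callables are placeholders meant only to show how a pipeline can be scheduled and ordered:

```python
# Minimal Airflow DAG sketch: one extract task feeding one load task.
# dag_id, schedule, and the callables are illustrative placeholders.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

def extract():
    print("pull data from the source system")

def load():
    print("write the transformed data to the warehouse")

with DAG(
    dag_id="example_daily_pipeline",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    load_task = PythonOperator(task_id="load", python_callable=load)

    extract_task >> load_task   # run extract before load
```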
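
A short streaming sketch using the kafka-python client (the library choice, topic name, and broker address are assumptions), showing a consumer loop over a JSON topic:

```python
# Minimal Kafka streaming sketch using the kafka-python client (library choice
# is an assumption); topic name and broker address are placeholders.
import json
from kafka import KafkaConsumer

consumer = KafkaConsumer(
    "orders",                                  # assumed topic
    bootstrap_servers=["localhost:9092"],      # assumed broker
    group_id="order-processors",
    auto_offset_reset="earliest",
    value_deserializer=lambda raw: json.loads(raw.decode("utf-8")),
)

for message in consumer:
    order = message.value
    # A real pipeline would validate, enrich, and forward the event here;
    # this sketch just prints the payload.
    print(f"partition={message.partition} offset={message.offset} order={order}")
```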
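
Finally, a brief sketch of working with Parquet through pandas and pyarrow; the paths, columns, and partitioning choice are illustrative assumptions:

```python
# Minimal Parquet sketch with pandas + pyarrow: write a partitioned dataset
# and read back a column subset. Paths, columns, and values are placeholders.
import pandas as pd

df = pd.DataFrame({
    "event_date": ["2024-01-01", "2024-01-01", "2024-01-02"],
    "product_id": [101, 102, 101],
    "amount": [9.99, 24.50, 9.99],
})

# Columnar storage: partition by a low-cardinality column for pruning on read.
df.to_parquet("events_parquet", engine="pyarrow", partition_cols=["event_date"])

# Reading only the needed columns avoids scanning the whole dataset.
subset = pd.read_parquet("events_parquet", columns=["product_id", "amount"])
print(subset.head())
```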
Required Skills & Experience :
- ETL Tools : Experience working with open-source ETL tools such as Apache NiFi, Talend, or Pentaho. Cloud-based tools like AWS Glue or Azure Data Factory are good to have.
- Python Scripting : Proficiency in Python for automating data processing tasks, writing data pipelines, and working with libraries such as Pandas, Dask, PySpark, etc.
- Big Data Technologies : Experience with Apache Spark and PySpark for distributed data processing, along with optimization techniques.
- Data Orchestration : Experience using Apache Airflow or similar tools for scheduling and automating data pipelines.
- Data Streaming : Experience with Apache Kafka or AWS Kinesis for building and managing real-time data pipelines.
- Open-Source File Formats : Knowledge of Apache Parquet, Apache Avro, Delta Lake, or similar open-source table formats for efficient data storage and retrieval.
- Relational Databases : Strong experience with at least one relational database (e.g., PostgreSQL, MySQL, SQL Server) and a solid understanding of database concepts like indexing, normalization, and query optimization.
- SQL Expertise : Strong skills in writing and optimizing complex SQL queries for data extraction, transformation, and aggregation.
Nice to Have :
- BI / Analytics Tools : Familiarity with BI tools like Power BI, Tableau, Looker, or similar reporting and data visualization platforms.
- Data Warehousing : Knowledge of data warehousing principles, schema design (e.g., star / snowflake), and optimization techniques for large datasets.
- Cloud Technologies : Experience with cloud data platforms like Databricks, Snowflake, or Azure Synapse is beneficial, though the role is focused on on-prem environments.
- Containerization : Familiarity with containerization tools like Docker or Kubernetes for deploying data engineering workloads.
Educational Qualifications :
- Bachelor’s or Master’s degree in Computer Science, Engineering, Information Systems, or a related field (or equivalent work experience).
Additional Qualities :
- Excellent problem-solving and troubleshooting skills.
- Ability to work both independently and in a collaborative environment.
- Strong communication skills, both written and verbal.
- Detail-oriented with a focus on data quality and performance optimization.
- Proactive attitude and the ability to take ownership of projects.