Design, develop, and implement performant ETL pipelines using PySpark (the Python API for Apache Spark) on AWS EMR.
Write reusable, testable, and efficient code.
Integrate data storage solutions with Spark, especially AWS S3 object storage. Tune the performance of PySpark scripts.
Ensure high overall build delivery quality and on-time delivery at all times.
Handle customer meetings with ease.
Excellent communication skills for interacting with customers.
Be a team player, willing to work in an onsite-offshore model and to mentor other team members (onsite as well as offshore).
5+ years of programming experience in Python, with strong proficiency in the language.
Familiarity with functional programming concepts
3+ years of hands-on experience in developing ETL data pipelines using PySpark on AWS EMR
Experience in building pipelines and data lakes for large enterprises on AWS
Good understanding of Spark’s DataFrame API
Experience in configuring EMR clusters on AWS
Experience in dealing with AWS S3 object storage from Spark.
Experience in troubleshooting Spark jobs. Knowledge of monitoring Spark jobs using the Spark UI.
Performance tuning of Spark jobs.
Understanding of the fundamental design principles behind business processes
Process Knowledge and Expertise:
Demonstrated experience in change management processes, including understanding of governance frameworks and preparation of supporting artefacts required for approvals.
Strong clarity on the path to production, with hands-on involvement in deployments, testing cycles, and obtaining business sign-offs.
Proven track record in technical solution design, with the ability to provide architectural guidance and support implementation strategies.
Databricks-Specific Skills:
Experience in developing and delivering, at a minimum, end-to-end Proof of Concept (POC) solutions covering the following:
Basic proficiency in Databricks, including creating jobs and configuring clusters.
Exposure to connecting external data sources (e.g., Amazon S3) to Databricks.
Understanding of Unity Catalog and its role in data governance.
Familiarity with notebook orchestration and implementing modular code structures to enhance scalability and maintainability.
Important Pointers:
Candidates must have actual hands-on work experience, not just home projects or academic exercises.
Profiles should clearly state how much experience they have in each skill area, as this helps streamline the interview process.
Candidates must know their CV / profile inside out, including all projects and responsibilities listed. Any ambiguity or lack of clarity on the candidate’s part can lead to immediate rejection, as we value accuracy and ownership.
They should be able to confidently explain their past experience, challenges handled, and technical contributions.