Excellent hands-on experience in AWS Data technologies Glue, S3, Athena, EMR, IAM etc.
- Mandatory - Hands on experience in Python and PySpark. Python as a language is practically usable for anything, we are looking for application Development and Extract / Transform / Load and Datalake curation experience using Python.
- Hands on experience in version control tools like Git.
- Worked on Amazon’s Analytics services like Amazon Athena, DynamoDB and AWS Glue
- Worked on Amazon’s Compute services like Amazon Lambda, Amazon EC2 and Amazon’s Storage service like S3 and few other services.
- Experience / knowledge of bash / shell scripting will be a plus.
- Has built ETL processes to take data, copy it, structurally transform it etc. involving a wide variety of formats like CSV, fixed width, XML and JSON.
- Have worked with columnar storage formats- Parquet, Avro, and ORC etc.
- Hands on experience in tools like Jenkins to build, test and deploy the applications.
- Excellent debugging skills.
- Ability to quickly perform critical analysis and use creative approaches for solving complex problems.
- Strong academic background.
- Excellent written and verbal communication skills, and strong relationship building skills'