Job Role : AWS Data Location : Chennai / Bangalore / Hyderabad / Pune (Hybrid)
Key Skills : AWS Glue, RedShift, S3, Lambda Athena.
We are seeking a highly skilled AWS Data Engineer with 5 - 8 years of experience in designing, developing, and deploying cloud-based data solutions. The role involves building scalable ETL pipelines, data warehouses, and data processing workflows on AWS, ensuring data quality, integrity, and performance. The ideal candidate should have strong hands-on experience in AWS services, SQL, Python, and big data processing frameworks while collaborating with cross-functional teams to deliver enterprise-grade data solutions.
- Hands on experience in Data Engineer with AWS, Glue, Lambda, SQL, Python, Redshift.
- Must have working knowledge in designing and implementing data pipelines on any of the cloud providers (AWS is
preferred).
Must be able to work with large volumes of data coming from various sources. Perform data cleansing, data validation etc.Hands on ETL developer who is good at python, SQL.AWS services like glue, glue crawlers, lambda, red shift, athena, s3, EC2, IAM, Monitoring and Logging mechanisms, AWS cloudwatch, setting up alerts.Deployment knowledge on cloud.Integrate CI / CD pipeline to build artifacts and deploy changed to higher Environments.Scheduling frame works Airflow, AWS Step functions.Excellent Communication skills, should be able to work collaboratively with other teams.Key Responsibilities :
Design, implement, and optimize ETL / ELT pipelines for structured and unstructured data.Use AWS Glue, Glue Crawlers, Lambda, and Step Functions to build scalable workflows.Handle large-scale data ingestion from multiple sources into S3, Redshift, or Athena.Perform data cleansing, transformation, and validation to ensure high data quality.Develop and optimize SQL queries, stored procedures, and scripts for analytics and reporting.Work with Redshift, Athena, and other AWS data warehouse solutions for efficient querying.Deploy and manage data solutions using AWS EC2, S3, IAM, and security best practices.Integrate pipelines with CI / CD frameworks to automate deployments across environments.Implement monitoring and logging using CloudWatch, set up alerts for system health.(ref : hirist.tech)