Qualification : Degree in Computer Science (or similar), alternatively well-founded professional experience in the desired field.
Experience Range : 3 to 5 Years.
Roles & Responsibilities :
As a Senior Data Engineer, you manage and develop the solutions in close alignment with various business and Spoke stakeholders.
You are responsible for the implementation of the IT governance with the Spokes Data Scientists, Data Analysts, and Business Analysts, when relevant.
Tasks :
- Create and manage data pipeline architecture for data ingestion, pipeline setup and data curation.
- Experience working with and creating cloud data solutions.
- Assemble large, complex data sets that meet functional / non-functional business requirements.
- Implement the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using Pyspark, SQL and AWS big Build analytics tools that use the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics.
- Manipulate data at scale : getting data in a ready-to-use state in close alignment with various business and Spoke stakeholders.
Must Have : Advanced knowledge :
ETLData Lake, Data Warehouse, RDS architectures knowledgePython, SQL (Any other OOP language is also valuable)Pyspark (preferably) or Spark KnowledgeObject-oriented programming, Clean Code and good documentation skillsAWS : S3, Athena, Lambda, Glue, IAM, SQS, EC2, Quicksight, and etcGitData Analysis & VisualizationOptional :
AWS CDK Cloud Development KitCI / CD knowledge(ref : hirist.tech)