Summary : We are seeking a Senior Data Engineer to build and optimize data systems that power batch processing, real-time streaming, pipeline orchestration, data lake management, and data cataloging. You will have the opportunity to use your expertise in solving big data problems and apply design thinking, coding and analytical skills to develop data pipelines that support marketing data products. We’re looking for talented Data Engineers passionate about building new data-driven solutions with the latest Big Data technology.
Technical Proficiency :
- 6+ years of experience and a bachelor’s degree in computer science or a related field, or equivalent work experience
- In-depth working experience of distributed systems Hadoop, Spark, Hive, DBT and Airflow / Dagster
- At least 4 years of solid production quality coding experience in data pipeline implementation in Python
- Experience working with public cloud platforms, preferably AWS
- Experience working with Databricks and / or Snowflake
- Experience in Git, JIRA, Jenkins, shell scripting
- Familiarity with Agile methodology, test-driven development, source control management and test automation
Role and Responsibilities :
You will build services that will integrate directly with products and with external vendors.You will work with modern data technologies such as Hadoop, Spark, DBT, Dagster / Airflow, Atlan, Trino, etc., modern data platforms such as Databricks and Snowflake and cloud technologies across AWS stackIdentify, design, and implement automation of manual processes, optimizing data delivery, reducing Cloud cost, etc.Implement processes and systems to monitor Data Quality, Observability, Governance and Lineage.Support operations to manage the production environment and help in resolving production issues with RCAWrite unit / integration tests, adopt Test-driven development, contribute to engineering wiki, and document design / implementation etc.Education : Bachelor’s degree in computer science, Software Engineering, MIS or equivalent combination of education and experience
Key Skills : SQL, Python, Hadoop, Spark, Hive, DBT and Airflow / Dagster, AWS