As a Data Engineer, you will be responsible for designing, developing and maintaining data assets and related products.
Strong knowledge of Python and PySpark is required. You should also have skills in SQL, Hadoop, Hive, Azure, Databricks and Greenplum.
Familiarity with big data technologies such as Hadoop and Spark, and with distributed computing frameworks in general, is essential. You will use Hue to run Hive SQL queries and schedule Apache Oozie jobs to automate data workflows.
Strong problem-solving and troubleshooting skills are expected. You will establish comprehensive data quality test cases and implement automated validation processes.
A degree in Data Science, Statistics, Computer Science or a related field is required, along with 4 to 7 years of experience as a Data Engineer.
Chief • Hyderabad, Telangana, India