Sound knowledge of relational databases (SQL) and experience with large SQL-based systems.
Experience with Java and shell scripting
Excellent analytical capabilities and a strong interest in algorithms
Proven understanding of and experience with Cloudera Hadoop, Impala, Hive, Flume, HBase, Sqoop, Spark, and Kafka
Ability to analyze existing shell scripts and Python code to debug issues
Ability to identify, analyze, and resolve problems, whenever possible in a way that minimizes negative impact and risk to the organization
Ability to work closely with architects, developers, and testers to ensure that requirements and functional designs are translated accurately into working technical designs and that test plans and scripts serve customer needs.
Strong knowledge of performance tuning for Hadoop clusters, ecosystem components, and jobs, including the management and review of Hadoop log files.
In-depth understanding of data modeling
5+ years of experience in designing, developing, and maintaining big data applications.