What You'll Do
- Design, develop, and code Hadoop applications to process and analyze large data collections.
- Create and maintain scalable data processing frameworks for various business needs.
- Extract, transform, and load (ETL) data and isolate data clusters for analysis.
- Test Hadoop scripts and analyze results to ensure data accuracy and system reliability.
- Troubleshoot and resolve application bugs and performance issues.
- Maintain the security, integrity, and confidentiality of company data.
- Develop and implement data tracking programs to monitor workflows and outputs.
- Produce and maintain Hadoop development documentation for internal teams and stakeholders.
Skills Required
Java, Python, Etl, Data Analysis, Debugging, Spark