ETL Developer + Cloudera
Job Summary:
The Senior Informatica and Cloudera Developer (BDM) is responsible for designing, developing, and implementing advanced data integration and big data solutions within on-premises Cloudera environments. This role leverages Informatica PowerCenter, Informatica Big Data Management (BDM), and Cloudera components (HDFS, Hive, Spark, Impala) to deliver optimized, scalable, and secure data workflows. The developer ensures efficient data processing, high data quality, and seamless integration across enterprise systems to support business objectives.
Key Responsibilities:
- Design, develop, and optimize ETL processes using Informatica PowerCenter and Informatica BDM within on-prem Cloudera environments.
- Implement data integration, transformation, and migration solutions across heterogeneous data sources, including relational databases, flat files, and Hadoop-based systems.
- Configure and manage Cloudera ecosystem components such as HDFS, Hive, Spark, Impala, and Oozie, integrating them with Informatica BDM workflows.
- Develop, deploy, and maintain BDM applications, mappings, and workflows for both batch and real-time data processing.
- Build and optimize Spark-based data pipelines leveraging Cloudera's distributed processing capabilities.
- Create reusable ETL frameworks and templates to standardize and accelerate data integration processes.
- Perform performance tuning and troubleshooting of Informatica and Spark jobs to ensure reliability and efficiency.
- Implement and maintain data quality, validation, and metadata management practices across environments.
- Collaborate with data architects, analysts, and business users to translate business requirements into scalable technical solutions.
- Manage workflow scheduling, monitoring, and incident resolution for production pipelines.
- Ensure security, governance, and compliance in handling on-prem enterprise data.
- Mentor junior developers and promote adherence to best practices, development standards, and performance guidelines.
Skills Required:
Metadata Management, Performance Tuning, Data Integration, Informatica PowerCenter