Key Responsibilities :
- Python Development :
- Design, develop, and maintain Python scripts for data extraction, transformation, and automation.
- Write and optimize Python code to manipulate, clean, and process large datasets.
- Develop Python-based automation solutions to streamline data workflows and integrate systems.
- Build Python-based APIs or services for data exchange with other applications and systems.
- SQL Development :
- Write complex SQL queries to extract, manipulate, and load data from and to relational databases (e.g., SQL Server , Oracle , MySQL ).
- Optimize database queries for performance to handle large volumes of data.
- Create, modify, and maintain stored procedures , views , functions , and triggers to support data processes.
- Informatica ETL Development :
- Design, develop, and maintain ETL workflows using Informatica PowerCenter .
- Integrate data from multiple sources using Informatica mappings and transformations.
- Ensure efficient data loading and transformation through Informatica sessions and workflows.
- Monitor and troubleshoot ETL jobs, ensuring data quality and integrity.
- Data Integration and Automation :
- Collaborate with business analysts to understand data integration needs and implement solutions using Python , SQL , and Informatica .
- Develop automated data pipelines to move data between internal and external systems.
- Utilize Python and Informatica to integrate data from various sources such as databases , APIs , flat files , and cloud services .
- Performance Optimization :
- Optimize SQL queries and Informatica workflows to improve performance and reduce runtime.
- Ensure efficient data transformation processes that handle high-volume data without bottlenecks.
- Monitor performance and troubleshoot issues related to data loading, transformation, and processing.
- Testing and Documentation :
- Conduct unit testing, integration testing, and performance testing for Python scripts and ETL processes.
- Create and maintain documentation for Python code, SQL queries, and Informatica workflows.
- Ensure that data processes adhere to company standards, best practices, and compliance requirements.
- Collaboration and Support :
- Collaborate with cross-functional teams, including data engineers, data scientists, and business analysts, to deliver solutions.
- Provide support for data-related issues, including debugging and optimizing existing workflows and scripts.
- Participate in code reviews to ensure high-quality code and adherence to best practices.
Skills and Qualifications :
Mandatory Skills :
Proficiency in Python for data manipulation, automation, and integration.Strong knowledge of SQL , including the ability to write complex queries, stored procedures, and performance tuning.Hands-on experience with Informatica PowerCenter for ETL processes, including mapping, transformations, and session management.Experience in data integration from multiple sources such as databases, flat files, and APIs.Understanding of data quality best practices and techniques for ensuring clean and accurate data.Knowledge of data pipelines and the ability to automate the flow of data using Python and Informatica .Familiarity with version control tools like Git for source code management.Skills Required
Informatica, Python, Sql, Git, Etl