Key Responsibilities:
- Design, implement, and execute manual and automated tests for data pipelines and ETL processes
- Perform data validation and ensure data quality across all layers of the pipeline (Bronze, Silver, Gold)
- Collaborate with data engineers and developers to understand pipeline architecture and contribute to test planning
- Perform performance testing and identify bottlenecks in the data pipeline
- Monitor and analyse test results, identify defects, and ensure timely issue resolution
- Develop and maintain test documentation including test cases, plans, and bug reports
- Participate in agile sprints and work closely with cross-functional teams to support project deadlines
Required Skills and Qualifications:
- Strong experience in QA / testing for data pipelines, ETL processes, and data validation
- Familiarity with SQL and experience in querying large datasets for validation purposes
- Experience with data warehousing and cloud-based data platforms (AWS preferred)
- Good communication and problem-solving skills, with attention to detail
Preferred Qualifications:
- Proficiency in SQL scripting and Apache Spark
- Knowledge of Python or other scripting languages for automating tests
- Familiarity with Agile / Scrum methodologies
- Experience with CI / CD and automation tools (e.g., Jenkins, Git)
Skills Required:
SQL, Python, Jenkins, Git, AWS, QA Testing