Description : Job Summary :
We are seeking a highly skilled ETL Tester with strong expertise in SQL, data validation, and cloud-based ETL ecosystems. The ideal candidate will have hands-on experience in GCP data services, Airflow pipelines, BigQuery, and enterprise-level test management. This role requires deep technical proficiency, analytical thinking, and the ability to validate data flows across complex data ingestion, transformation, and consumption layers.
Key Responsibilities :
ETL & Data Pipeline Testing :
- Design, prepare, and execute detailed test cases for ETL workflows, including source-to-target data validation, transformation logic verification, and schema-level validations.
- Validate data ingestion, data transformation, and data loading processes across multiple GCP services such as Data Fusion, BigQuery, and GCS.
- Perform end-to-end testing of Airflow DAGs, ensuring task dependencies, scheduling logic, and pipeline reliability are functioning correctly.
- Verify data integrity, data quality, and consistency across multiple stages of the data pipeline.
SQL & Database Testing :
Write and execute complex SQL queries involving analytical functions, nested subqueries, CTEs, performance-tuned joins, and large dataset comparisons.Conduct database-level validations including referential integrity checks, data reconciliation, and aggregate-level validations.Optimize SQL queries for high performance and reliability in large-scale data environments.Cloud Platform & GCP Services :
Test cloud-native ETL workflows and data pipelines built on GCP.Validate data movement between GCS, Data Fusion pipelines, and BigQuery datasets.Work closely with data engineering teams to analyze Airflow DAG failures, pipeline errors, and performance bottlenecks.Support data quality checks and validation frameworks in cloud environments.Defect Management & Documentation :
Log, track, and manage defects using JIRA, ALM, or similar test management platforms.Work with developers and data engineers to triage, reproduce, and resolve data-related issues.Prepare comprehensive test reports, execution summaries, and requirement traceability matrices.Collaboration & Stakeholder Communication :
Work closely with cross-functional teams including Data Engineering, DevOps, and Business Analysts.Communicate effectively to clarify requirements, raise risks, and highlight data inconsistencies.Participate in sprint planning, reviews, and daily standups within an Agile framework.Required Skills & Expertise :
Must-Have Skills :
Strong expertise in SQL with ability to write and optimize complex queries.Deep understanding of ETL concepts, data quality checks, and data integrity validation.Hands-on testing experience with GCP-based data pipelines.Experience with ETL / data pipeline tools such as Airflow (DAGs), Data Fusion, BigQuery, and GCS.Proficiency with JIRA, ALM, or related testing and defect management tools.Good-to-Have Skills :
Experience with CI / CD integrations for test execution.Knowledge of Python for automation of data validations.Familiarity with version control (Git) and cloud monitoring tools.Understanding of data warehousing and dimensional modeling concepts.(ref : hirist.tech)