QA Automation Engineer
Job Summary :
We are seeking a skilled QA Automation Engineer to join our data engineering team. The ideal candidate will be responsible for designing, developing, and maintaining automated test frameworks for complex data pipelines that ingest, transform, and load data into Google BigQuery. These pipelines process large-scale data from file-based sources and produce flattened outputs for downstream systems.
Key Responsibilities :
- Design and implement automated test strategies for end-to-end data pipelines, including file ingestion, transformation, and output validation.
- Develop robust test automation scripts using Python, SQL, and data validation frameworks (e.g., Great Expectations, dbt tests).
- Validate data integrity, schema conformance, and business logic across multiple pipeline stages.
- Create and manage test datasets to simulate real-world telecom data scenarios
- Collaborate with data engineers and business analysts to ensure comprehensive test coverage and early defect detection.
- Collaborate with the QA Lead and development team to define, develop, and maintain the overall testing strategy, including test plans and test data management.
- Monitor and troubleshoot test failures, data anomalies, and pipeline issues.
- Maintain documentation for test cases, test plans, and test results.
Required Skills & Experience :
6+ years of experience in QA automation, preferably in data engineering environments.Understanding of data modeling, ETL / ELT processes, and data quality best practices.Strong proficiency in SQL, especially with complex joins, window functions, and data aggregation.Strong proficiency in writing functions / Stored Procedures and Scheduled jobs.Experience with file-based data ingestion (CSV, JSON, Avro, Parquet) and validation.Proficiency in Python for scripting and automation.Optional : Familiarity with data testing frameworks like Great Expectations, dbt tests, or custom Python-based validators.Optional : Hands-on experience with Google BigQuery and GCP services (e.g., Cloud Storage, Dataflow, Pub / Sub).Good written and verbal communication skills.Strong analytical and problem-solving abilities with meticulous attention to detail.