Job Title: Data Quality Engineer
Experience: 4+ Years
Location: Pune
Job Summary:
We are seeking experienced ETL Testers to work as Data Quality Engineers (DQE), with strong programming experience in advanced SQL and Python (or Java). Candidates should have hands-on testing experience, especially in data validation and functional testing, and be comfortable working in CI/CD environments. Exposure to BDD and test automation tools (such as Cucumber and Selenium) and to modern software testing practices is expected.
Key Responsibilities:
- Develop and execute complex SQL queries for data validation, data profiling, and test case automation.
- Perform ETL testing on large-scale datasets to ensure data integrity across systems.
- Use Python or Java to build automation scripts for data quality checks.
- Write and maintain test scripts using BDD and test automation tools (Cucumber, Selenium) where applicable.
- Collaborate with Data Engineers and Developers on continuous integration and deployment (CI/CD) pipeline validations.
- Implement testing strategies for structured and unstructured data sources.
- Contribute to the identification and resolution of data quality issues and anomalies.
- Work with cross-functional teams to define test plans and data scenarios and to ensure full test coverage.
- Participate in code reviews and optimize SQL queries for performance improvements.
Required Technical Skills:
Must Have:
- Advanced SQL, with strong proficiency in:
  - Window functions (e.g., RANK, ROW_NUMBER, LEAD/LAG, used with PARTITION BY)
  - CTEs (recursive and multiple)
  - Subqueries (scalar, correlated, EXISTS / NOT EXISTS)
  - Analytical functions (CUME_DIST, NTILE, PERCENT_RANK)
  - Full-text search, hierarchical queries, and query optimization
- Programming skills: proficiency in Python or Java
- Strong functional testing background (manual + automation)
- Working knowledge of CI/CD pipelines
Good to Have:
- Exposure to Selenium, Cucumber, or any BDD framework
- Experience working with ETL pipelines, data lakes, or big data systems
- Knowledge of data governance or data profiling tools (e.g., Informatica DQ, Talend)
Soft Skills & Competencies:
- Strong analytical and problem-solving skills
- Ability to communicate technical concepts clearly and effectively
- Self-driven and able to work independently as well as part of a distributed team
- Quick learner with adaptability to new technologies and frameworks
Other Details:
- Candidates should be open to working in a hybrid model with 3 days in office.
- Immediate to 30-day joiners preferred.
- Flexibility to coordinate with distributed teams across locations.