About the Role :
We are looking for a highly motivated and experienced GCP Data Engineer to join our team in Chennai.
In this role, you will play a critical part in designing and developing robust data engineering solutions on Google Cloud Platform (GCP).
You will be responsible for building, maintaining, and optimizing large-scale, cloud-based data pipelines, enabling efficient data ingestion, transformation, and analysis.
This role requires deep expertise in GCP, Python, SQL, and BigQuery, and a strong understanding of modern data architecture, ETL processes, and best practices for cloud-based data Responsibilities :
- Design and implement scalable, fault-tolerant, and high-performance data pipelines on GCP using BigQuery, Dataflow, Pub / Sub, and other GCP-native services.
- Develop data ingestion frameworks for both batch and streaming data from diverse sources.
- Design and optimize schemas in BigQuery for analytics and reporting purposes.
- Write efficient and reusable SQL queries for data transformation, aggregation, and reporting.
- Implement data quality checks, validation rules, and lineage tracking.
- Work with infrastructure-as-code tools (Terraform, Deployment Manager) for provisioning GCP resources.
- Automate deployment and monitoring of data pipelines using CI / CD tools (e.g., Cloud Build, Jenkins).
- Implement robust logging and alerting using tools like Cloud Monitoring and Cloud Logging.
- Collaborate with data scientists, analysts, and business stakeholders to gather data requirements and support analytics use cases.
- Maintain detailed documentation of pipelines, data flows, and system architecture.
- Advocate for data governance, security, and compliance best Skills & Qualifications Skills :
- Strong hands-on experience with Google Cloud Platform (GCP) services including :
1. BigQuery
2. Dataflow (Apache Beam)
3. Cloud Storage
4. Pub / Sub
5. Cloud Composer (Apache Airflow)
Proficiency in Python for data pipeline development, automation, and scripting.Advanced knowledge of SQL for data manipulation and analysis.Experience with data pipeline orchestration tools (e.g., Apache Airflow, Cloud Composer)ref : hirist.tech)