Talent.com
This job offer is not available in your country.
Lead Data Engineer - Python / ETL / Scala

Lead Data Engineer - Python / ETL / Scala

Cynosure Corporate SolutionsChennai
19 days ago
Job description

About the Role :

We are seeking a highly skilled and experienced Senior / Lead Data Engineer to design, develop, and maintain scalable, reliable, and efficient data pipelines and ETL solutions. The role requires strong expertise across multi-cloud environments, modern data warehousing platforms, programming languages, and data orchestration tools. You will play a pivotal role in transforming raw data into actionable insights, ensuring data quality, and enabling analytics and reporting initiatives across the :

  • Design, build, and optimize complex ETL / ELT data pipelines using Python, PySpark, Scala, and advanced SQL.
  • Implement and manage ETL processes using Informatica PowerCenter, Databricks, AWS Glue, and Snowflake.
  • Develop and deploy scalable data solutions across AWS, Azure, GCP, and Microsoft Fabric using cloud-native services.
  • Manage and optimize databases, including Redshift, SQL Server, and AWS RDS.
  • Orchestrate and monitor data workflows with Apache Airflow to ensure reliable and timely delivery.
  • Implement streaming solutions with Apache Kafka and containerized services with Kubernetes.
  • Automate data workflows and system monitoring using Unix shell scripting.
  • Apply CI / CD practices to data pipelines and enforce Data Cleanroom principles for privacy-compliant collaboration.
  • Collaborate with BI / reporting teams to deliver optimized datasets for Tableau, Looker, and Power BI.
  • Troubleshoot and resolve performance issues in pipelines and database queries.
  • Maintain detailed technical documentation and collaborate closely with cross-functional teams.

Qualifications :

  • Bachelors or Masters degree in Computer Science, Engineering, Information Technology, or a related field.
  • Experience : 5+ years for Senior Data Engineer, 8+ years for Lead Data Engineer.
  • Languages : Proficiency in SQL, Python (including PySpark), Scala, and Unix Shell Scripting.
  • ETL Tools : Hands-on experience with Informatica PowerCenter, Databricks, and AWS Glue.
  • Data Warehousing : Expertise in Snowflake and Redshift.
  • Cloud Platforms : Strong exposure to at least two of AWS, Azure, and GCP; familiarity with Microsoft Fabric.
  • Databases : Solid knowledge of Redshift, SQL Server, and AWS RDS.
  • Orchestration : Proven experience with Apache Airflow.
  • Streaming & Containerization : Practical experience with Apache Kafka and Kubernetes.
  • Concepts : Working knowledge of CI / CD pipelines and Data Cleanroom practices.
  • Reporting Tools : Understanding of data provisioning for Tableau, Looker, or Power BI.
  • Strong problem-solving skills, communication ability, and a proactive approach to emerging technologies.
  • (ref : hirist.tech)

    Create a job alert for this search

    Lead Data Engineer • Chennai