Key Responsibilities :
Data Architecture & Strategy :
- Define and maintain the organizations data architecture blueprint, including data lakes, data warehouses, and streaming platforms.
- Develop data models, schemas, and integration patterns that support business intelligence, analytics, and AI / ML initiatives.
- Lead data modernization and migration initiatives (e.g., on-prem to cloud).
- Align data architecture with enterprise IT strategies and business goals.
Data Engineering & Platform Design :
Oversee the design and implementation of ETL / ELT pipelines for ingesting, transforming, and storing structured, semi-structured, and unstructured data.Ensure platforms support real-time, near-real-time, and batch data processing.Define standards for data APIs, microservices, and integration with enterprise applications.Governance & Compliance :
Establish data governance frameworks covering quality, security, privacy, and compliance (GDPR, HIPAA, etc.).Define and enforce data lineage, cataloging, and stewardship practices.Work with security teams to implement role-based access controls, encryption, and secure data sharing mechanisms.Collaboration & Stakeholder Engagement :
Partner with business leaders, product managers, and analytics teams to translaterequirements into technical data solutions.
Work closely with data engineers, BI developers, and data scientists to enable advanced analytics and AI adoption.Act as a subject matter expert on enterprise data architecture and best practices.Performance & Optimization :
Ensure databases, warehouses, and data lakes are optimized for scalability, reliability, and Monitor data platform performance, troubleshoot bottlenecks, and recommendimprovements.
Evaluate and recommend emerging technologies in data management, AI-readiness, and cloud computing.Required Technical Skills :
Data Modeling & Warehousing : Dimensional modeling, data lakes, data marts, star / snowflake schema design.Databases : Strong knowledge of SQL and experience with RDBMS (Oracle, PostgreSQL,MySQL) and NoSQL (MongoDB, Cassandra, DynamoDB).
Big Data & Processing : Hadoop, Spark, Databricks, Flink, Kafka.Cloud Platforms : AWS (Redshift, Glue, S3, Lake Formation), Azure (Synapse, Data Factory,ADLS), GCP (BigQuery, Dataflow, Pub / Sub).
ETL / ELT Tools : Informatica, Talend, Apache NiFi, dbt, Airflow.BI & Analytics : Power BI, Tableau, Looker, Qlik (for enabling reporting and visualization).Programming / Scripting : SQL, Python, PySpark, Scala, Shell scripting.Data Governance Tools : Collibra, Alation, Apache Atlas, Informatica Data Governance.APIs & Integration : Experience with REST, GraphQL, and event-driven architectures(ref : hirist.tech)