Talent.com
This job offer is not available in your country.
Lead Data Engineering

Lead Data Engineering

ANRGI TECH Pvt. Ltd.Bangalore North, KA, in
6 days ago
Job type
  • Quick Apply
Job description

Job Description

This role requires proficiency in developing data pipelines including coding and testing for ingesting wrangling transforming and joining data from various sources. The ideal candidate should be adept in ETL tools like Informatica Glue Databricks and DataProc with strong coding skills in Python PySpark and SQL. This position demands independence and proficiency across various data domains. Expertise in data warehousing solutions such as Snowflake BigQuery Lakehouse and Delta Lake is essential including the ability to calculate processing costs and address performance issues. A solid understanding of DevOps and infrastructure needs is also required.

Outcomes :

Act creatively to develop pipelines / applications by selecting appropriate technical options optimizing application development maintenance and performance through design patterns and reusing proven solutions. Support the Project Manager in day-to-day project execution and account for the developmental activities of others.

Interpret requirements create optimal architecture and design solutions in accordance with specifications.

Document and communicate milestones / stages for end-to-end delivery.

Code using best standards debug and test solutions to ensure best-in-class quality.

Tune performance of code and align it with the appropriate infrastructure understanding cost implications of licenses and infrastructure.

Create data schemas and models effectively.

Develop and manage data storage solutions including relational databases NoSQL databases Delta Lakes and data lakes.

Validate results with user representatives integrating the overall solution.

Influence and enhance customer satisfaction and employee engagement within project teams.

Measures of Outcomes :

TeamOne's Adherence to engineering processes and standards

TeamOne's Adherence to schedule / timelines

TeamOne's Adhere to SLAs where applicable

TeamOne's # of defects post delivery

TeamOne's # of non-compliance issues

TeamOne's Reduction of reoccurrence of known defects

TeamOne's Quickly turnaround production bugs

Completion of applicable technical / domain certifications

Completion of all mandatory training requirementst

Efficiency improvements in data pipelines (e.g. reduced resource consumption faster run times).

TeamOne's Average time to detect respond to and resolve pipeline failures or data issues.

TeamOne's Number of data security incidents or compliance breaches.

Outputs Expected : Code :

Develop data processing code with guidance

ensuring performance and scalability requirements are met.

Define coding standards

templates

and checklists.

Review code for team and peers.

Documentation :

Create / review templates

checklists

guidelines

and standards for design / process / development.

Create / review deliverable documents

including design documents

architecture documents

infra costing

business requirements

source-target mappings

test cases

and results.

Configure :

Define and govern the configuration management plan.

Ensure compliance from the team.

Test :

Review / create unit test cases

scenarios

and execution.

Review test plans and strategies created by the testing team.

Provide clarifications to the testing team.

Domain Relevance :

Advise data engineers on the design and development of features and components

leveraging a deeper understanding of business needs.

Learn more about the customer domain and identify opportunities to add value.

Complete relevant domain certifications.

Manage Project :

Support the Project Manager with project inputs.

Provide inputs on project plans or sprints as needed.

Manage the delivery of modules.

Manage Defects :

Perform defect root cause analysis (RCA) and mitigation.

Identify defect trends and implement proactive measures to improve quality.

Estimate :

Create and provide input for effort and size estimation

and plan resources for projects.

Manage Knowledge :

Consume and contribute to project-related documents

SharePoint

libraries

and client universities.

Review reusable documents created by the team.

Release :

Execute and monitor the release process.

Design :

Contribute to the creation of design (HLD

LLD

SAD) / architecture for applications

business components

and data models.

Interface with Customer :

Clarify requirements and provide guidance to the Development Team.

Present design options to customers.

Conduct product demos.

Collaborate closely with customer architects to finalize designs.

Manage Team :

Set FAST goals and provide feedback.

Understand team members' aspirations and provide guidance and opportunities.

Ensure team members are upskilled.

Engage the team in projects.

Proactively identify attrition risks and collaborate with BSE on retention measures.

Certifications :

Obtain relevant domain and technology certifications.

Skill Examples :

Proficiency in SQL Python or other programming languages used for data manipulation.

Experience with ETL tools such as Apache Airflow Talend Informatica AWS Glue Dataproc and Azure ADF.

Hands-on experience with cloud platforms like AWS Azure or Google Cloud particularly with data-related services (e.g. AWS Glue BigQuery).

Conduct tests on data pipelines and evaluate results against data quality and performance specifications.

Experience in performance tuning.

Experience in data warehouse design and cost improvements.

Apply and optimize data models for efficient storage retrieval and processing of large datasets.

Communicate and explain design / development aspects to customers.

Estimate time and resource requirements for developing / debugging features / components.

Participate in RFP responses and solutioning.

Mentor team members and guide them in relevant upskilling and certification.

Knowledge Examples : Knowledge Examples

Knowledge of various ETL services used by cloud providers including Apache PySpark AWS Glue GCP DataProc / Dataflow Azure ADF and ADLF.

Proficient in SQL for analytics and windowing functions.

Understanding of data schemas and models.

Familiarity with domain-related data.

Knowledge of data warehouse optimization techniques.

Understanding of data security concepts.

Awareness of patterns frameworks and automation practices.

Additional Comments :

Sr Data Engineer Position Location : OUS with minimum of 6 hrs. overlap with US timings. Must have Skills

  • 1. 15 years of experience in design and delivery of Distributed Systems capable of handling petabytes of data in a distributed environment. 2. 10 years of experience in the development of Data Lakes with Data Ingestion from disparate data sources, including relational databases, flat files, APIs, and streaming data. 3. Experience in providing Design and development of Data Platforms and data ingestion from disparate data sources into the cloud. 4. Expertise in core AWS Services including AWS IAM, VPC, EC2, EKS / ECS, S3, RDS, DMS, Lambda, CloudWatch, CloudFormation, CloudTrail, CloudWatch. 5. Proficiency in programming languages like Python and PySpark to ensure efficient data processing. preferably Python. 6. Architect and implement robust ETL pipelines using AWS Glue, defining data extraction methods, transformation logic, and data loading procedures across different data sources 7. 15 years of Experience in using IaC tools like Terraform etc. 8. 10 years of experience in development of CI / CD pipelines (GitHub Actions, Jenkins). 9. Experience in the development of Event-Driven Distributed Systems in the Cloud using Serverless Architecture. 10. Ability to work with Infrastructure team for AWS service provisioning for databases, services, network design, IAM roles and AWS cluster. 11. 2-3 years of experience working with Document DB. 12. Ability to design, orchestrate and schedule jobs using Airflow. 13. Knowledge of AWS AI Services like AWS Entity Resolution, AWS Comprehend. 14. Ability to run custom LLMs using Amazon SageMaker. 15. Ability to use Large Language Models (LLMs) for Data Classification and Identification of PII data entities Nice to have Skills : 1. 10 years of experience in the development of Data Audit, Compliance and Retention standards for Data Governance, and automation of the governance processes. 2. Experience in data modelling with NoSQL Databases like Document DB. 3. Experience in using column-oriented data file format like Apache Parquet, and Apache Iceberg as the table format for analytical datasets. 4. Expertise in development of Retrieval-Augmented Generation (RAG) and Agentic Workflows for providing context to LLMs based on proprietary enterprise data. 5. Ability to develop re-ranking strategies using results from Index and Vector stores for LLMs to improve the quality of the output.

Skills : Data Lake,AWS,Python

Requirements

AWS ,Python, Data Lake

Create a job alert for this search

Data Engineering Lead • Bangalore North, KA, in

Related jobs
  • Promoted
Senior Full Stack SDE with Data Engineering for Analytics

Senior Full Stack SDE with Data Engineering for Analytics

Truckmentumhosur, tamil nadu, in
Truckmentum is seeking a Senior Full Stack Software Development Engineer (SDE) with deep data engineering experience to help us build cutting-edge software and data infrastructure for our AI-driven...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Data Architect

Data Architect

G10Xhosur, tamil nadu, in
Senior Data Architect (15+ Years Experience).We are looking for a seasoned Data Architect to lead and shape enterprise-wide data initiatives. This role requires a strategic leader with deep technica...Show moreLast updated: 10 hours ago
  • Promoted
  • New!
Data Engineering Manager

Data Engineering Manager

iMerit Technologybangalore, karnataka, in
Merit is a leading AI data solutions company that transforms unstructured data into structured intelligence for advanced machine learning and analytics. Our customers span autonomous mobility, medic...Show moreLast updated: 10 hours ago
  • Promoted
ETL LEAD with IBM DataStage experienced

ETL LEAD with IBM DataStage experienced

IntraEdgehosur, tamil nadu, in
ETL Developer – DataStage, AWS, Snowflake.We are looking for a talented and motivated ETL Developer / Senior Developer.You will work on building scalable and efficient data pipelines using.IBM Data...Show moreLast updated: 30+ days ago
  • Promoted
Principal Data Engineer

Principal Data Engineer

Xebiahosur, tamil nadu, in
We’re Hiring : Principal Data Engineer | Any Xebia Location (Hybrid, 3 days in office per week).Any Xebia Location (Hybrid – 3 days in office per week). Data Engineering with 4+ years team leadership...Show moreLast updated: 21 days ago
  • Promoted
Senior Big Data Engineer

Senior Big Data Engineer

Veltrishosur, tamil nadu, in
Veltris is a Digital Product Engineering Services partner committed to driving technology-enabled transformation across enterprises, businesses, and industries. We specialize in delivering next-gene...Show moreLast updated: 14 days ago
  • Promoted
  • New!
Lead Data Engineer & ML Analyst

Lead Data Engineer & ML Analyst

Eltropyhosur, tamil nadu, in
We’re looking for someone with.Design and manage scalable ETL / ELT pipelines using AWS Glue, Redshift, S3, and Kafka / Kinesis. Architect and implement data lake and warehouse solutions following best ...Show moreLast updated: 10 hours ago
  • Promoted
Senior Data Engineer

Senior Data Engineer

Deltacubeshosur, tamil nadu, in
Build and maintain scalable ETL / ELT pipelines.Work with Snowflake and BigQuery for data storage.Implement orchestration with Airflow or Prefect. Integrate data workflows with Python.Optimize data pi...Show moreLast updated: 15 days ago
  • Promoted
Lead Azure Data Engineer

Lead Azure Data Engineer

RandomTreeshosur, tamil nadu, in
We’re a leading software company specializing in Artificial Intelligence, Machine Learning, Data Analytics, Innovative data solutions, Cloud-based technologies. If you're passionate about building r...Show moreLast updated: 26 days ago
  • Promoted
  • New!
Data Engineering Azure databricks

Data Engineering Azure databricks

EXLhosur, tamil nadu, in
Data Engineer (DE) Consultant is responsible for designing, developing, and maintaining data assets and data related products by liaising with multiple stakeholders. Work with stakeholders to unders...Show moreLast updated: 10 hours ago
  • Promoted
Data Engineer

Data Engineer

INFEC Serviceshosur, tamil nadu, in
Design, develop, and optimize data pipelines and ETL processes on GCP or Azure.Work with structured and unstructured data, integrating sources such as databases, APIs, and streaming platforms.Imple...Show moreLast updated: 4 days ago
  • Promoted
Senior Manager - Data Engineering Lead

Senior Manager - Data Engineering Lead

DIAGEO IndiaBengaluru, Karnataka, India
Senior Manager - Data Engineering Lead.Bachelor’s or master’s degree in computer science, Data Engineering, or related field. Experience in data engineering.Proven experience in cloud platforms (AWS...Show moreLast updated: 26 days ago
  • Promoted
Senior Engineering Manager-Big Data, Generative AI

Senior Engineering Manager-Big Data, Generative AI

Extreme Networkshosur, tamil nadu, in
Over 50,000 customers globally trust our end-to-end, cloud-driven networking solutions.They rely on our top-rated services and support to accelerate their digital transformation efforts and deliver...Show moreLast updated: 7 days ago
  • Promoted
  • New!
Lead Engineer

Lead Engineer

HCLTechhosur, tamil nadu, in
Architect efficient and reusable front-end systems to support complex interactions within Meta HW infrastructure.Develop full-stack web applications for internal infrastructure tooling using techno...Show moreLast updated: 10 hours ago
  • Promoted
Data Engineer

Data Engineer

Kanerika Inchosur, tamil nadu, in
Following are high level responsibilities that you will play but not limited to : .Analyze the Data Model and do GAP analysis with Business Requirements and Power BI. Design and Model Power BI schema....Show moreLast updated: 19 days ago
  • Promoted
Principal / Senior Data Architect

Principal / Senior Data Architect

Aayshosur, tamil nadu, in
Position : Principal / Senior Data Architect.You will act as a key member of the consulting team helping Clients to re-invent their corporate finance function by leveraging advanced analytics.You wil...Show moreLast updated: 14 days ago
  • Promoted
Senior Data Engineer

Senior Data Engineer

CEShosur, tamil nadu, in
As part of our data engineering team, you’ll build and maintain.This is a hands-on role where you’ll design and optimize modern data infrastructure, ensuring reliability, scalability, and performan...Show moreLast updated: 5 days ago
  • Promoted
Associate Architect - Data Engineering

Associate Architect - Data Engineering

Response Informaticshosur, tamil nadu, in
We are seeking an experienced Data Architect to lead the transformation of enterprise data.Alteryx workflows into Azure Databricks. Microsoft Azure ecosystem, including Azure.Data Factory, Databrick...Show moreLast updated: 26 days ago
  • Promoted
Lead Data Engineer

Lead Data Engineer

Eucloid Data Solutionshosur, tamil nadu, in
Eucloid is looking for a Lead Data Engineer to join our Data Platform team supporting various business applications.The ideal candidate will support development of data infrastructure on Databricks...Show moreLast updated: 7 days ago
  • Promoted
Data Engineer Team Lead

Data Engineer Team Lead

SGIhosur, tamil nadu, in
To be discussed based on your skills and experience.Strong hands-on data engineering experience with a proven ability to design, build, and optimize scalable data pipelines in .Deep technical exper...Show moreLast updated: 6 days ago