JOB DESCRIPTION
Req ID :
We are currently seeking a AWS Sr Data Engineer to join our team in Bangalore or Remote, Karnātaka (IN-KA), India (IN).
Position Location :
OUS with minimum of 6 hrs. overlap with US timings.
Project Overview (If Possible) :
It’s one of the workstreams of Project Acuity. PASD Data Platform includes centralized web application for internal PASD users across the Recruitment Business to support marketing and operational use cases. Building a database at the patient level will provide significant benefit to PASD’s future reporting capabilities and engagement of external stakeholders.
Role Scope / Deliverables
As a Sr Data Engineer on the AWS Cloud team, you will be responsible for providing design and development of data ingestion pipelines from disparate data sources into the cloud. You will lead the delivery of the data products, leveraging Cloud Native strategies and best practices, drawing from 15+ years of IT experience.
Must have Skills
1. 15 years of experience in design and delivery of Distributed Systems capable of handling petabytes of data in a distributed environment.
2. 10 years of experience in the development of Data Lakes with Data Ingestion from disparate data sources, including relational databases, flat files, APIs, and streaming data.
3. Experience in providing Design and development of Data Platforms and data ingestion from disparate data sources into the cloud.
4. Expertise in core AWS Services including AWS IAM, VPC, EC2, EKS / ECS, S3, RDS, DMS, Lambda, CloudWatch, CloudFormation, CloudTrail, CloudWatch.
5. Proficiency in programming languages like Python and PySpark to ensure efficient data processing. preferably Python.
6. Architect and implement robust ETL pipelines using AWS Glue, defining data extraction methods, transformation logic, and data loading procedures across different data sources
7. 15 years of Experience in using IaC tools like Terraform etc.
8. 10 years of experience in development of CI / CD pipelines (GitHub Actions, Jenkins).
9. Experience in the development of Event-Driven Distributed Systems in the Cloud using Serverless Architecture.
10. Ability to work with Infrastructure team for AWS service provisioning for databases, services, network design, IAM roles and AWS cluster.
11. 2-3 years of experience working with Document DB.
12. Ability to design, orchestrate and schedule jobs using Airflow.
13. Knowledge of AWS AI Services like AWS Entity Resolution, AWS Comprehend.
14. Ability to run custom LLMs using Amazon SageMaker.
15. Ability to use Large Language Models (LLMs) for Data Classification and Identification of PII data entities
Nice to have Skills :
1. 10 years of experience in the development of Data Audit, Compliance and Retention standards for Data Governance, and automation of the governance processes.
2. Experience in data modelling with NoSQL Databases like Document DB.
3. Experience in using column-oriented data file format like Apache Parquet, and Apache Iceberg as the table format for analytical datasets.
4. Expertise in development of Retrieval-Augmented Generation (RAG) and Agentic Workflows for providing context to LLMs based on proprietary enterprise data.
5. Ability to develop re-ranking strategies using results from Index and Vector stores for LLMs to improve the quality of the output.
About NTT DATA
NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at
Aws Data Engineer • bangalore, India