Genpact (NYSE : G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose - the relentless pursuit of a world that works better for people - we serve and transform leading enterprises, including the Fortune Global 500, with our deep business and industry knowledge, digital operations services, and expertise in data, technology, and AI.
Inviting applications for the role of Principal Consultant- AWS Developer
We are seeking an experienced Developer with expertise in AWS-based big data solutions, particularly leveraging Apache Spark on AWS EMR, along with strong backend development skills in Java and Spring. The ideal candidate will also possess a solid background in data warehousing, ETL pipelines, and large-scale data processing systems..
Responsibilities
- Design and implement scalable data processing solutions using Apache Spark on AWS EMR.
- Develop microservices and backend components using Java and the Spring framework.
- Build, optimize, and maintain ETL pipelines for structured and unstructured data.
- Integrate data pipelines with AWS services such as S3, Lambda, Glue, Redshift, and Athena.
- Collaborate with data architects, analysts, and DevOps teams to support data warehousing initiatives.
- Write efficient, reusable, and reliable code following best practices.
- Ensure data quality, governance, and lineage across the architecture.
- Troubleshoot and optimize Spark jobs and cloud-based processing workflows.
- Participate in code reviews, testing, and deployments in Agile environments.
Qualifications we seek in you!
Minimum Qualifications
Bachelor’s degreePreferred Qualifications / Skills
Strong experience with Apache Spark and AWS EMR in production environments.Solid understanding of AWS ecosystem, including services like S3, Lambda, Glue, Redshift, and CloudWatch.Proven experience in designing and managing large-scale data warehousing systems.Expertise in building and maintaining ETL pipelines and data transformation workflows.Strong SQL skills and familiarity with performance tuning for analytical queries.Experience working in Agile development environments using tools such as Git, JIRA, and CI / CD pipelines.Familiarity with data modeling concepts and tools (e.g., Star Schema, Snowflake Schema).Knowledge of data governance tools and metadata management.Experience with containerization (Docker, Kubernetes) and serverless architectures.