Requirements – JAVA, spring Boot, Restful, DS Algo, Experience in building, optimizing, and maintaining large data pipelines , building distributed data services and solutions.
Development, Testing, and deployment experience with GitHub, Docker / Kubernetes
Job Description :
- Strong expertise in Java and the Spring Framework, particularly Spring Boot.
- Good understanding of SQL oracle, SQL server or similar languages.
- Core Engineering : Data structures & algorithms, testing (Junit etc), Git, clean code.
- Experience with graph databases (e.g., Neo4j, JanusGraph) and graph query languages.
- Ops & CI / CD : Monitoring (Prometheus / Grafana), logging, pipelines (Jenkins / GitHub Actions).
- Cloud Native : Docker, Kubernetes (deploy, network, scale, troubleshoot).
- 6+ years of directly applicable experience
- BS in Computer Science, Engineering, or equivalent experience.
About the team –
Our Cloud Data Technologies team oversees data infrastructure and the management of the end-to-end data lifecycle.
Project and role –
As a Software Engineer on our team, you will work on our Hadoop-based data warehouse and cloud development environment, contributing to scalable and reliable big data solutions and services for analytics and business insights. This is a hands-on role focused on building, optimizing, and maintaining large data pipelines , building distributed data services and solutions.
Key Responsibilities
Design, develop, and maintain robust data solutions and services and related Hadoop ecosystems, ensuring data reliability, scalability, and performance.Optimize and troubleshoot distributed systems for ingestion, storage, and processing.Collaborate with data engineers, analysts, and platform engineers to align solutions with business needs.Ensure data security, integrity, and compliance throughout the infrastructure.Maintain documentation and contribute to architecture reviews.Participate in incident response and operational excellence initiatives for the data warehouse.Continuously learn and apply new Hadoop ecosystem tools and data technologies.