Job Description
Oracle's AI Data Platform is accelerating enterprise AI and redefining how AI applications are built. The AI Data Platform team is seeking an experienced Principal Member of Technical Staff (IC4) to help lead development, delivery, and operations of a modern, scalable AI platform.
You'll be responsible for, and lead efforts in, designing and building scalable, distributed, and resilient software components and services to support our product. You've built and operated high-scale cloud infrastructure services and understand how to continuously improve the technology. You can balance feature delivery with iteration and incremental improvement.
This is a senior engineering position, and we expect you to develop and operate high-quality software systems and to lead and mentor other engineers along the way.
Responsibilities :
- Help drive next-generation AI capabilities, with a focus on microservices, using Oracle standard tools, technology and development practices along with leading open-source frameworks
- Drive complex areas and produce detailed designs for some components
- Work directly with architects to ensure new capabilities are built with the right design principles
- Work with remote and geographically distributed teams to develop and operate complex cloud systems including participation in on-call rotation and related operational activities
- Be very technically hands-on developing software and cloud infrastructure
- Own / drive key end-to-end product / services
- Be ready to learn and jump into new areas and domains
- Recruit and mentor other engineers to support overall org growth objectives
Qualifications :
- BS / MS / PhD in Computer Science or equivalent related fields
- Proven experience designing, implementing, and operating large scale cloud services
- Strong proficiency in Java, Python and related technologies
- Cloud development experience on one of the major platforms (Oracle, AWS, Azure, GCP)
- Solid understanding of networking concepts, security principles, and best practices
- Experience with containerization technologies (e.g., Docker, Kubernetes) and orchestration tools for managing distributed systems
- Familiarity with DevOps automation and tools for continuous integration, deployment, and monitoring (e.g., Terraform, Jenkins, GitLab CI / CD, Prometheus)
- Excellent problem-solving skills, with the ability to troubleshoot complex issues and drive resolution in a fast-paced environment
- Strong communication skills, with the ability to work effectively in cross-functional teams
AI / ML experience (highly desired) :
- Experience building and operating Generative AI, Agentic, and / or RAG systems
- Familiarity with latest open-source AI models / frameworks (Llama, LangGraph, etc.)
- Understanding and experience with ML Ops and related tooling / infrastructure
Career Level : IC4