Oracle Cloud Infrastructure blends the speed of a startup with the scale of an enterprise leader. Our Generative AI Solutions team builds advanced AI solutions that run on powerful cloud infrastructure tackling real-world, global challenges. As part of this team, you'll contribute to large-scale cloud solutions utilizing cutting-edge machine learning and Generative AI technologies, aimed at addressing complex global challenges.
Join our OCI Gen-AI Solutions team to build cutting-edge AI applications that tackle global challenges. We’re seeking an experienced Principal Machine Learning Engineer (IC4) to design, develop, and deploy customized Generative AI solutions for strategic customers focusing on Agentic solutions and Retrieval Augmented Generation (RAG). You’ll work closely with applied scientists and product managers to turn innovation into real-world impact.
In this role, you will :
- Design, build, and deploy cutting-edge machine learning and generative AI solutions, with a focus on Large Language Models (LLMs), AI agents, Retrieval-Augmented Generation (RAG), and large-scale search.
- Collaborate with scientists, engineers, and product teams to turn complex problems into scalable, cloud-ready AI solutions for enterprises.
- Run experiments, explore new algorithms, and push the boundaries of AI to optimize performance, customer experience, and business outcomes.
- Ensure ethical and responsible AI practices in all solutions.
We’re looking for a Principal Machine Learning Engineer with deep expertise in applied ML / AI, hands-on experience building production-grade solutions, and the creativity to innovate at the intersection of AI and enterprise cloud.
As part of the OCI Gen AI Solutions Engineering for strategic customersteam, you will be responsible for developing innovative Agentic workflows and solutions for our strategic customers. As a Principal member of the technical staff, you'll be part of the development of advanced Gen AI solutions using the latest ML technologies combined with Oracle's cloud expertise. Your work will significantly impact sectors like financial services, telecom, healthcare, and code generation by creating distributed, scalable, high-performance solutions for strategic customers.
Work directly with key customers and accompany them on their Gen AI journey – understanding their requirements, help them envision and design and build the right solutions and work together with their ML engineering to remove blockers.Design and implement agentic workflows using diverse frameworks, incorporating prompt engineering to optimize performance.You will dive deep into model structure to optimize model performance and scalability.You will build state of art solutions with brand new technologies in this fast-evolving area.You will configure large scale OpenSearch clusters, setting up data ingestion pipelines to get the data into the OpenSearch.You will diagnose, troubleshoot, and resolve issues in AI model training and serving. You may also perform other duties as assigned.Build re-usable solution patterns and reference solutions / showcases that can apply across multiple customers.Be an enthusiastic, self-motivated, and a great collaborator.Be our product evangelist - engage directly with customers and partners, participate and present in external events and conferences, etc.Qualifications
Bachelor’s or Master’s degree in Computer Science or related technical field, with 10+ years of experience in AI, ML, or data-driven solution development.Proven track record designing, building, and deploying scalable AI / ML solutions in production environments.Deep expertise in Large Language Models (LLMs), Generative AI, Agentic solutions, and advanced ML techniques (fine-tuning, prompt engineering, model optimization).Strong experience with OpenSearch, vector databases, data ingestion pipelines, and large-scale search optimization.Skilled in diagnosing, troubleshooting, and resolving issues in AI model training and serving.Hands-on experience with MCP, NLP, NLU, RAG architectures, Agents, and modern AI frameworks (., LangChain, LangGraph LlamaIndex).Proficient in Python and shell scripting, with familiarity in deep learning frameworks (PyTorch, TensorFlow, JAX, or Transformers).Experience with popular model training and serving frameworks like KServe, KubeFlow, Triton etc.Excellent communication skills for translating complex technical concepts into clear proposals, designs, and presentations.Collaborative mindset with experience working closely with product managers, engineers, and customers.Ability to mentor and guide junior data scientists or ML engineers.Experience acting as a technical evangelist, presenting at conferences, customer briefings, or industry events.Career Level - IC4