We are seeking a seasoned Cloud AI Specialist to spearhead the design, development, and deployment of cutting-edge artificial intelligence and machine learning solutions on Google Cloud Platform (GCP).
The ideal candidate will possess strong programming proficiency in Python with significant experience in developing web APIs using FastAPI and demonstrable expertise with GCP services including Vertex AI, Cloud Run / Functions, and Cloud Storage.
- Develop and deploy production-grade AI applications and microservices primarily using Python and FastAPI ensuring robust API design security and scalability.
- Design and implement end-to-end LLM pipelines encompassing data ingestion processing model inference and output generation.
- Utilize GCP services extensively including Vertex AI Generative AI Model Garden Workbench Cloud Functions Cloud Run Cloud Storage and BigQuery to build train and deploy LLMs and AI models.
- Expertly apply prompt engineering techniques and strategies to optimize LLM responses manage context windows and reduce hallucinations.
- Implement and manage embeddings and vector stores for efficient information retrieval and Retrieval-Augmented Generation (RAG) patterns.
Responsibilities
Participate in code reviews establish best practices for AI application development and contribute to a culture of technical excellence.Keep abreast of the latest advancements in GCP AI / ML services and broader AI / ML technologies evaluating and recommending new tools and approaches.