We are seeking a highly skilled AI Engineer to join our dynamic team. The successful candidate will be instrumental in designing, developing, and deploying cutting-edge artificial intelligence solutions leveraging the full suite of cloud services.
- Lead the end-to-end development cycle of AI applications, from conceptualization and prototyping to deployment and optimization, with a core focus on large language models.
- Architect and implement highly performant and scalable AI services that integrate effectively with the cloud platform's AI ecosystem.
Responsibilities
- Develop and deploy production-grade AI applications primarily using Python and FastAPI, ensuring robust API design, security, and scalability.
- Design and implement end-to-end pipelines, encompassing data ingestion, processing, model inference, and output generation.
Key Skills
- Strong programming proficiency in Python, with significant experience in developing web APIs using FastAPI.
- Demonstrable expertise with cloud services, specifically Vertex AI (Generative AI), Cloud Run / Functions, and Cloud Storage.
Requirements
- Two or more years of hands-on experience as an AI Engineer with a focus on building and deploying AI applications, particularly those involving Large Language Models.
- Practical knowledge and application of embeddings and vector stores for semantic search and retrieval architectures.
- Hands-on experience with at least one major LLM orchestration framework.
- Preferred qualifications include a Bachelor's or Master's degree in Computer Science and experience with MLOps practices for deploying, monitoring, and maintaining AI models in production.