Position : GenAI Lead / Architect
Experience : 5+ Years
Job Summary :
We are seeking a seasoned GenAI Lead / Architect with at least 5 years of professional experience. The ideal candidate will be a hands-on leader, responsible for designing and building scalable, production-grade Generative AI applications. This role requires deep expertise in Python and FastAPI for backend development, coupled with a strong command of AWS cloud services to deploy and manage resilient AI systems. You will play a crucial role in shaping our GenAI strategy, from architectural design to implementation.
Key Responsibilities :
- Architectural Design & Strategy : Lead the design and implementation of end-to-end GenAI system architectures. You will be responsible for creating scalable, secure, and cost-effective solutions that leverage both proprietary and open-source models.
- Application Development : Build and deploy scalable Generative AI applications using Python and the FastAPI framework. This includes developing high-performance RESTful APIs and microservices for seamless integration.
- Model Integration & Optimization : Select and fine-tune GenAI models (LLMs, vision models, etc.) and integrate them into new and existing applications. You will be responsible for optimizing model performance and managing inference costs.
- Cloud Infrastructure Management : Utilize AWS cloud services to deploy, manage, and scale GenAI applications. This includes configuring services for data pipelines, model hosting, and application security.
- Team Leadership : Act as a technical leader, guiding junior developers, conducting thorough code reviews, and establishing best practices for development, testing, and deployment.
Required Skills & Qualifications :
- At least 5 years of experience in software development or machine learning engineering.
- Proven experience in designing and building scalable applications with Python.
- Strong expertise with the FastAPI framework for building robust APIs.
- Hands-on experience with AWS cloud services for deploying and managing applications.
- A solid understanding of Generative AI principles and a strong background in working with Large Language Models (LLMs).
- Experience with building and deploying applications in a production environment.
Preferred Skills :
- Experience with LLM frameworks such as LangChain or LlamaIndex.
- Knowledge of AWS AI / ML services such as Amazon SageMaker and Amazon Bedrock, and of Amazon EKS for containerized workloads.
- Familiarity with containerization technologies like Docker and orchestration with Kubernetes.
- Experience with other cloud platforms like Azure or GCP.
- A background in machine learning operations (MLOps).
(ref : hirist.tech)