Role Summary:
We are seeking a highly skilled Gen AI Developer Specialist with extensive experience in building, deploying, and scaling production-grade Generative AI applications leveraging Large Language Models (LLMs), vector databases, and cloud AI platforms.
The ideal candidate will have hands-on expertise in implementing real-world AI solutions that solve complex business challenges while adhering to ethical AI practices.
You will lead the end-to-end design, development, and deployment of scalable GenAI systems, working closely with cross-functional teams to drive innovation and operational excellence.
Key Responsibilities:
- Architect and develop GenAI applications using state-of-the-art LLMs such as OpenAI GPT, Claude, LLaMA, and other transformer-based models.
- Build Retrieval-Augmented Generation (RAG) pipelines integrating vector search with knowledge bases.
- Implement multimodal AI use cases combining text, images, audio, and video data.
- Develop agent-based AI workflows for autonomous task execution and decision-making.
- Identify AI-driven business opportunities and design cost-effective AI solutions.
- Optimize model usage, API calls, and infrastructure costs without compromising performance.
- Apply responsible AI principles to ensure fairness, transparency, and ethical use.
- Implement safeguards against prompt injection, jailbreaking, and other security threats.
- Design Human-in-the-Loop (HITL) systems for supervised AI outputs and continuous improvement.
- Process and analyze real-time voice and text inputs, including chunking, parsing, and conversion for downstream AI models.
- Deploy and manage scalable GenAI applications on Azure AI Studio, AWS Bedrock, or similar cloud platforms.
- Work with DevOps and cloud engineering teams to implement CI/CD pipelines, monitoring, and automated scaling.
- Collaborate with data scientists, software engineers, product managers, and stakeholders to deliver AI-powered features.
- Mentor junior developers and promote best practices in GenAI development and deployment.
Required Skills & Experience:
- Proven experience with Azure OpenAI Service and/or AWS Bedrock platforms for AI application development.
- Strong expertise in Python programming, with frameworks such as FastAPI, LangChain, or equivalent for building AI-powered APIs and pipelines.
- Hands-on experience with vector databases like FAISS, Pinecone, or Weaviate for embedding-based search and retrieval.
- Deep understanding of Retrieval-Augmented Generation (RAG) techniques and implementation.
- Familiarity with prompt engineering and prompt safety best practices to secure AI models.
- Experience building and deploying production-grade AI applications with scalable architecture.
- Knowledge of responsible AI principles, bias mitigation, and AI ethics.
- Expertise in handling real-time data ingestion, chunking, and pre-processing for AI workflows.
- Strong problem-solving skills and ability to communicate complex AI concepts to technical and non-technical audiences.
(ref: hirist.tech)