Design and execute test strategies for Gen AI models, including LLMs, image generators, and multimodal systems.
Create and maintain test cases and prompt libraries to evaluate model output quality (e.g., relevance, coherence, creativity, factual accuracy).
Test model behaviour across different scenarios, languages, and user intents (e.g., hallucinations, bias, toxicity, safety risks).
Perform manual and automated testing of Gen AI APIs, chatbots, and content generation tools.
Validate input/output behaviour: prompt testing, response evaluation, token-level analysis.
Collaborate with engineering teams to implement prompt evaluation frameworks, log monitoring, and output scoring systems.
Track model performance metrics (BLEU, ROUGE, perplexity, toxicity score, etc.) and suggest areas for improvement.
Conduct fairness, bias, and safety testing to ensure compliance with ethical and regulatory guidelines.
Assist in evaluating A/B experiments and fine-tuning model behaviours.
Document issues clearly and contribute to root cause analysis with dev teams.
Key Skills
Academics, Apache Commons, Apache Tomcat, Filing, Condition Monitoring
Employment Type : Full-Time
Experience : years
Vacancy : 1
Gen AI • Chennai, Tamil Nadu, India