Talent.com
AI System Evaluation Engineer

AI System Evaluation Engineer

BharatGenRepublic Of India, IN
13 days ago
Job description

Job Summary :

We are looking for an AI Evaluation & Test Engineer to join our growing team to ensure that our generative AI models and applications are safe, accurate, trustworthy, and deliver an elegant user experience. You will serve as the first customer of our AI systems. This role is ideal for product-minded engineers who obsess over product quality and customer-centricity, and are passionate about shaping the behavior of AI systems in the real world.

Key Responsibilities :

  • Build and maintain AI evaluation pipelines to test, measure, and evaluate the behavior and performance of AI systems.
  • Implement traces, spans, and session tracking for observability and identify error propagation in multi-step pipelines.
  • Define AI quality metrics and KPIs around factuality, faithfulness, toxicity, grounding precision / recall, latency, cost, etc., with clear acceptance bars.
  • Implement evaluation and testing automation to enable end-to-end system and regression testing at scale.
  • Define criteria for and implement release gates in the CI / CD pipeline.
  • Find creative ways to break products.
  • Assist in root cause analysis and troubleshooting of bugs and field issues.
  • Collaborate with cross-functional teammates from product, engineering, linguistics, and customer support to shape human-AI interaction paradigms and ensure that our AI models and applications deliver the desired outcome and user experience.

Minimum Qualifications and Experience :

  • Bachelor’s or Master’s degree in CS / CE / IT / EE / E&TC or related fields with 5+ years of experience in manual and automation testing of software products, with at least 2 years in evaluating and testing AI / ML products.
  • Required Expertise :

  • Strong software testing fundamentals and expertise in writing test plans, executing test cases, and generating detailed reports and dashboards.
  • Strong analytical and debugging skills, and attention to detail.
  • Proficiency in Python, scripting, and software testing automation frameworks and tools such as Pytest, Selenium, Robot Framework, etc.
  • Working knowledge of generative AI models, AI agents, and related concepts such as retrieval augmented generation (RAG), prompt engineering, context engineering, explainability, traceability, observability, guard rails, reasoning, specificity, etc.
  • Sound understanding of the fundamental differences in the approach for testing conventional software versus evaluating generative AI systems.
  • Team player with excellent interpersonal skills and the ability to collaborate effectively with remote and cross-functional team members.
  • Go-getter attitude and ability to flourish in a fast-paced, startup environment.
  • Experience in any of the following would be a big plus -
  • AI evaluation frameworks such as Arize, Braintrust, DeepEval, LangSmith, Ragas
  • AI safety and red teaming experience, e.G., prompt injection, jailbreak, adversarial and stress testing.
  • Different types of AI evaluation methods, e.G, Human-in-the-loop, LLM-as-a-Judge.
  • Create a job alert for this search

    Ai Engineer • Republic Of India, IN

    Related jobs
    • Promoted
    Senior AI Engineer

    Senior AI Engineer

    Nous InfosystemsNagpur, IN
    Nous Infosystems is a CMMI® Level 5 and ISO 9001 : 2000 certified global information Technology Company with expertise in providing quality software solutions and IT-enabled support services to a wid...Show moreLast updated: 21 days ago
    • Promoted
    Generative AI STEM Evaluator

    Generative AI STEM Evaluator

    AceolutionRepublic Of India, IN
    As a STEM Rater for [Maths / Physics / Chemistry / Biology / Coding / Finance], you will be a crucial contributor to shaping the "brain" of our AI. Working closely with the STEM Lead and your team, your prima...Show moreLast updated: 30+ days ago
    • Promoted
    AI Exploration Engineer

    AI Exploration Engineer

    Mitchell Martin Inc.India, India
    Design and execute machine learning experiments to evaluate emerging AI technologies and frameworks.Prototype and assess end-to-end AI solutions to inform product and platform strategy.Formulate hy...Show moreLast updated: 30+ days ago
    • Promoted
    AI Systems Engineer

    AI Systems Engineer

    go4WorldBusiness.com - Import | Export | Trade | Worldwide.New Delhi, Republic Of India, IN
    AI / ML development, ideally with some exposure to leading projects or mentoring others.We are setting up a dedicated AI department at go4WorldBusiness to power the next generation of intelligent, AI...Show moreLast updated: 30+ days ago
    • Promoted
    Responsible AI

    Responsible AI

    EXLNagpur, IN
    We are seeking a highly skilled and principled Responsible AI Evaluator to assess, audit, and ensure the ethical development and deployment of AI models across the enterprise.This role spans tradit...Show moreLast updated: 30+ days ago
    • Promoted
    Gen AI Engineer

    Gen AI Engineer

    ADPnagpur, maharashtra, in
    We are seeking a highly skilled and experienced Senior Generative AI Engineer to lead the development of intelligent agents powered by advanced generative AI models. In this role, you will be respon...Show moreLast updated: 18 days ago
    • Promoted
    Ai Exploration Engineer

    Ai Exploration Engineer

    Mitchell Martin Inc.Republic Of India, IN
    Design and execute machine learning experiments to evaluate emerging AI technologies and frameworks.Prototype and assess end-to-end AI solutions to inform product and platform strategy.Formulate hy...Show moreLast updated: 30+ days ago
    • Promoted
    AI Engineer

    AI Engineer

    TalentBridgeNagpur, IN
    Job Type : 6-Month Contract, after 6 months it will convert to fulltime.We are looking for an experienced AIML Engineer with 4–8 years of expertise in AI / ML solutions, specifically in building intel...Show moreLast updated: 19 days ago
    • Promoted
    Full Stack AI engineer

    Full Stack AI engineer

    AnswerThis (YC F25)Nagpur, IN
    Remote (Applications open worldwide).Semantic Search, Vector Databases, Prompt Engineering, GenAI Frameworks, React Agents, Graph Agents, Document Parsing, Python, Scalable APIs.AnswerThis is an AI...Show moreLast updated: 30+ days ago
    • Promoted
    AI Technology Evaluation Engineer

    AI Technology Evaluation Engineer

    Mitchell Martin Inc.Republic Of India, IN
    Design and execute machine learning experiments to evaluate emerging AI technologies and frameworks.Prototype and assess end-to-end AI solutions to inform product and platform strategy.Formulate hy...Show moreLast updated: 30+ days ago
    • Promoted
    AI Engineer

    AI Engineer

    QX Labsnagpur, maharashtra, in
    QX Labs is looking for a full stack AI engineer with a strong software background to join our expanding team of AI experts. We are an early-stage London-based startup building cutting-edge workflow ...Show moreLast updated: 12 days ago
    • Promoted
    Generative AI Systems Engineer

    Generative AI Systems Engineer

    Servion Global SolutionsChennai, Republic Of India, IN
    Design and develop enterprise-scale agentic AI solutions using LangGraph and related frameworks.Build and optimize RAG systems (chunking, retrieval strategies, evaluation) with an emphasis on accur...Show moreLast updated: 10 days ago
    • Promoted
    Generative AI Systems Engineer

    Generative AI Systems Engineer

    L&T Technology ServicesRepublic Of India, IN
    Only immediate to 15 days joiner.Develop and Deploy AI Solutions : .Design, build, and deploy end-to-end Machine Learning and Generative AI pipelines on. Google Cloud Platform, using Vertex AI service...Show moreLast updated: 18 days ago
    • Promoted
    Agentic AI Engineer

    Agentic AI Engineer

    Nityo Infotechnagpur, maharashtra, in
    AI Agent Development & LLM Integration.Build AI agents using frameworks like LangGraph, Autogen, Crew, or PydanticAI.Design and optimize prompt engineering workflows for LLMs (e.Develop modular, re...Show moreLast updated: 21 days ago
    • Promoted
    AI Systems Evaluation Engineer (Remote)

    AI Systems Evaluation Engineer (Remote)

    TaskifyRepublic Of India, IN
    Remote
    We're Hiring "Software Engineers (Freelance / Remote)" | Earn up to $2500 per month.Join a global community of talented writers to shape the future of AI. Contribute to training and refining cutting-e...Show moreLast updated: 20 days ago
    • Promoted
    Generative AI Engineer (LLM Expert – AWS Focus)

    Generative AI Engineer (LLM Expert – AWS Focus)

    BigRioNagpur, IN
    Job Title : Generative AI Engineer (LLM Expert – AWS Focus).Employment Type : Ongoing Contract.Boston-based, remote-first technology consulting firm. We partner with forward-thinking organizations to ...Show moreLast updated: 11 days ago
    • Promoted
    Generative AI Engineer

    Generative AI Engineer

    Nexoria Techworks Inc.nagpur, maharashtra, in
    Job Description : Generative AI Engineer.AI models, large language model (LLM) applications, and agentic flows.Your core responsibilities will include : . Build Production RAG / Agentic Flows.Develop and...Show moreLast updated: 21 days ago
    • Promoted
    AiOps Engineer

    AiOps Engineer

    L&T Technology Servicesnagpur, maharashtra, in
    Only immediate to 15 days joiner.Develop and Deploy AI Solutions : .Design, build, and deploy end-to-end Machine Learning and Generative AI pipelines on. Google Cloud Platform, using Vertex AI service...Show moreLast updated: 18 days ago