Talent.com
AI Evals & Test Engineer

BharatGen, Mumbai, Maharashtra, India
1 day ago
Job description

Job Summary :

We are looking for an AI Evaluation & Test Engineer to join our growing team to ensure that our generative AI models and applications are safe, accurate, trustworthy, and deliver an elegant user experience. You will serve as the first customer of our AI systems. This role is ideal for product-minded engineers who obsess over product quality and customer-centricity, and are passionate about shaping the behavior of AI systems in the real world.

Key Responsibilities :

  • Build and maintain AI evaluation pipelines to test, measure, and evaluate the behavior and performance of AI systems.
  • Implement traces, spans, and session tracking for observability, and identify how errors propagate through multi-step pipelines.
  • Define AI quality metrics and KPIs around factuality, faithfulness, toxicity, grounding precision/recall, latency, cost, etc., with clear acceptance bars.
  • Implement evaluation and testing automation to enable end-to-end system and regression testing at scale.
  • Define criteria for and implement release gates in the CI/CD pipeline.
  • Find creative ways to break products.
  • Assist in root cause analysis and troubleshooting of bugs and field issues.
  • Collaborate with cross-functional teammates from product, engineering, linguistics, and customer support to shape human-AI interaction paradigms and ensure that our AI models and applications deliver the desired outcome and user experience.

Minimum Qualifications and Experience :

  • Bachelor’s or Master’s degree in CS/CE/IT/EE/E&TC or related fields with 5+ years of experience in manual and automated testing of software products, including at least 2 years evaluating and testing AI/ML products.

Required Expertise :

  • Strong software testing fundamentals and expertise in writing test plans, executing test cases, and generating detailed reports and dashboards.
  • Strong analytical and debugging skills, and attention to detail.
  • Proficiency in Python, scripting, and software testing automation frameworks and tools such as Pytest, Selenium, Robot Framework, etc.
  • Working knowledge of generative AI models, AI agents, and related concepts such as retrieval-augmented generation (RAG), prompt engineering, context engineering, explainability, traceability, observability, guardrails, reasoning, specificity, etc.
  • Sound understanding of the fundamental differences between testing conventional software and evaluating generative AI systems.
  • Team player with excellent interpersonal skills and the ability to collaborate effectively with remote and cross-functional team members.
  • Go-getter attitude and ability to flourish in a fast-paced startup environment.

Experience in any of the following would be a big plus :

  • AI evaluation frameworks such as Arize, Braintrust, DeepEval, LangSmith, and Ragas.
  • AI safety and red teaming experience, e.g., prompt injection, jailbreak, adversarial and stress testing.
  • Different types of AI evaluation methods, e.g., human-in-the-loop, LLM-as-a-judge.