Talent.com
AI Evals & Test Engineer

BharatGen
Mumbai, Maharashtra, India
2 days ago
Job description

Job Summary:

We are looking for an AI Evaluation & Test Engineer to join our growing team and ensure that our generative AI models and applications are safe, accurate, and trustworthy, and that they deliver an elegant user experience. You will serve as the first customer of our AI systems. This role is ideal for product-minded engineers who obsess over product quality and customer-centricity and are passionate about shaping the behavior of AI systems in the real world.

Key Responsibilities:

  • Build and maintain AI evaluation pipelines to test, measure, and evaluate the behavior and performance of AI systems.
  • Implement traces, spans, and session tracking for observability and identify error propagation in multi-step pipelines.
  • Define AI quality metrics and KPIs around factuality, faithfulness, toxicity, grounding precision/recall, latency, cost, etc., with clear acceptance bars.
  • Implement evaluation and testing automation to enable end-to-end system and regression testing at scale.
  • Define criteria for and implement release gates in the CI/CD pipeline.
  • Find creative ways to break products.
  • Assist in root cause analysis and troubleshooting of bugs and field issues.
  • Collaborate with cross-functional teammates from product, engineering, linguistics, and customer support to shape human-AI interaction paradigms and ensure that our AI models and applications deliver the desired outcome and user experience.

Minimum Qualifications and Experience:

  • Bachelor’s or Master’s degree in CS/CE/IT/EE/E&TC or a related field, with 5+ years of experience in manual and automation testing of software products, including at least 2 years evaluating and testing AI/ML products.

Required Expertise:

  • Strong software testing fundamentals and expertise in writing test plans, executing test cases, and generating detailed reports and dashboards.
  • Strong analytical and debugging skills, and attention to detail.
  • Proficiency in Python, scripting, and software test automation frameworks and tools such as Pytest, Selenium, and Robot Framework.
  • Working knowledge of generative AI models, AI agents, and related concepts such as retrieval-augmented generation (RAG), prompt engineering, context engineering, explainability, traceability, observability, guardrails, reasoning, and specificity.
  • Sound understanding of the fundamental differences between testing conventional software and evaluating generative AI systems.
  • Team player with excellent interpersonal skills and the ability to collaborate effectively with remote and cross-functional team members.
  • Go-getter attitude and the ability to flourish in a fast-paced startup environment.

Experience in any of the following would be a big plus:

  • AI evaluation frameworks such as Arize, Braintrust, DeepEval, LangSmith, and Ragas.
  • AI safety and red teaming, e.g., prompt injection, jailbreaks, and adversarial and stress testing.
  • Different types of AI evaluation methods, e.g., human-in-the-loop and LLM-as-a-judge.