Talent.com
AI Evals & Test Engineer

BharatGen • Mumbai, Mumbai (district)
30+ days ago
Job description

Job Summary :

We are looking for an AI Evaluation & Test Engineer to join our growing team to ensure that our generative AI models and applications are safe, accurate, trustworthy, and deliver an elegant user experience. You will serve as the first customer of our AI systems. This role is ideal for product-minded engineers who obsess over product quality and customer-centricity, and are passionate about shaping the behavior of AI systems in the real world.

Key Responsibilities :

  • Build and maintain AI evaluation pipelines to test, measure, and evaluate the behavior and performance of AI systems.
  • Implement traces, spans, and session tracking for observability and identify error propagation in multi-step pipelines.
  • Define AI quality metrics and KPIs around factuality, faithfulness, toxicity, grounding precision / recall, latency, cost, etc., with clear acceptance bars.
  • Implement evaluation and testing automation to enable end-to-end system and regression testing at scale.
  • Define criteria for and implement release gates in the CI / CD pipeline.
  • Find creative ways to break products.
  • Assist in root cause analysis and troubleshooting of bugs and field issues.
  • Collaborate with cross-functional teammates from product, engineering, linguistics, and customer support to shape human-AI interaction paradigms and ensure that our AI models and applications deliver the desired outcome and user experience.

Minimum Qualifications and Experience :

  • Bachelor’s or Master’s degree in CS / CE / IT / EE / E&TC or related fields with 5+ years of experience in manual and automation testing of software products, with at least 2 years in evaluating and testing AI / ML products.

Required Expertise :

  • Strong software testing fundamentals and expertise in writing test plans, executing test cases, and generating detailed reports and dashboards.
  • Strong analytical and debugging skills, and attention to detail.
  • Proficiency in Python, scripting, and software testing automation frameworks and tools such as Pytest, Selenium, Robot Framework, etc.
  • Working knowledge of generative AI models, AI agents, and related concepts such as retrieval-augmented generation (RAG), prompt engineering, context engineering, explainability, traceability, observability, guardrails, reasoning, specificity, etc.
  • Sound understanding of the fundamental differences in the approach for testing conventional software versus evaluating generative AI systems.
  • Team player with excellent interpersonal skills and the ability to collaborate effectively with remote and cross-functional team members.
  • Go-getter attitude and ability to flourish in a fast-paced, startup environment.

Experience in any of the following would be a big plus :

  • AI evaluation frameworks such as Arize, Braintrust, DeepEval, LangSmith, and Ragas.
  • AI safety and red teaming experience, e.g., prompt injection, jailbreak, adversarial and stress testing.
  • Different types of AI evaluation methods, e.g., human-in-the-loop, LLM-as-a-judge.