Talent.com
Evaluation Scenario Writer - AI Agent Testing Specialist

Mindrift • Pune, MH, IN
24 days ago
Job type
  • Remote
Job description

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What we do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

About the Role

We’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.

Although every project is unique, you might typically:

  • Design structured test scenarios based on real-world tasks.
  • Define the golden path and acceptable agent behavior.
  • Annotate task steps, expected outputs, and edge cases.
  • Work with developers to test your scenarios and improve clarity.
  • Review agent outputs and adapt tests accordingly.
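
As a rough illustration of the kind of work listed above, here is a minimal sketch of a scenario written in JSON with a golden path, scored against an agent run. The field names (`gold_path`, `tolerated_actions`), the action names, and the scoring rule are all hypothetical and chosen for the example; they are not Mindrift's actual format, and real projects define their own guidelines.

```python
import json

# Hypothetical scenario description; every field name here is an assumption.
SCENARIO_JSON = """
{
  "name": "book_one_way_flight",
  "gold_path": [
    "search_flights",
    "select_flight",
    "enter_passenger_details",
    "confirm_booking"
  ],
  "tolerated_actions": ["check_weather"]
}
"""

# Hypothetical log of the actions an agent took during one run.
AGENT_ACTIONS = [
    "search_flights",
    "check_weather",
    "select_flight",
    "open_promotions",
    "confirm_booking",
]


def load_scenario(text: str) -> dict:
    """Parse a scenario and check that the required fields are present."""
    scenario = json.loads(text)
    for key in ("name", "gold_path"):
        if key not in scenario:
            raise ValueError(f"scenario is missing required field: {key}")
    scenario.setdefault("tolerated_actions", [])
    return scenario


def score_run(scenario: dict, agent_actions: list[str]) -> dict:
    """Score a run by set overlap with the golden path.

    Tolerated actions are ignored. Step order is NOT checked in this
    sketch; a stricter scorer would also compare the sequence.
    """
    gold = set(scenario["gold_path"])
    tolerated = set(scenario["tolerated_actions"])
    taken = {a for a in agent_actions if a not in tolerated}
    hits = gold & taken
    precision = len(hits) / len(taken) if taken else 0.0
    recall = len(hits) / len(gold) if gold else 0.0
    return {"precision": precision, "recall": recall}


if __name__ == "__main__":
    print(score_run(load_scenario(SCENARIO_JSON), AGENT_ACTIONS))
```

In this run the agent skips one golden-path step and takes one unexpected action, so both precision and recall come out at 0.75.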

How to get started

Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.

Requirements

  • Bachelor's and/or Master’s degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems, or other related fields.
  • Background in QA, software testing, data analysis, or NLP annotation.
  • Good understanding of test design principles (e.g., reproducibility, coverage, edge cases).
  • Strong written communication skills in English.
  • Comfortable with structured formats like JSON / YAML for scenario description.
  • Can define expected agent behaviors (gold paths) and scoring logic.
  • Basic experience with Python and JS.
  • Curious and open to working with AI-generated content, agent logs, and prompt-based behavior.
  • You are ready to learn new methods, able to switch between tasks and topics quickly, and sometimes work with challenging, complex guidelines.
  • Our freelance role is fully remote, so you just need a laptop, an internet connection, available time, and enthusiasm to take on a challenge.

Nice to Have

  • Experience in writing manual or automated test cases.
  • Familiarity with LLM capabilities and typical failure modes.
  • Understanding of scoring metrics (precision, recall, coverage, reward functions).

Benefits

Contribute on your own schedule, from anywhere in the world. This opportunity allows you to:

  • Get paid for your expertise, with rates that can go up to $17 / hour depending on your skills, experience, and project needs.
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
  • Influence how future AI models understand and communicate in your field of expertise.