Talent.com
AI Agent Evaluation Analyst

AI Agent Evaluation Analyst

ConfidentialHyderabad / Secunderabad, Telangana, India
5 days ago
Job description

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we're looking for :

We're looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil's advocate.

Are you comfortable with ambiguity and complexity Does an async, remote, flexible opportunity sound exciting Would you like to learn how modern AI systems are tested and evaluated

This is a flexible, project-based opportunity well-suited for :

  • Analysts, researchers, or consultants with strong critical thinking skills
  • Students (senior undergrads / grad students) looking for an intellectually interesting gig
  • People open to a part-time and non-permanent opportunity

About the project :

We're on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you'll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you've ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.

What you'll be doing :

  • Reviewing evaluation tasks and scenarios for logic, completeness, and realism
  • Identifying inconsistencies, missing assumptions, or unclear decision points
  • Helping define clear expected behaviors (gold standards) for AI agents
  • Annotating cause-effect relationships, reasoning paths, and plausible alternatives
  • Thinking through complex systems and policies as a human would to ensure agents are tested properly
  • Working closely with QA, writers, or developers to suggest refinements or edge case coverage
  • How to get started :

    Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

    Requirements

  • Excellent analytical thinking : Can reason about complex systems, scenarios, and logical implications
  • Strong attention to detail : Can spot contradictions, ambiguities, and vague requirements
  • Familiarity with structured data formats : Can read, not necessarily write JSON / YAML
  • Ability to assess scenarios holistically : What's missing, what's unrealistic, what might break
  • Good communication and clear writing (in English) to document your findings.
  • We also value applicants who have :

  • Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
  • Background in consulting, academia, olympiads (e.g. logic / math / informatics), or research
  • Exposure to LLMs, prompt engineering, or AI-generated content
  • Familiarity with QA or test-case thinking (edge cases, failure modes, 'what could go wrong')
  • Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)
  • Benefits

  • Get paid for your expertise, with rates that can go up to $15 / hour depending on your skills, experience, and project needs
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio
  • Influence how future AI models understand and communicate in your field of expertise
  • Skills Required

    Analytical Thinking, Qa, case studies , Attention To Detail

    Create a job alert for this search

    Ai Analyst • Hyderabad / Secunderabad, Telangana, India

    Related jobs
    • Promoted
    AI Exploration Engineer

    AI Exploration Engineer

    Mitchell Martin Inc.Hyderabad, IN
    Design and execute machine learning experiments to evaluate emerging AI technologies and frameworks.Prototype and assess end-to-end AI solutions to inform product and platform strategy.Formulate hy...Show moreLast updated: 30+ days ago
    • Promoted
    Generative AI Engineer

    Generative AI Engineer

    LTIMindtreeHyderabad, Telangana, India
    Develop and deploy Generative AI models for enterprise-grade applications.Integrate LLMs with external tools, APIs, and vector databases for enhanced agent capabilities. Architect and implement agen...Show moreLast updated: 22 days ago
    • Promoted
    Search Engine Optimization Analyst

    Search Engine Optimization Analyst

    HighRadiusHyderabad, Telangana, India
    SEO Analyst About the Role We are seeking a dedicated SEO Analyst to join our Global Central Digital Marketing Team.In this role, you will primarily drive organic traffic, and content marketing e...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Agentic AI Engineer - Copilot Studio / Power Virtual Agents

    Senior Agentic AI Engineer - Copilot Studio / Power Virtual Agents

    BradsolHyderabad
    Are you passionate about the future of autonomous agents, AI orchestration, and Microsofts Copilot ecosystem? Join BRADSOL as a Senior Agentic AI Engineer, where youll lead the...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Ai Evaluation Engineer

    Principal Ai Evaluation Engineer

    BackbaseHyderabad, Republic Of India, IN
    Principal AI Evaluation Engineer.You will own the evaluation strategy for AI assistants and agentic workflows, ensuring they are reliable, observable, and safeguarded with strong guardrails.Beyond ...Show moreLast updated: 1 day ago
    • Promoted
    Agentic Ai Developer

    Agentic Ai Developer

    InterScripts, Inc.Hyderabad, Republic Of India, IN
    We are seeking an experienced Agentic AI Developer to design and deploy advanced AI systems that reason, plan, and act autonomously. The ideal candidate will have hands-on experience integrating lea...Show moreLast updated: 22 days ago
    • Promoted
    Agentic AI Engineer

    Agentic AI Engineer

    Intellectt Inchyderabad, telangana, in
    We are seeking an experienced Agentic AI Engineer to design and implement intelligent systems leveraging autonomous agents, LLMs, and advanced Python frameworks. The ideal candidate will have hands-...Show moreLast updated: 1 day ago
    • Promoted
    Agentic AI Developer

    Agentic AI Developer

    InterScripts, Inc.Hyderabad, Telangana, India
    We are seeking an experienced Agentic AI Developer to design and deploy advanced AI systems that reason, plan, and act autonomously. The ideal candidate will have hands-on experience integrating lea...Show moreLast updated: 22 days ago
    • Promoted
    AI Conversation Analyst

    AI Conversation Analyst

    Joveohyderabad, telangana, in
    At Joveo, we're on a mission to transform the job search landscape with the power of AI, automation, and human insight.Our cutting-edge platform enables global organizations to attract, engage, and...Show moreLast updated: 1 day ago
    • Promoted
    Ai Data Quality Analyst

    Ai Data Quality Analyst

    EqvistaHyderabad, Republic Of India, IN
    Eqvista is an integrated SaaS system that helps companies to manage private company equity by minimizing costs by automation, accounting, sharing and compliance tools built into the system.We also ...Show moreLast updated: 13 days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Chargebeehyderabad, telangana, in
    Chargebee’s GBT AI team builds internal AI agents and workflows that power smarter, faster operations across Finance, HR, Legal , Marketing, RevOps and GTM. We’re looking for an AI Engineer who can ...Show moreLast updated: 21 days ago
    • Promoted
    Research Analyst - 45426

    Research Analyst - 45426

    TuringHyderabad, Telangana, India
    Join us as an Research Analyst and help shape the future of large language models (like GPT).You’ll work on fascinating analytical questions, research real-world scenarios, and create structured co...Show moreLast updated: 30+ days ago
    AI Agent Evaluation Analyst

    AI Agent Evaluation Analyst

    MindriftHyderabad, TS, IN
    Remote
    Quick Apply
    This opportunity is only for candidates currently residing in the specified country.Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of En...Show moreLast updated: 24 days ago
    • Promoted
    Agentic Ai Engineer

    Agentic Ai Engineer

    Intellectt IncHyderabad, Republic Of India, IN
    We are seeking an experienced Agentic AI Engineer to design and implement intelligent systems leveraging autonomous agents, LLMs, and advanced Python frameworks. The ideal candidate will have hands-...Show moreLast updated: 1 day ago
    • Promoted
    Search Engine Evaluator

    Search Engine Evaluator

    ConfidentialNashik, Hyderabad / Secunderabad, Telangana
    We are looking for freshers / entry-level candidates to join our team as Search Engine Evaluators.This role involves evaluating search engine results to ensure their relevance and quality, making it ...Show moreLast updated: 30+ days ago
    • Promoted
    Applied AI Engineer

    Applied AI Engineer

    Strategic Talent PartnerHyderabad, IN
    Design and deploy advanced multi-agent pipelines for credit analysis.Optimize inference and prompt chains using frameworks like DSPy, GEPA, and LangChain. Implement reasoning techniques (CoT, ToT, G...Show moreLast updated: 1 day ago
    • Promoted
    Principal AI Evaluation Engineer

    Principal AI Evaluation Engineer

    Backbasehyderabad, telangana, in
    Principal AI Evaluation Engineer.You will own the evaluation strategy for AI assistants and agentic workflows, ensuring they are reliable, observable, and safeguarded with strong guardrails.Beyond ...Show moreLast updated: 1 day ago
    • Promoted
    AI Data Quality Analyst

    AI Data Quality Analyst

    Eqvistahyderabad, telangana, in
    Eqvista is an integrated SaaS system that helps companies to manage private company equity by minimizing costs by automation, accounting, sharing and compliance tools built into the system.We also ...Show moreLast updated: 13 days ago