Talent.com
Remote Technical Reviewer - RL Environment Terminal Benchmarking (Agentic AI) - AI Trainer ($25-$30 per hour)

Mercor • India
11 hours ago
Job type
  • Remote
Job description

Mercor is hiring a Technical Reviewer on behalf of a leading AI lab to evaluate and refine benchmarking pipelines for reinforcement learning (RL) environments and agentic AI systems. In this role, you’ll be responsible for reviewing environment design, terminal conditions, and evaluation protocols to ensure accuracy, reproducibility, and fairness in benchmarking. You’ll work closely with researchers and engineers to provide technical feedback that strengthens experimental rigor and system reliability.

You’re a great fit if you:
  • Have a background in reinforcement learning, computer science, or applied AI research.
  • Are experienced with RL environments.
  • Understand benchmarking methodologies, terminal conditions, and evaluation metrics for RL tasks.
  • Are comfortable reading and reviewing codebases in Python (PyTorch / TensorFlow a plus).
  • Have strong critical thinking skills and can provide structured technical feedback.
  • Care deeply about experimental reproducibility, fairness, and standardization in agentic AI.
  • Are detail-oriented and capable of reviewing both theoretical formulations and implementation details.

Primary Goal of This Role

To review, validate, and improve reinforcement learning environment benchmarking pipelines, ensuring that terminal conditions, evaluation metrics, and system behaviors are robust, reproducible, and aligned with agentic AI research goals.

What You’ll Do
  • Review RL environments and evaluate terminal conditions for correctness and consistency.
  • Assess benchmarking pipelines for fairness, reproducibility, and alignment with research objectives.
  • Provide structured technical feedback on code implementations and documentation.
  • Collaborate with researchers to refine evaluation metrics and methodologies.
  • Ensure reproducibility by validating results across different runs, seeds, and hardware setups.
  • Document findings and recommend improvements for environment design and benchmarking standards.

Why This Role Is Exciting
  • You’ll directly influence the reliability of benchmarking in agentic AI research.
  • You’ll work on cutting-edge RL environments that test the limits of intelligent agents.
  • You’ll help establish standards for evaluation and reproducibility in a fast-moving field.
  • You’ll collaborate with researchers shaping the future of agentic AI systems.

Pay & Work Structure
  • You’ll be classified as a full-time hourly contractor to Mercor.
  • Paid weekly via Stripe Connect, based on hours logged.
  • 40 hours / week commitment with flexible scheduling.
  • Remote and flexible working style.