Remote Technical Reviewer - RL Environment Terminal Benchmarking (Agentic AI) - AI Trainer ($25-$30 per hour)Mercor • India

Remote Technical Reviewer - RL Environment Terminal Benchmarking (Agentic AI) - AI Trainer ($25-$30 per hour)

Mercor • India

11 hours ago

Job type

Remote

Job description

Mercor is hiring a Technical Reviewer on behalf of a leading AI lab to evaluate and refine benchmarking pipelines for reinforcement learning (RL) environments and agentic AI systems. In this role, you’ll be responsible for

reviewing environment design, terminal conditions, and evaluation protocols
to ensure accuracy, reproducibility, and fairness in benchmarking. You’ll work closely with researchers and engineers to provide technical feedback that strengthens experimental rigor and system reliability.
###
You’re a great fit if you :
Have a background in
reinforcement learning, computer science, or applied AI research
. - Are experienced with
RL environments
. - Understand
benchmarking methodologies, terminal conditions, and evaluation metrics
for RL tasks. - Are comfortable reading and reviewing codebases in
Python
(PyTorch / TensorFlow a plus). - Have strong critical thinking skills and can provide
structured technical feedback
. - Care deeply about
experimental reproducibility, fairness, and standardization
in agentic AI. - Are detail-oriented and capable of reviewing both
theoretical formulations and implementation details
###
Primary Goal of This Role
To review, validate, and improve reinforcement learning environment benchmarking pipelines, ensuring that terminal conditions, evaluation metrics, and system behaviors are robust, reproducible, and aligned with agentic AI research goals.
###
What You’ll Do
Review RL environments and
evaluate terminal conditions
for correctness and consistency. - Assess
benchmarking pipelines
for fairness, reproducibility, and alignment with research objectives. - Provide
structured technical feedback
on code implementations and documentation. - Collaborate with researchers to refine
evaluation metrics and methodologies
. - Ensure reproducibility by validating results across different
runs, seeds, and hardware setups
. - Document findings and recommend improvements for
environment design and benchmarking standards
###
Why This Role Is Exciting
You’ll directly influence the
reliability of benchmarking in agentic AI research
. - You’ll work on
cutting-edge RL environments
that test the limits of intelligent agents. - You’ll help establish
standards for evaluation and reproducibility
in a fast-moving field. - You’ll collaborate with researchers shaping the
future of agentic AI systems
###
Pay & Work Structure
You’ll be classified as a
full-time hourly contractor
to Mercor. - Paid weekly via Stripe Connect, based on hours logged. - 40 hours / week commitment with flexible scheduling. - Remote and flexible working style.

Create a job alert for this search

Environment Terminal • India

Related jobs

Stem Rater

Aceolution • India, India

As a STEM Rater for [Maths / Physics / Chemistry / Biology / Coding / Finance], you will be a crucial contributor to shaping the "brain" of our AI. Working closely with the STEM Lead and your team, your prima...Show more

Last updated: 30+ days ago • Promoted

Artificial Intelligence Engineer

ACL Digital • India, India

We are Hiring : AI Engineer : Remote Opportunity.Design, develop and deploy scalable.Machine Learning and AI models.Perform data extraction, cleaning, transformation and modeling using.Develop end-to...Show more

Last updated: 10 days ago • Promoted

AI / ML Engineer (Remote) - Contractual (3 months)

DataOrbit AI • India, India

Remote

We are looking for a Machine Learning Engineer to build efficient, data-driven artificial intelligence systems that advance our predictive automation capabilities. The candidate should be highly ski...Show more

Last updated: 6 days ago • Promoted

Vermilion Reporting Suite

Vista Applied Solutions Group Inc • India, India

Hiring | Vermilion Reporting Suite | Long Term Contract | Remote.Role : Vermilion Reporting Suite.Designer, Publisher, Workflow modules. Layout design and template automation.Data models, mappings, a...Show more

Last updated: 2 hours ago • Promoted • New!

Stem Rater - Gen AI

Aceolution India • India, India

Last updated: 17 days ago • Promoted

AI Data Trainer

Innodata Inc. • India, India

AI and Machine Learning talent network.Data Annotators and Content Moderators (Review & Labeling).If you enjoy working with data, pay close attention to detail, and want to contribute to real-world...Show more

Last updated: 19 days ago • Promoted

AI / ML Engineer – LLM & Agentic AI Systems (3 to 9 yrs)

AIMLEAP • India, India

AI / ML Engineer – LLM & Agentic AI Systems.Tech in Computer Science, AI / ML, or related field.LLM and agentic AI development. AI pipelines, APIs, and integrations.LangChain, LlamaIndex, AutoGen.AI sys...Show more

Last updated: 12 hours ago • Promoted • New!

Generative AI Engineer

Avensys Consulting UK • India, India

Rate : 450-500 GBP Per Day – Inside IR35 MAX.The Gen AI Engineer will be a specialized type of artificial intelligence professional, focused on designing, developing & implementing generative AI mod...Show more

Last updated: 1 day ago • Promoted

Remote NLP Engineer - AI Trainer ($10.5-$10.5 per hour)

Mercor • India

Remote

Mercor is hiring an NLP Engineer on behalf of a leading AI lab.In this role, you’ll build language pipelines for classification, retrieval-augmented generation (RAG), and tokenization.You’ll design...Show more

Last updated: 11 hours ago • Promoted • New!

Remote Parallel Computing Engineer - AI Trainer ($10.5-$10.5 per hour)

Mercor • India

Remote

Mercor is hiring a Parallel Computing Engineer on behalf of a leading AI lab.In this role, you’ll • •accelerate numeric and simulation kernels • • through GPU / CPU parallelism, memory-hierarchy tuning,...Show more

Last updated: 11 hours ago • Promoted • New!

Remote Bioinformatics Researcher - AI Trainer ($10.5-$10.5 per hour)

Mercor • India

Remote

Mercor is hiring a Bioinformatics Researcher on behalf of a leading AI lab.In this role, you’ll • •build reproducible pipelines for genomics and proteomics data • •, applying machine learning and stat...Show more

Last updated: 11 hours ago • Promoted • New!

Responsible AI

EXL • India, India

We are seeking a highly skilled and principled Responsible AI Evaluator to assess, audit, and ensure the ethical development and deployment of AI models across the enterprise.This role spans tradit...Show more

Last updated: 30+ days ago • Promoted

AI Agent Architect

Luxoft • India, India

We are seeking a hands-on and qualified AI Agent Architect to design and deploy advanced Agentic AI systems—comprising task-specific autonomous tools governed by a master agent—to support complex t...Show more

Last updated: 23 days ago • Promoted

Remote Machine Learning Engineer - India - AI Trainer ($14-$14 per hour)

Mercor • India

Remote

Mercor is hiring a Machine Learning Engineer • • to help design, train, and deploy large-scale learning systems powering autonomous AI agents for its AI lab partner. This role is ideal for engineers p...Show more

Last updated: 11 hours ago • Promoted • New!

Agentic & AI Tech Ops Engineer

Insight Global • India, India

Agentic & AI Tech Ops Engineer.Agentic & AI Tech Ops Engineer.AI and Agentic AI systems in production.You will manage deployments, monitor performance, troubleshoot issues, and implement best pract...Show more

Last updated: 6 days ago • Promoted

Analyst

Innodata Inc. • India, India

Innodata is collaborating with a leading international conglomerate, to contract subject matter experts (SMEs) for a complex prompt data annotation project. SMEs will create complex prompts and resp...Show more

Last updated: 29 days ago • Promoted

Artificial Intelligence Engineer

INSPYR Solutions • India, India

Hybrid / Remote / Onsite — Add as needed].LLMs, RAG systems, AI agents, and autonomous workflows.You will play a key role in designing architecture, building scalable pipelines, and enabling production...Show more

Last updated: 12 days ago • Promoted

Research Engineer – Generative AI (LLMs)

Abacus.AI • India, India

Research Engineer – Generative AI (LLMs).AI is a leading Generative AI company building a future where AI assists and automates most work and business processes for enterprises and professionals.We...Show more

Last updated: 1 day ago • Promoted