No longer accepting applications

Technical Reviewer - RL Environment Terminal Benchmarking (Agentic AI)

Mercor • India

5 days ago

Job description

Mercor is hiring a Technical Reviewer on behalf of a leading AI lab to evaluate and refine benchmarking pipelines for reinforcement learning (RL) environments and agentic AI systems. In this role, you’ll be responsible for reviewing environment design, terminal conditions, and evaluation protocols to ensure accuracy, reproducibility, and fairness in benchmarking. You’ll work closely with researchers and engineers to provide technical feedback that strengthens experimental rigor and system reliability.
### You’re a great fit if you:
- Have a background in reinforcement learning, computer science, or applied AI research.
- Are experienced with RL environments.
- Understand benchmarking methodologies, terminal conditions, and evaluation metrics for RL tasks.
- Are comfortable reading and reviewing codebases in Python (PyTorch / TensorFlow a plus).
- Have strong critical thinking skills and can provide structured technical feedback.
- Care deeply about experimental reproducibility, fairness, and standardization in agentic AI.
- Are detail-oriented and capable of reviewing both theoretical formulations and implementation details.
### Primary Goal of This Role
To review, validate, and improve reinforcement learning environment benchmarking pipelines, ensuring that terminal conditions, evaluation metrics, and system behaviors are robust, reproducible, and aligned with agentic AI research goals.
### What You’ll Do
- Review RL environments and evaluate terminal conditions for correctness and consistency.
- Assess benchmarking pipelines for fairness, reproducibility, and alignment with research objectives.
- Provide structured technical feedback on code implementations and documentation.
- Collaborate with researchers to refine evaluation metrics and methodologies.
- Ensure reproducibility by validating results across different runs, seeds, and hardware setups.
- Document findings and recommend improvements for environment design and benchmarking standards.
### Why This Role Is Exciting
- You’ll directly influence the reliability of benchmarking in agentic AI research.
- You’ll work on cutting-edge RL environments that test the limits of intelligent agents.
- You’ll help establish standards for evaluation and reproducibility in a fast-moving field.
- You’ll collaborate with researchers shaping the future of agentic AI systems.
### Pay & Work Structure
- You’ll be classified as a full-time hourly contractor to Mercor.
- Paid weekly via Stripe Connect, based on hours logged.
- 40 hours / week commitment with flexible scheduling.
- Remote and flexible working style.
