Talent.com
Technical Reviewer - RL Environment Terminal Benchmarking (Agentic AI)
No longer accepting applications
Mercor • India
5 days ago
Job description

Mercor is hiring a Technical Reviewer on behalf of a leading AI lab to evaluate and refine benchmarking pipelines for reinforcement learning (RL) environments and agentic AI systems. In this role, you’ll be responsible for reviewing environment design, terminal conditions, and evaluation protocols to ensure accuracy, reproducibility, and fairness in benchmarking. You’ll work closely with researchers and engineers to provide technical feedback that strengthens experimental rigor and system reliability.

### You’re a great fit if you:
  • Have a background in reinforcement learning, computer science, or applied AI research.
  • Are experienced with RL environments.
  • Understand benchmarking methodologies, terminal conditions, and evaluation metrics for RL tasks.
  • Are comfortable reading and reviewing codebases in Python (PyTorch / TensorFlow a plus).
  • Have strong critical thinking skills and can provide structured technical feedback.
  • Care deeply about experimental reproducibility, fairness, and standardization in agentic AI.
  • Are detail-oriented and capable of reviewing both theoretical formulations and implementation details.

### Primary Goal of This Role
To review, validate, and improve reinforcement learning environment benchmarking pipelines, ensuring that terminal conditions, evaluation metrics, and system behaviors are robust, reproducible, and aligned with agentic AI research goals.

### What You’ll Do
  • Review RL environments and evaluate terminal conditions for correctness and consistency.
  • Assess benchmarking pipelines for fairness, reproducibility, and alignment with research objectives.
  • Provide structured technical feedback on code implementations and documentation.
  • Collaborate with researchers to refine evaluation metrics and methodologies.
  • Ensure reproducibility by validating results across different runs, seeds, and hardware setups.
  • Document findings and recommend improvements for environment design and benchmarking standards.

### Why This Role Is Exciting
  • You’ll directly influence the reliability of benchmarking in agentic AI research.
  • You’ll work on cutting-edge RL environments that test the limits of intelligent agents.
  • You’ll help establish standards for evaluation and reproducibility in a fast-moving field.
  • You’ll collaborate with researchers shaping the future of agentic AI systems.

### Pay & Work Structure
  • You’ll be classified as a full-time hourly contractor to Mercor.
  • Paid weekly via Stripe Connect, based on hours logged.
  • 40 hours / week commitment with flexible scheduling.
  • Remote and flexible working style.