Job Description
We are seeking accomplished professionals to advance Large Language Model (LLM) evaluation in computing and technical reasoning. The ideal candidate will design structured tasks that test AI performance in algorithms, software, and IT systems.
- Develop evaluation questions in programming, algorithms, networking, databases, and cybersecurity.
- Create structured coding tasks and datasets with clear answers
- Evaluate AI solutions for accuracy, efficiency, and rigor.
- Document failures and propose expert solutions.
Required Skills and Qualifications
The following skills and qualifications are necessary for this role :
Recent graduate in IT, Computer Science, or related field.Strong knowledge of coding and computer systems.Excellent English writing and analytical skillsBenefits
This position offers :
Competitive compensation based on experience and expertise.Flexible working hours and remote work environment.Opportunity to work on cutting-edge AI projects with leading LLM companies.Potential for contract extension based on performance and project needs.Additional Information
Please note the following details about this contract assignment :
Contract Duration : 1 monthMaximum of 30 hours / week is allowedThis contract assignment may require some overlap with UTC-8 : 00 (2-5 hrs / day) America / Los_Angeles. To be confirmed closer to the onboarding date.