Frontier AI is accelerating research and driving innovation. We seek accomplished IT and Computer Science graduates to advance Large Language Model evaluation in computing and technical reasoning.
Key responsibilities include :
- Designing structured tasks to test AI performance in algorithms, software, and IT systems
- Developing evaluation questions in programming, algorithms, networking, databases, and cybersecurity
- Creating structured coding tasks and datasets with clear answers
- Evaluating AI solutions for accuracy, efficiency, and rigor
- Documenting failures and proposing expert solutions
Required Skills and Qualifications :
IT and Computer Science graduatesExperience with LLM evaluation in computing and technical reasoningStrong understanding of algorithms, software, and IT systemsAbility to design and develop structured tasks and evaluation questionsExcellent communication and documentation skillsBenefits :
Opportunity to work with cutting-edge AI technologyCollaborative and dynamic work environmentProfessional development and growth opportunities