About Turing:
Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, and top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.
Role Overview:
This role is central to advancing AI agent capabilities beyond current performance benchmarks.
- Analyze example questions and guidelines to determine the core skill being tested (e.g., complex reasoning, multi-source synthesis, nuance detection).
- Create entirely new questions on the same topic and with similar complexity to the example, ensuring the new challenge requires deep resourcefulness and avoids simple recall or pattern matching.
- Develop an accurate and comprehensive "Ground Truth" answer for the newly created question. This answer must serve as the gold standard for AI performance.
- Design a detailed Checklist to evaluate the quality of an answer. This checklist must be precise, quantifiable, and outline all necessary components for a "successful" response, including criteria for accuracy, completeness, logical flow, and resource citation / synthesis.
- Obtain and document the responses to the newly created question from leading large language models (e.g., ChatGPT 5 and Claude Sonnet 4.5).
Requirements:
- Proven experience in technical writing, content creation, curriculum design, or AI data labeling / review.
- Exceptional analytical and critical thinking skills with the ability to deconstruct complex problems into core logical components.
- Mastery of synthesis: demonstrated ability to accurately and concisely combine information from multiple, potentially conflicting, sources.
- Meticulous attention to detail - for generating both high-quality questions and error-free, comprehensive Ground Truth answers.
- Deep understanding of Large Language Models (LLMs) and the common failure modes (e.g., hallucination, superficial answers, lack of depth).
- Ability to strictly adhere to complex guidelines and quality control standards.
Perks of Freelancing With Turing:
- Competitive compensation based on experience and expertise.
- Flexible working hours and remote work environment.
- Opportunity to work on cutting-edge AI projects with leading LLM companies.
- Potential for contract extension based on performance and project needs.
Offer Details:
- Commitments required: Availability of up to 40 hours per week is preferred.
- Engagement type: Contractor assignment / freelancer (no medical / paid leave).
- Duration of contract: 1 month.
- This role will require some overlap with UTC-8:00 (America/Los_Angeles), 2-5 hrs/day.
Application Process:
Once you clear the assessment, you are ready to go!
Know amazing talent? Refer them at turing.com/referrals and earn money from your network.