AI Safety Specialist
We are seeking an experienced AI safety specialist to join our team. In this role, you will rigorously test and evaluate AI-generated content to identify vulnerabilities and assess risks.
Key Responsibilities:
Conduct Red Teaming Exercises: Conduct adversarial testing of large language models (LLMs) to identify harmful or unsafe outputs.
Evaluate AI Prompts: Evaluate and stress-test AI prompts across multiple domains to uncover potential failure modes.
Develop Test Cases: Develop and apply test cases to assess accuracy, bias, toxicity, hallucinations, and misuse potential in AI-generated responses.
Collaborate with Teams: Collaborate with data scientists, safety researchers, and prompt engineers to report risks and suggest mitigations.
Perform QA and Content Validation: Perform manual quality assurance and content validation across model versions to ensure factual consistency, coherence, and guideline adherence.
Create Evaluation Frameworks: Create evaluation frameworks and scoring rubrics for prompt performance and safety compliance.
Document Findings: Document findings, edge cases, and vulnerability reports with high clarity and structure.
What We Offer:
A Dynamic Work Environment: Join a dynamic team of professionals working at the forefront of AI safety.
Professional Growth Opportunities: Opportunities to develop your skills and expertise in AI safety and related fields.
A Competitive Salary: A competitive salary and benefits package.