Build an automated red teaming framework for evaluating AI model safety, including jailbreak testing, bias detection, and harmful output assessment.
Specify the target model and evaluation focus. Start with the automated jailbreak tests, then expand to bias and privacy evaluations. Review all results manually before sharing reports.
Initial release
Sign in and download this prompt to leave a review.