Skip to content

Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts.

Notifications You must be signed in to change notification settings

sisl/ASTPrompter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

About

Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts.

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages