enguard/medium-guard-128m-xx-prompt-jailbreak-binary-in-the-wild Text Classification • Updated about 1 month ago • 8
enguard/medium-guard-128m-xx-general-politeness-binary-intel Text Classification • Updated about 1 month ago • 4
enguard/medium-guard-128m-xx-prompt-toxicity-binary-jigsaw Text Classification • Updated about 1 month ago • 4
enguard/medium-guard-128m-xx-prompt-safety-binary-nvidia-aegis Text Classification • Updated about 1 month ago • 4
enguard/medium-guard-128m-xx-response-safety-binary-nvidia-aegis Text Classification • Updated about 1 month ago • 10
enguard/medium-guard-128m-xx-prompt-safety-binary-polyguard Text Classification • Updated about 1 month ago • 2
enguard/medium-guard-128m-xx-prompt-safety-multilabel-polyguard Text Classification • Updated about 1 month ago • 3
enguard/medium-guard-128m-xx-response-refusal-binary-polyguard Text Classification • Updated about 1 month ago • 4
enguard/medium-guard-128m-xx-response-safety-binary-polyguard Text Classification • Updated about 1 month ago • 6
enguard/medium-guard-128m-xx-response-safety-multilabel-polyguard Text Classification • Updated Nov 3 • 3
enguard/medium-guard-128m-xx-prompt-jailbreak-binary-sok Text Classification • Updated about 1 month ago • 5
enguard/medium-guard-128m-xx-prompt-harmfulness-binary-mix Text Classification • Updated about 1 month ago • 4
enguard/medium-guard-128m-xx-general-politeness-multiclass-intel Text Classification • Updated about 1 month ago • 1
enguard/medium-guard-128m-xx-prompt-harassment-binary-moderation Text Classification • Updated about 1 month ago • 4
enguard/medium-guard-128m-xx-prompt-response-safety-binary-nvidia-aegis Text Classification • Updated about 1 month ago • 2
enguard/medium-guard-128m-xx-general-safety-education-binary-guardset Text Classification • Updated about 1 month ago • 5
enguard/medium-guard-128m-xx-general-safety-hr-binary-guardset Text Classification • Updated about 1 month ago • 6
enguard/medium-guard-128m-xx-general-safety-social-media-binary-guardset Text Classification • Updated about 1 month ago • 6
enguard/medium-guard-128m-xx-prompt-response-safety-binary-guardset Text Classification • Updated about 1 month ago • 3
enguard/medium-guard-128m-xx-prompt-safety-binary-guardset Text Classification • Updated about 1 month ago • 2
enguard/medium-guard-128m-xx-prompt-safety-cyber-binary-guardset Text Classification • Updated about 1 month ago • 1
enguard/medium-guard-128m-xx-prompt-safety-finance-binary-guardset Text Classification • Updated about 1 month ago • 6
enguard/medium-guard-128m-xx-prompt-safety-law-binary-guardset Text Classification • Updated about 1 month ago • 2
enguard/medium-guard-128m-xx-response-safety-binary-guardset Text Classification • Updated about 1 month ago • 2
enguard/medium-guard-128m-xx-response-safety-cyber-binary-guardset Text Classification • Updated about 1 month ago • 7
enguard/medium-guard-128m-xx-response-safety-finance-binary-guardset Text Classification • Updated about 1 month ago • 6
enguard/medium-guard-128m-xx-response-safety-law-binary-guardset Text Classification • Updated about 1 month ago • 8
enguard/medium-guard-128m-xx-prompt-harmfulness-binary-moderation Text Classification • Updated about 1 month ago • 3
enguard/medium-guard-128m-xx-prompt-harmfulness-multilabel-moderation Text Classification • Updated about 1 month ago • 2