-
Notifications
You must be signed in to change notification settings - Fork 218
Add WildGuard Guardrail Microservice #710
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add WildGuard Guardrail Microservice #710
Conversation
Signed-off-by: Daniel Deleon <[email protected]>
Signed-off-by: Daniel Deleon <[email protected]>
Signed-off-by: Daniel Deleon <[email protected]>
Signed-off-by: Daniel Deleon <[email protected]>
for more information, see https://pre-commit.ci
|
Hi @daniel-de-leon-user293 , please add the test script for WildGuard in |
|
Hi @daniel-de-leon-user293 , |
Signed-off-by: Daniel Deleon <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Daniel Deleon <[email protected]>
Signed-off-by: Daniel Deleon <[email protected]>
ashahba
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @daniel-de-leon-user293 for this PR!
I only have a few minor change requests.
tests/guardrails/test_guardrails_wild_guard_langchain_on_intel_hpu.sh
Outdated
Show resolved
Hide resolved
Signed-off-by: Daniel Deleon <[email protected]>
ashahba
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
|
@daniel-de-leon-user293 Consider adding documentation or commit message explaining why someone would use wildguard instead of llama guard. @mkbhanda does OPEA have criteria for when it will accept similar features? |
|
Thank you for your suggestion @dcmiddle. The latest commit adds a bit more description of wild guard in the README. To answer your question, we’re hoping to provide a diverse set of safety models that users can pick from. |
cool. so can this be used in conjunction with llama guard? From the description you added it looks like a complementary list of topics. |
|
No, it can be used in place of Llama Guard. The lists in the README are risk taxonomies that each model was trained to identify according to their respective datasets. Although similar, the models provide different classification performance for difference use cases. If a user wanted to design an ensemble of guardrails, however, then in that case, these two models could be used in conjunction. |
I'd assert it's essential to have a variety of options AND you need to know what you are looking for if you are building LLM apps where security+privacy+bias detection > ease of use |
* add wildguard microservice Signed-off-by: Daniel Deleon <[email protected]> * fix paths for wildguard dir Signed-off-by: Daniel Deleon <[email protected]> * add README Signed-off-by: Daniel Deleon <[email protected]> * add wildguard to guardrail README table Signed-off-by: Daniel Deleon <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updates per code review Signed-off-by: Daniel Deleon <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update tokens Signed-off-by: Daniel Deleon <[email protected]> * fix endpoint for testing Signed-off-by: Daniel Deleon <[email protected]> * fixed formating Signed-off-by: Daniel Deleon <[email protected]> * add more description of wildguard to distinguish from llamaguard Signed-off-by: Daniel Deleon <[email protected]> --------- Signed-off-by: Daniel Deleon <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Letong Han <[email protected]> Co-authored-by: Abolfazl Shahbazi <[email protected]> Co-authored-by: ZePan110 <[email protected]>
Description
Add WildGuard to guard against privacy, misinformation, harmful language and malicious use on user input prompts and/or output responses generated by LLMs.
Issues
n/a
Type of change
Dependencies
n/a
Tests
guardrails_tgi.pymicroservice script directly without containerdocker runCLIdocker compose