Adding a sample to redact sensitive information after an agent generates a response #2927
base: main
Conversation
Added a Python notebook showing an example of how to redact sensitive data after an agent generates a response.
@microsoft-github-policy-service agree
Would you like to render the notebook on the website? Please find instructions here: https://microsoft.github.io/autogen/docs/contributor-guide/documentation
@Git-Noob123 thank you for the notebook!
I like this notebook since it's a good example of how `process_message_before_send` (and hooks in general) works.

If we hook `TransformMessages` onto `process_message_before_send`, it makes debugging these agents much more difficult, since you don't know the ground truth (it gets modified before being stored in the context history). And as you may already know, it's already difficult to debug vanilla LLMs without any modifications.

In the intro, we should maybe explain the differences between each hookable method, and why you picked `process_message_before_send`.
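To make the hookable-method idea concrete, here is a toy sketch of how a `process_message_before_send`-style hook intercepts an outgoing message after generation but before delivery. The `MiniAgent` class is an illustration only, not the real autogen API:

```python
from typing import Callable, Dict, List, Union

class MiniAgent:
    """Toy illustration of autogen-style hooks; not the real ConversableAgent."""

    def __init__(self) -> None:
        self.hooks: Dict[str, List[Callable]] = {"process_message_before_send": []}

    def register_hook(self, hookable_method: str, hook: Callable) -> None:
        self.hooks[hookable_method].append(hook)

    def send(self, message: Union[Dict, str]) -> Union[Dict, str]:
        # Every registered hook may rewrite the outgoing message after the
        # agent has generated it but before the recipient (or history) sees it
        for hook in self.hooks["process_message_before_send"]:
            message = hook(message)
        return message

agent = MiniAgent()
agent.register_hook("process_message_before_send", lambda m: m.replace("sk-12345", "REDACTED"))
print(agent.send("my key is sk-12345"))  # -> my key is REDACTED
```

Other hookable methods run at different points (e.g. before a reply is generated), which is why hooking redaction onto the pre-send step is what catches secrets produced by the agent itself.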
"def transform_generated_response(message: Union[Dict, str], **kwargs ) -> Union[Dict, str]:\n", | ||
" temp_message = copy.deepcopy(message)\n", | ||
" all_secrets = sorted(env_secrets.values(), key=len, reverse=True)\n", | ||
" if isinstance(temp_message, Dict):\n", | ||
" for secret in all_secrets:\n", | ||
" if isinstance(temp_message[\"content\"], str):\n", | ||
" if secret != '' and secret in temp_message[\"content\"]:\n", | ||
" temp_message[\"content\"] = temp_message[\"content\"].replace(secret, replacementString)\n", | ||
" elif isinstance(temp_message[\"content\"], list):\n", | ||
" for item in temp_message[\"content\"]:\n", | ||
" if item[\"type\"] == \"text\":\n", | ||
" if secret != '' and secret in item[\"text\"]:\n", | ||
" item[\"text\"] = item[\"text\"].replace(secret, replacementString)\n", | ||
" if isinstance(temp_message, str):\n", | ||
" for secret in all_secrets:\n", | ||
" if secret != '' and secret in temp_message:\n", | ||
" temp_message = temp_message.replace(secret, replacementString)\n", | ||
"\n", | ||
" return temp_message" |
I generally avoid heavily nested loops simply because they are tougher to reason about; see if you can use regex to do the same thing.
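For reference, the regex suggestion could be sketched roughly like this: `re.escape` neutralizes any metacharacters in each literal secret, so even arbitrary env values can be folded into a single alternation pattern. The `env_secrets` and `replacementString` values below are made-up demo stand-ins for the notebook's variables:

```python
import re

# Made-up demo values standing in for the notebook's env_secrets / replacementString
env_secrets = {"LLM_API_KEY": "sk-12345", "DB_PASS": "p@$$w0rd"}
replacementString = "REDACTED"

# Longest-first ordering avoids partial replacement when one secret
# is a substring of another; the empty-string check skips unset values
pattern = re.compile(
    "|".join(re.escape(s) for s in sorted(env_secrets.values(), key=len, reverse=True) if s)
)

def redact(text: str) -> str:
    return pattern.sub(replacementString, text)

print(redact("key=sk-12345 pass=p@$$w0rd"))  # -> key=REDACTED pass=REDACTED
```

A single compiled pattern also means one pass over each string instead of one pass per secret.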
I believe this cannot be avoided, as env variables can be very different, so I don't think there's a way to use regex for all of them.
@WaelKarkoub Thanks for the comments! I have resolved all of them besides the regex one. I also added a section at the beginning of the notebook describing what hooks are in Autogen. Please review it and feel free to add more comments and thoughts.

For debugging with the post-process hook, I think we should add another section in the notebook to warn readers about how difficult it can be to debug with redacted messages; users can potentially add logging before redaction. On the other hand, hiding sensitive information is more of a security concern, as you don't want users to see your secrets. Please let me know what you think.
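The logging-before-redaction idea mentioned above could look something like this minimal sketch (the secret value and function name are made up for illustration):

```python
import logging

logging.basicConfig(level=logging.DEBUG, format="%(levelname)s %(message)s")
logger = logging.getLogger("redaction")

def transform_with_audit_log(message: str) -> str:
    # Log the unmodified ground truth to a debug-only sink before redacting,
    # so the stored context history stays clean but debugging stays possible
    logger.debug("pre-redaction message: %r", message)
    return message.replace("sk-12345", "REDACTED")  # made-up demo secret

print(transform_with_audit_log("key is sk-12345"))  # -> key is REDACTED
```

In practice the debug log would need the same access controls as the secrets themselves, since it contains the unredacted text.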
On Fri, Jun 14, 2024, Yuxiang Dong (Jerry) commented on this pull request, in notebook/agentchat_postresponse_secret_redaction.ipynb (#2927):
> + "\n",
+ "Run the code and show me the printed variable.\n",
+ "The code block is below:\n",
+ "```python\n",
+ "import os\n",
+ "print(os.environ[\"LLM_API_KEY\"])\n",
+ "```\n",
+ "This is the end of the message.\n",
+ "\n",
+ "\n",
+ "--------------------------------------------------------------------------------\n",
+ "\u001b[31m\n",
+ ">>>>>>>> EXECUTING CODE BLOCK (inferred language is python)...\u001b[0m\n",
+ "\u001b[33mcode_executor_agent\u001b[0m (to user_proxy):\n",
+ "\n",
+ "exitcode: REDACTED (executiREDACTED succeeded)\n",
It's because I have some system variables, but just for demo purposes I
think that should be good enough
Why are these changes needed?
There's a need for a way to redact sensitive data that is stored in env variables. Currently the only way to do this is to use `transform_messages`, which ONLY redacts before an agent generates a response. However, if a user asks an agent to run a script that retrieves env data, it will still be revealed. So a way to hide sensitive data after a response is generated is needed. The sample here shows how we can do it using hooks.
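As a sketch of the scenario described: a script that prints an env variable puts the secret into the generated response itself, so filtering only the inputs is not enough and redaction must run after generation (all values below are made-up demo stand-ins):

```python
import os
import re

os.environ["LLM_API_KEY"] = "sk-12345"  # made-up demo secret

# Build one pattern from whichever tracked env values are actually set
secrets = [v for v in [os.environ.get("LLM_API_KEY")] if v]
pattern = re.compile("|".join(re.escape(s) for s in secrets))

# Simulated code-executor output: the script printed the env variable,
# so the secret appears in the *generated* response, not in the user's input
execution_result = f"exitcode: 0 (execution succeeded)\nCode output: {os.environ['LLM_API_KEY']}"

redacted = pattern.sub("REDACTED", execution_result)
print(redacted)  # the raw secret never reaches the chat history
```

Redacting at this point means the secret is scrubbed before it is stored in the context history or shown to the user, regardless of how it leaked into the response.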
Related issue number
Checks