📝 RLFH UX Skeleton #31

teresaqhoang · 2023-07-25T15:22:52Z

Motivation and Context

Added an RLHF UI to demonstrate that users can easily add this capability to their chatbot if they want to.

Description

This feature is for demonstration purposes only. We don't actually hook up to the server to store the human feedback or send it to the model.

Details:

RLHF actions will only show on the most recent chat message whose author is bot.
If the user takes an action, an icon will be rendered to reflect that action.
Data is only stored in frontend state. If the app refreshes, all RLHF feedback across all chats will be reset.
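To make the three rules above concrete, here is a minimal sketch of the frontend-only logic. The type names, the `userFeedback` field, and the helper functions are hypothetical and are not taken from the actual webapp code. It also assumes that "most recent" means the latest message authored by the bot, and that the action buttons give way to an icon once the user has acted.

```typescript
// Hypothetical types for illustration; the real chat-copilot models may differ.
type AuthorRole = 'user' | 'bot';
type UserFeedback = 'positive' | 'negative';

interface ChatMessage {
    id: string;
    authorRole: AuthorRole;
    content: string;
    // Undefined until the user takes an action on the message.
    userFeedback?: UserFeedback;
}

// Index of the most recent bot-authored message, or -1 if there is none.
function latestBotMessageIndex(messages: ChatMessage[]): number {
    for (let i = messages.length - 1; i >= 0; i--) {
        if (messages[i].authorRole === 'bot') return i;
    }
    return -1;
}

// Offer the feedback actions only on the latest bot message, and only until the user has acted.
function showFeedbackActions(messages: ChatMessage[], index: number): boolean {
    if (index < 0 || index >= messages.length) return false;
    return index === latestBotMessageIndex(messages) && messages[index].userFeedback === undefined;
}

// Pure state update: returns a new array with the feedback recorded on one message.
// Nothing is persisted, so a page refresh discards it.
function setFeedback(messages: ChatMessage[], id: string, feedback: UserFeedback): ChatMessage[] {
    return messages.map((m) => (m.id === id ? { ...m, userFeedback: feedback } : m));
}
```

Because `setFeedback` returns a new array rather than mutating its input, it could be dropped into a reducer or a `useState` setter as-is.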

Actions on chat message:

Once user takes actions:

Future work

Add ability to turn off RLHF in settings dialog + add help link to docs there

Contribution Checklist

The code builds clean without any errors or warnings
The PR follows the Contribution Guidelines and the pre-submission formatting script raises no violations
~~- [ ] All unit tests pass, and I have added new tests where possible~~
I didn't break anyone 😄

dehoward · 2023-07-25T16:54:39Z

if we were to send the feedback to the model, do we know if it would have any impact on the responses? without doing this, I'm not sure that this change aligns with our recently updated first principles given that it doesn't seem like we're currently completing the loop on the AI usage pattern.

webapp/src/components/shared/BundledIcons.tsx

teresaqhoang · 2023-07-25T17:49:25Z

> if we were to send the feedback to the model, do we know if it would have any impact on the responses? without doing this, I'm not sure that this change aligns with our recently updated first principles given that it doesn't seem like we're currently completing the loop on the AI usage pattern.

No, SK doesn't hook up to any feedback loop with the models, so this would be up to the developer to develop and expose their own endpoints to train their models.

This was a big push from John to showcase one of the first principles of LLMs in general, Human in Control (defined here: https://microsoft.sharepoint.com/teams/ExDAI/SitePages/GPT-UX-Guide.aspx), which aligns with our principles to showcase experimental usage patterns in Microsoft. Even though we don't complete the loop, it's beneficial to call out this principle of augmentation over automation. Planning to add a help link in a later PR, along with settings, that will detail this process more.

dehoward · 2023-07-25T18:55:34Z

> No, SK doesn't hook up to any feedback loop with the models, so this would be up to the developer to develop and expose their own endpoints to train their models.
>
> This was a big push from John to showcase one of the first principles of LLMs in general, Human in Control (defined here: https://microsoft.sharepoint.com/teams/ExDAI/SitePages/GPT-UX-Guide.aspx), which aligns with our principles to showcase experimental usage patterns in Microsoft. Even though we don't complete the loop, it's beneficial to call out this principle of augmentation over automation. Planning to add a help link in a later PR, along with settings, that will detail this process more.

ok yeah that makes sense. thanks for sharing this, video is very insightful.

> Planning to add a help link in later PR with settings that will detail this process more.

I think this will help a lot, especially if we can emphasize some of what you said here in that they will need to train their own models.

webapp/src/components/chat/chat-history/ChatHistoryItem.tsx

craigomatic · 2023-07-25T22:13:42Z

> No, SK doesn't hook up to any feedback loop with the models, so this would be up to the developer to develop and expose their own endpoints to train their models.

Is there a simple implementation we can bring in?

If we have buttons on the UI that are not wired up we are going to end up with people wondering why they aren't working.

A simple approach could be to use them as few-shot examples in conjunction with the chat message history? We could add a state to each message?
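One way the few-shot idea could look, as a rough sketch only: the `RatedMessage` shape, its `feedback` field, and both helpers below are hypothetical, and nothing here reflects how SK or the webapp actually builds prompts. It assumes each positively rated bot reply is paired with the user message immediately before it.

```typescript
// Hypothetical shapes for illustration only.
type Feedback = 'positive' | 'negative';

interface RatedMessage {
    authorRole: 'user' | 'bot';
    content: string;
    // The per-message state suggested in the comment; undefined means no feedback given.
    feedback?: Feedback;
}

interface FewShotExample {
    prompt: string;
    response: string;
}

// Pair each positively rated bot reply with the user message immediately before it.
function collectFewShotExamples(history: RatedMessage[], limit = 3): FewShotExample[] {
    const examples: FewShotExample[] = [];
    for (let i = 1; i < history.length; i++) {
        const reply = history[i];
        const previous = history[i - 1];
        if (reply.authorRole === 'bot' && reply.feedback === 'positive' && previous.authorRole === 'user') {
            examples.push({ prompt: previous.content, response: reply.content });
        }
    }
    // Keep only the most recent examples to bound the prompt size.
    return limit > 0 ? examples.slice(-limit) : [];
}

// Render the examples as a plain-text block that could be prepended to the prompt.
function renderFewShotBlock(examples: FewShotExample[]): string {
    return examples.map((e) => `User: ${e.prompt}\nBot: ${e.response}`).join('\n\n');
}
```

Negative feedback is simply ignored in this sketch; whether to surface it to the model as a counter-example is a separate design question.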

📝 User Feedback Actions + fix intro alert

c7dd7ff

teresaqhoang self-assigned this Jul 25, 2023

teresaqhoang requested review from dehoward and TaoChenOSU July 25, 2023 15:23

teresaqhoang added the enhancement, webapp, and PR: ready for review labels Jul 25, 2023

glahaye reviewed Jul 25, 2023

webapp/src/components/shared/BundledIcons.tsx

dehoward reviewed Jul 25, 2023

webapp/src/components/chat/chat-history/ChatHistoryItem.tsx

dehoward approved these changes Jul 25, 2023

TaoChenOSU approved these changes Jul 25, 2023

dehoward added the PR: ready to merge label and removed the PR: ready for review label Jul 25, 2023

alliscode added this pull request to the merge queue Jul 25, 2023

Merged via the queue into microsoft:main with commit 46f48cc Jul 25, 2023

teresaqhoang deleted the rlfh-skeleton branch August 11, 2023 15:25

teresaqhoang linked an issue Aug 24, 2023 that may be closed by this pull request

Complete enhancement for chat copilot for customer demos #187

Closed
