PF Recording/Replaying part for sdk_cli #719

crazygao · 2023-10-11T07:15:57Z

Description

Please add an informative description that covers that changes made by the pull request and link all relevant issues.

Recording functionalities

PF_RECORDING_MODE is the key env var to control this test feature.
Record mode:
- Get node run info (currently llm node), and save the info in the following key value pair
- Key: Ordered dict of all inputs => sha1 hash value
- Value: base64 of output value.
Replay mode:
- hijack all llm nodes with customized tool, it calculate the hash of inputs, and get outputs.

Generated recording files will save in test_config/node_recordings folder

Some tricks

for youtube urls add the following will enforce the language to German &hl=de&persist_hl=1
Some output from url is not stable, mark these tests with Instable tests.

Time estimation
166 Tests run in 3min51s, total 226 tests estimate 5min18s
Most of the time is from fetching URLs.

About techniques of copying method and inject:
Currently I didn't get good ways to reuse the original method and make some patches to the method. All these mocked functions are copied and changed separately.

All Promptflow Contribution checklist:

The pull request does not introduce [breaking changes].
CHANGELOG is updated for new features, bug fixes or other significant changes.
I have read the contribution guidelines.
Create an issue and link to the pull request to get dedicated review from promptflow team. Learn more: suggested workflow.

General Guidelines and Best Practices

Title of the pull request is clear and informative.
There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

Pull request includes test coverage for the included changes.

github-actions · 2023-10-11T07:19:37Z

SDK CLI Global Config Test Result yigao/recording_draft

2 tests 2 ✔️ 1m 3s ⏱️
1 suites 0 💤
1 files 0 ❌

Results for commit d3b4c4a.

♻️ This comment has been updated with latest results.

github-actions · 2023-10-11T07:20:41Z

Executor Unit Test Result yigao/recording_draft

413 tests 413 ✔️ 48s ⏱️
    1 suites     0 💤
    1 files     0 ❌

Results for commit d3b4c4a.

♻️ This comment has been updated with latest results.

github-actions · 2023-10-11T07:21:11Z

SDK PFS E2E Test Result yigao/recording_draft

8 tests 8 ✔️ 57s ⏱️
1 suites 0 💤
1 files 0 ❌

Results for commit d3b4c4a.

♻️ This comment has been updated with latest results.

github-actions · 2023-10-11T07:23:17Z

Executor E2E Test Result yigao/recording_draft

124 tests 122 ✔️ 3m 32s ⏱️
    1 suites     2 💤
    1 files     0 ❌

Results for commit d3b4c4a.

♻️ This comment has been updated with latest results.

github-actions · 2023-10-11T07:31:02Z

SDK CLI Test Result yigao/recording_draft

320 tests 308 ✔️ 23m 16s ⏱️
    1 suites   11 💤
    1 files     1 ❌

For more details on these failures, see this check.

Results for commit d3b4c4a.

♻️ This comment has been updated with latest results.

wangchao1230 · 2023-10-30T06:38:22Z

src/promptflow/tests/sdk_cli_test/recording_utilities/mocked_functions.py

+
+def mock_bulkresult_get_openai_metrics(self):
+    # Some tests request the metrics in replay mode.
+    total_metrics = {"total_tokens": 0, "duration": 0}


why is this return a constant ?

some of the tests are testing against these constants.

example:

since our tool doesn't have a real llm connection, we need to mock these returns.

src/promptflow/tests/sdk_cli_test/recording_utilities/tool_record.py

src/promptflow/tests/sdk_cli_test/e2etests/test_cli.py

src/promptflow/tests/sdk_cli_test/recording_utilities/tool_record.py

src/promptflow/tests/test_configs/flows/web_classification/webClassification20.csv

wangchao1230 · 2023-10-30T09:15:32Z

src/promptflow/tests/sdk_cli_test/recording_utilities/mocked_functions.py

+    result = {}
+    for n in connection_names:
+        try:
+            conn = client.connections.get(name=n, with_secrets=True)


why do we need to mock this?

Good Point. After replaying, this is useless.

wangchao1230 · 2023-10-30T09:16:07Z

src/promptflow/tests/sdk_cli_test/recording_utilities/mocked_functions.py

+    return total_metrics
+
+
+def mock_toolresolver_resolve_tool_by_node(recording_file: Path):


add docstirng for why we need these fucntions

crazygao added 2 commits October 9, 2023 03:27

TEMP PASS

523c02e

Current Fix

aa959ee

crazygao requested a review from a team as a code owner October 11, 2023 07:15

crazygao temporarily deployed to internal October 11, 2023 07:16 — with GitHub Actions Inactive

github-actions bot added sdk prompt flow SDK promptflow labels Oct 11, 2023

Fix

37229ad

crazygao temporarily deployed to internal October 13, 2023 03:10 — with GitHub Actions Inactive

Remove ununsed items

9973fa0

crazygao temporarily deployed to internal October 13, 2023 06:19 — with GitHub Actions Inactive

crazygao added 2 commits October 13, 2023 10:20

Fix

d311941

Merge branch 'main' into yigao/recording_draft

e52fc0e

crazygao temporarily deployed to internal October 13, 2023 11:05 — with GitHub Actions Inactive

wangchao1230 reviewed Oct 30, 2023

View reviewed changes

src/promptflow/tests/sdk_cli_test/recording_utilities/tool_record.py Show resolved Hide resolved

wangchao1230 reviewed Oct 30, 2023

View reviewed changes

src/promptflow/tests/sdk_cli_test/e2etests/test_cli.py Outdated Show resolved Hide resolved

wangchao1230 reviewed Oct 30, 2023

View reviewed changes

src/promptflow/tests/sdk_cli_test/recording_utilities/tool_record.py Outdated Show resolved Hide resolved

wangchao1230 reviewed Oct 30, 2023

View reviewed changes

src/promptflow/tests/sdk_cli_test/recording_utilities/tool_record.py Outdated Show resolved Hide resolved

wangchao1230 reviewed Oct 30, 2023

View reviewed changes

src/promptflow/tests/sdk_cli_test/recording_utilities/tool_record.py Outdated Show resolved Hide resolved

wangchao1230 reviewed Oct 30, 2023

View reviewed changes

src/promptflow/tests/test_configs/flows/web_classification/webClassification20.csv Show resolved Hide resolved

Fix Comments

4ff7d8d

crazygao dismissed zhengfeiwang’s stale review via 4ff7d8d October 30, 2023 08:26

crazygao temporarily deployed to internal October 30, 2023 08:26 — with GitHub Actions Inactive

Fix nit

3a36c7e

crazygao temporarily deployed to internal October 30, 2023 08:31 — with GitHub Actions Inactive

Merge branch 'main' into yigao/recording_draft

d3b4c4a

crazygao temporarily deployed to internal October 30, 2023 08:33 — with GitHub Actions Inactive

wangchao1230 reviewed Oct 30, 2023

View reviewed changes

wangchao1230 closed this Nov 13, 2023

crazygao deleted the yigao/recording_draft branch May 14, 2024 09:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PF Recording/Replaying part for sdk_cli #719

PF Recording/Replaying part for sdk_cli #719

crazygao commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading

wangchao1230 Oct 30, 2023

crazygao Oct 30, 2023

crazygao Oct 30, 2023

wangchao1230 Oct 30, 2023

crazygao Oct 30, 2023

wangchao1230 Oct 30, 2023

		return total_metrics


		def mock_toolresolver_resolve_tool_by_node(recording_file: Path):

PF Recording/Replaying part for sdk_cli #719

PF Recording/Replaying part for sdk_cli #719

Conversation

crazygao commented Oct 11, 2023 • edited Loading

Description

All Promptflow Contribution checklist:

General Guidelines and Best Practices

Testing Guidelines

github-actions bot commented Oct 11, 2023 • edited Loading

SDK CLI Global Config Test Result yigao/recording_draft

github-actions bot commented Oct 11, 2023 • edited Loading

Executor Unit Test Result yigao/recording_draft

github-actions bot commented Oct 11, 2023 • edited Loading

SDK PFS E2E Test Result yigao/recording_draft

github-actions bot commented Oct 11, 2023 • edited Loading

Executor E2E Test Result yigao/recording_draft

github-actions bot commented Oct 11, 2023 • edited Loading

SDK CLI Test Result yigao/recording_draft

wangchao1230 Oct 30, 2023

Choose a reason for hiding this comment

crazygao Oct 30, 2023

Choose a reason for hiding this comment

crazygao Oct 30, 2023

Choose a reason for hiding this comment

wangchao1230 Oct 30, 2023

Choose a reason for hiding this comment

crazygao Oct 30, 2023

Choose a reason for hiding this comment

wangchao1230 Oct 30, 2023

Choose a reason for hiding this comment

crazygao commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading

github-actions bot commented Oct 11, 2023 •

edited

Loading