
Staging to main: Fix bug in MAP and added new notebook programmatic execution #2035

Merged: 30 commits merged into main from staging on Dec 23, 2023

Conversation

miguelgfierro

Description

Related Issues

References

Checklist:

  • I have followed the contribution guidelines and code style for this project.
  • I have added tests covering my contributions.
  • I have updated the documentation accordingly.
  • This PR is being made to staging branch and not to main branch.

miguelgfierro and others added 20 commits October 30, 2023 22:59
Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: miguelgfierro <[email protected]>
* Not triggering unit tests on Draft PR

Signed-off-by: Jun Ki Min <[email protected]>

* Change a PR-triggering file to test

Signed-off-by: Jun Ki Min <[email protected]>

---------

Signed-off-by: Jun Ki Min <[email protected]>
* Announcement LF

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Update email

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Update README.md

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* security

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* license and contribution notice

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* update author link

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Add new code of conduct from LF

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Replacing references GRU4Rec to GRU

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Replacing references GRU4Rec to GRU

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Replacing references GRU4Rec in config files

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Update references

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Delete conda.md

Signed-off-by: Jun Ki Min <[email protected]>

* refactor map_at_k and map to be the same as Spark's

Signed-off-by: Jun Ki Min <[email protected]>

* list of test failing to fix

Signed-off-by: Jun Ki Min <[email protected]>

* Update readme LF feedback @wutaomsft

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Update NEWS.md

Co-authored-by: Andreas Argyriou <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Update README.md

Co-authored-by: Andreas Argyriou <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Fix test errors, Refactor column check utils to be simpler

Signed-off-by: Jun Ki Min <[email protected]>

* Rename ranking tests to be _at_k suffixed

Signed-off-by: Jun Ki Min <[email protected]>

* Change test names in the test group

Signed-off-by: Jun Ki Min <[email protected]>

* add comment to mocked fn in a test

Signed-off-by: Jun Ki Min <[email protected]>

* 📝

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* remove unused input

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* 📝

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* no need to output the logs twice

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* packages

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* skipping flaky test

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Issue with TF

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Comment out the PR gate affected tests with the upgrade to TF>2.10.1

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Comment out the nightly builds affected tests with the upgrade to TF>2.10.1

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* 🐛

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Comment out the nightly builds affected tests with the upgrade to TF>2.10.1

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* revert the breaking tests with TF 2.10.1

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* temporary pin to TF=2.8.4

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Update security tests

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>

* Update expected values to not use fixture

Signed-off-by: Jun Ki Min <[email protected]>

* list of test failing to fix

Signed-off-by: Jun Ki Min <[email protected]>

* Fix missing fixture error

Signed-off-by: Jun Ki Min <[email protected]>

---------

Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: Jun Ki Min <[email protected]>
Co-authored-by: miguelgfierro <[email protected]>
Co-authored-by: Andreas Argyriou <[email protected]>
Co-authored-by: Miguel Fierro <[email protected]>
Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: miguelgfierro <[email protected]>
Signed-off-by: miguelgfierro <[email protected]>
@miguelgfierro

@loomlike I think some of the tests in the nightly builds haven't been updated:

@pytest.mark.notebooks
@pytest.mark.parametrize(
    "size, expected_values",
    [
        (
            "1m",
            {
                "map": 0.060579,
                "ndcg": 0.299245,
                "precision": 0.270116,
                "recall": 0.104350,
            },
        ),
        (
            "10m",
            {
                "map": 0.098745,
                "ndcg": 0.319625,
                "precision": 0.275756,
                "recall": 0.154014,
            },
        ),
    ],
)
def test_sar_single_node_functional(
    notebooks, output_notebook, kernel_name, size, expected_values
):
    notebook_path = notebooks["sar_single_node"]
    pm.execute_notebook(
        notebook_path,
        output_notebook,
        kernel_name=kernel_name,
        parameters=dict(TOP_K=10, MOVIELENS_DATA_SIZE=size),
    )
    results = sb.read_notebook(output_notebook).scraps.dataframe.set_index("name")[
        "data"
    ]

    for key, value in expected_values.items():
>       assert results[key] == pytest.approx(value, rel=TOL, abs=ABS_TOL)
E       assert 0.1850985029206066 == 0.060579 ± 5.0e-02
E         comparison failed
E         Obtained: 0.1850985029206066
E         Expected: 0.060579 ± 5.0e-02

@@ -194,7 +181,7 @@ def test_python_exp_var(rating_true, rating_pred):
         rating_pred=rating_true,
         col_prediction=DEFAULT_RATING_COL,
     ) == pytest.approx(1.0, TOL)
-    assert exp_var(rating_true, rating_pred) == pytest.approx(-6.4466, TOL)
+    assert exp_var(rating_true, rating_pred) == pytest.approx(-6.4466, 0.01)
@anargyri Nov 7, 2023

Does this mean we now get something between -6.38 and -6.51? It looks a bit strange, since nothing changed with explained variance, did it?

@loomlike Nov 8, 2023

I think this happened when I copied the expected output values from the fixture over to each test. This shouldn't have changed. I'll revert the code to what we had. Thank you for the good catch!

-        evaluator2 = SparkRatingEvaluation(df_true, df_pred)
-        assert evaluator2.exp_var() == target_metrics["exp_var"]
+        evaluator = SparkRatingEvaluation(df_true, df_pred)
+        assert evaluator.exp_var() == pytest.approx(-6.4466, 0.01)

Same question here. So both the Spark and Python evaluations now give slightly different results. Maybe something changed in the input dataframes?


This should use the same TOL for consistency.
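For context (an illustration, not code from the PR): the second positional argument of pytest.approx is a relative tolerance, so pytest.approx(-6.4466, 0.01) accepts values within roughly 1% of -6.4466. The sketch below mimics that accept/reject behavior with the standard library's math.isclose:

```python
import math

# pytest.approx(-6.4466, 0.01) treats 0.01 as a *relative* tolerance.
# math.isclose with rel_tol=0.01 shows the same accept/reject behavior
# for these values.
expected = -6.4466

assert math.isclose(-6.40, expected, rel_tol=0.01)      # within 1% of expected
assert not math.isclose(-6.30, expected, rel_tol=0.01)  # outside 1%
```

Using a shared TOL constant rather than a hardcoded 0.01 keeps all the metric tests under one tolerance policy, which is the consistency concern here.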

@loomlike loomlike self-assigned this Nov 8, 2023
@loomlike

@miguelgfierro To fix this issue, I need to change map_at_k() in the notebooks to use map(), which will conflict with your changes in PR #2031 where you're removing the glue parts:

# Record results with papermill for tests
import scrapbook as sb
sb.glue("map", rank_eval.map_at_k())  --> rank_eval.map()
...

So I think it would be good to wait until your PR gets merged into staging first and then fix the issues, if that's okay.

@miguelgfierro

So I think it would be good to wait until your PR gets merged into staging first and then fix the issues, if that's okay.

Got it, makes sense

@loomlike

@SimonYansenZhao Hi Simon, I found an issue in the execute_notebook function that I wanted you to confirm:

        if (
            "tags" in cell.metadata
            and "parameters" in cell.metadata["tags"]
            and cell.cell_type == "code"
        ):
            cell_source = cell.source
            modified_cell_source = (
                cell_source  # Initialize a variable to hold the modified source
            )
            for param, new_value in parameters.items():
                if (
                    isinstance(new_value, str)
                    and not (new_value.startswith('"') and new_value.endswith('"'))
                    and not (new_value.startswith("'") and new_value.endswith("'"))
                ):
                    # Check if the new value is a string and surround it with quotes if necessary
                    new_value = f'"{new_value}"'
                # # Check if the new value is a string and surround it with quotes if necessary
                # if isinstance(new_value, str):
                #     new_value = f'"{new_value}"'
                # Define a regular expression pattern to match parameter assignments and ignore comments
                pattern = re.compile(
                    rf"(\b{param})\s*=\s*([^#\n]+)(?:#.*$)?",
                    re.MULTILINE
                    # rf"\b{param}\s*=\s*([^\n]+)\b"
                )
                modified_cell_source = pattern.sub(rf"\1 = {new_value}", cell_source)  # <-- Here

In the code above, modified_cell_source = pattern.sub(rf"\1 = {new_value}", cell_source) always applies the substitution to the original cell_source. This means each parameter value is substituted into the original cell source string again, rather than into the result of the previous substitution (modified_cell_source). As a consequence, modified_cell_source ends up containing only the last parameter value that was updated.

To test and fix this, I changed the code to apply the pattern to modified_cell_source cumulatively, and added a test case (to do so, I pulled that part into a separate function, as I suggested earlier).

If those changes make sense to you, or if you have any comments, please let me know!
The changes are in the jumin/fix_nightly branch.

* Revert tests tolerance
* Fix notebook parameter parsing
* Add notebook utils tests to test groups
* Fix notebooks
* Fix notebook unit tests
* Update evaluation metrics name map. Handle None for exp_var
* Fix smoke tests
* cleanup
* Fix functional test errors
* make notebook parameter update function to be private
* Fix benchmark notebook bug
* fix remaining bugs
---------

Signed-off-by: Jun Ki Min <[email protected]>
@miguelgfierro miguelgfierro changed the title Staging to main: Fix bug in MAP Staging to main: Fix bug in MAP and added new notebook programmatic execution Dec 22, 2023
…t_cell

Fix benchmarks last cell to store value, not [value]

@loomlike loomlike left a comment


Tests are all greeeeeeen!

@miguelgfierro

Vamos!!!!

@miguelgfierro miguelgfierro merged commit 0d9d7c7 into main Dec 23, 2023
45 checks passed