Add script to test nightly environments are solvable and using recent nightlies. #690

bdice · 2023-11-18T00:17:01Z

This PR adds tests that run on every integration PR and on a nightly basis to ensure that RAPIDS conda environments can be solved with recent packages. This will help us diagnose and react to problems arising from conda environment conflicts, which sometimes force conda to select older nightly builds of RAPIDS packages.

A partial solution for https://github.com/rapidsai/ops/issues/2947.

Next steps:

Wait for Fix build strings in nx-cugraph cugraph#4639 to merge.
After this PR is merged, merge Add integration test workflow. workflows#59 as well.

… nightlies.

jameslamb

I totally support this!

Think it'll be helpful for catching more cases similar to rapidsai/build-planning#14 and rapidsai/build-planning#69 where projects might otherwise silently be using different packages than we want.

jameslamb · 2024-08-05T16:18:26Z

ci/test_conda_nightly_env.sh

+    -c rapidsai-nightly \
+    -c conda-forge \
+    -c nvidia  \
+    rapids=${RAPIDS_VERSION} \


(was going to put this as my review, realized a thread would probably be better)

I totally support adding something like this, based on your description offline:

... to ensure that we are able to solve the full RAPIDS conda environment with recent packages (in other words, ensure there are no recent conflicts causing fallback to older conda packages)

But I don't think it offers 100% protection against the case described in https://github.com/rapidsai/ops/issues/2947.

That issue is not about just packages building against too-old versions of dependencies... it's about packages across RAPIDS building against very different versions of dependencies.

It looks to me that the code in this PR would catch cases like these:

"rmm nightlies haven't been published in the last 3 days"

"the latest versions of cugraph and pylibraft can't be installed in the same environment"

These aren't captured by the existing nightly tests at https://github.com/rapidsai/workflows/actions/workflows/nightly-pipeline.yaml.

But this wouldn't be guaranteed to catch a case like this:

"the latest cuml nightly built against an rmm from 5 days ago, but the latest cudf nightly built against an rmm from yesterday"

Because this test with the rapids package is solving across all of the packages' runtime dependencies, but they could have ended up building against older versions based on conflicts in their individual build environments, right?

And those types of conflicts might not show up here if we use pin_compatible(max_pin="x.x") in run dependencies, e.g. pylibraft==24.10.* is going to have a runtime dependency on rmm=24.10.* regardless of which specific nightly of rmm it pulled in at build time. (pylibraft nightly files)

I think detecting that other case would have to happen at build time (or by post-processing of logs from build time). And I don't know how complex that would be, so can't say with confidence that the complexity would be worth it.

I totally support the approach this PR is pursuing, just wanted to be sure to note this other possible avenue for version mismatches to get through.

You are correct about what this will and will not cover. I think it's worth pursuing because (1) it prevents runtime problems from being hidden and (2) sometimes runtime conflicts will also affect build environments, so this may give us a bit of signal into deeper problems happening at build time, should they arise.

Ok great! I totally support moving forward with this.

…htly-env

bdice · 2024-08-29T20:59:52Z

Yay! I'm happy with this now. It looks like this:

Packages that were built 1 or 2 days ago (but less than 3) are shown in yellow, as a warning that something might be broken in the nightly builds.

jameslamb

This looks great!

Although I'll note... I wasn't able to see the pretty colored output that you shared a screenshot of.

The 13,000+ lines of logs at
https://github.com/rapidsai/integration/actions/runs/10622584645/job/29447212735?pr=690 was too large for my browser to load. The majority of that was what looks like a dump of every conda package in the environment's metadata, in pretty-printed (one-line-per-key) JSON format

I was able to open the raw logs (link) and command-F and find the output from this job, but that's not the same

If you can think of a way to reduce the amount of output or to get just the color-coded results up into the job summary, I think it'd help.

.github/workflows/pr.yaml

.github/workflows/test.yaml

Companion PR to enable nightly testing for rapidsai/integration#690.

bdice · 2024-09-06T21:46:31Z

We can merge this once CI passes. I'm going to check the CI results manually so I'm avoiding a /merge for the moment.

bdice · 2024-09-07T15:18:58Z

This is failing due to a "real" problem now, I think.

Possibly related to rapidsai/cuspatial#1453 (comment).

bdice · 2024-09-10T20:47:42Z

/merge

This fixes failures in the new testing workflow from #690. **Update:** the root cause was that `rapids-conda-retry` is sending `2>&1`. The warning is being sent to stderr as intended. The old contents are partially incorrect. We can still solve this by providing `--quiet`, without needing to change `rapids-conda-retry`. <details><summary>Old issue contents</summary> Output like this is shown, even with the `--json` flag to conda: ``` ==> WARNING: A newer version of conda exists. <== current version: 24.9.0 latest version: 24.9.1 Please update conda by running $ conda update -n base -c conda-forge conda ``` The only way to make the output "proper JSON" is to pass `--quiet` as well. This seems like unintentional behavior from conda. The docs from `conda create --help` literally say: ``` --json Report all output as json. Suitable for using conda programmatically. ``` </details> Authors: - Bradley Dice (https://github.com/bdice) Approvers: - James Lamb (https://github.com/jameslamb) - Mike Sarahan (https://github.com/msarahan) URL: #729

bdice added 2 commits November 17, 2023 18:15

Add script to test nightly environments are solvable and using recent…

40ded38

… nightlies.

Clean up script.

551169f

jameslamb reviewed Aug 5, 2024

View reviewed changes

Merge remote-tracking branch 'upstream/branch-24.10' into dry-run-nig…

d78a83f

…htly-env

bdice changed the base branch from branch-23.12 to branch-24.10 August 29, 2024 17:09

bdice added 19 commits August 29, 2024 12:38

Add test-conda-nightly-env to pr.yaml for testing.

df6074e

Skip build jobs for testing.

98383f6

Update RAPIDS_VERSION.

d487411

Fix extension.

55da82b

Exclude some packages, and collapse conda dry-run output.

9d3a8b0

Use package name.

b323bfa

Clean up.

c245fc6

Skip codecov.

4af2f57

Exclude pynvjitlink, fail on missing date strings, show package dates.

0f4741f

Pretty-print dates.

d9478b4

Use build workflow to get CPU runners and high matrix coverage.

df76342

Improve formatting.

26e4143

Fix exclusions.

52c5308

Fix warning for day-old packages.

171644a

Fix exclusion logic.

a27c3fc

Skip packages without dates.

2a5c29c

Update colors.

4aeb2f5

Improve formatting.

15c2c9d

Remove nx-cugraph.

9b72086

bdice marked this pull request as ready for review August 29, 2024 21:00

bdice requested a review from a team as a code owner August 29, 2024 21:00

bdice requested a review from msarahan August 29, 2024 21:00

bdice added 2 commits August 29, 2024 16:02

Re-enable PR builds of integration packages.

925dd58

Add test workflow.

ede8cf2

bdice mentioned this pull request Aug 29, 2024

Add integration test workflow. rapidsai/workflows#59

Merged

bdice self-assigned this Aug 29, 2024

jameslamb approved these changes Aug 29, 2024

View reviewed changes

.github/workflows/pr.yaml Outdated Show resolved Hide resolved

.github/workflows/test.yaml Show resolved Hide resolved

vyasr pushed a commit to rapidsai/workflows that referenced this pull request Sep 3, 2024

Add integration test workflow. (#59)

8c5866c

Companion PR to enable nightly testing for rapidsai/integration#690.

Re-enable build job.

f76b29a

rapids-bot bot merged commit 1e59a93 into rapidsai:branch-24.10 Sep 10, 2024
21 checks passed

bdice mentioned this pull request Sep 10, 2024

Track conda-forge migrations with automated tooling rapidsai/build-planning#100

Open

bdice mentioned this pull request Oct 3, 2024

Quiet conda warnings. #729

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add script to test nightly environments are solvable and using recent nightlies. #690

Add script to test nightly environments are solvable and using recent nightlies. #690

bdice commented Nov 18, 2023 •

edited

Loading

jameslamb left a comment

jameslamb Aug 5, 2024

bdice Aug 5, 2024 •

edited

Loading

jameslamb Aug 5, 2024

bdice commented Aug 29, 2024 •

edited

Loading

jameslamb left a comment

bdice commented Sep 6, 2024

bdice commented Sep 7, 2024

bdice commented Sep 10, 2024

Add script to test nightly environments are solvable and using recent nightlies. #690

Add script to test nightly environments are solvable and using recent nightlies. #690

Conversation

bdice commented Nov 18, 2023 • edited Loading

jameslamb left a comment

Choose a reason for hiding this comment

jameslamb Aug 5, 2024

Choose a reason for hiding this comment

bdice Aug 5, 2024 • edited Loading

Choose a reason for hiding this comment

jameslamb Aug 5, 2024

Choose a reason for hiding this comment

bdice commented Aug 29, 2024 • edited Loading

jameslamb left a comment

Choose a reason for hiding this comment

bdice commented Sep 6, 2024

bdice commented Sep 7, 2024

bdice commented Sep 10, 2024

bdice commented Nov 18, 2023 •

edited

Loading

bdice Aug 5, 2024 •

edited

Loading

bdice commented Aug 29, 2024 •

edited

Loading