Add entry_point plugin support for user defined HELM run_specs #42

dmjoy · 2026-01-21T13:58:55Z

Temporary workaround until this HELM PR lands: stanford-crfm/helm#3916

May consider using similar approach for other MAGNET/HELM plugins.

dmjoy · 2026-01-21T14:26:26Z

@Erotemic I guess we're stuck with failing tests until the boolq scenario is fixed upstream?

Erotemic · 2026-01-21T15:57:42Z

We could:

disable that one test
switch to a different dataset.
use a patched fork of HELM on the CI

dmjoy · 2026-01-22T20:10:51Z

Closing as the upstream PR has landed.

dmjoy · 2026-01-22T20:26:34Z

Re-opening as the entrypoint updates here to plugin our custom run-specs/scenarios are still relevant. Removing the magnet_helm_run.py wrapper script as it's no longer needed.

dmjoy · 2026-01-22T20:29:38Z

@Erotemic seems like CI doesn't like the URL dependency for helm, but I did confirm that our test pass locally. Do you think it makes more sense to pin to a particular commit (vs. the kitware-main branch)? If we're going to make somewhat frequent / potentially breaking changes there maybe pinning to a commit is a good idea?

Erotemic · 2026-01-23T15:46:51Z

I think pinning to a branch makes sense. Frequent changes can be pushed to a branch other than kitware-main, which I want to be: commits that we think probably should go upstream if it was still maintained.

It looks like git urls are not really supported in the project.dependencies section of pyproject.toml as they break pypi deployment. That means making a pypi package like kitware-helm might be the only way to have a truly robust install mechanism where the user doesn't have to think about it.

https://docs.astral.sh/uv/pip/compatibility/?utm_source=chatgpt.com#transitive-url-dependencies

To make CI pass so we can continue to develop I recommend making a requirements/runtime.txt adding the git url line to that and adding a pip install -r requirements/runtime.txt before we pip install the magnet package. Then make the dependency plain without the git url.

This unblocks us while we think of a better way to handle this in general. (I think it will converge on a new kitware-helm package)

dmjoy · 2026-01-23T17:02:35Z

I think I roughly follow you and the link is helpful, I've made some tweaks to that end (again mostly to get the CI passing). I do have reservations about maintaining dependencies it two distinct places (seems like a footgun). I'm with you that we'll likely converge on needing a PyPI kitware-helm package)

dmjoy · 2026-01-23T17:04:08Z

.github/workflows/tests.yml

      run: |-
        python -m pip install pip uv -U
        python -m uv pip install -r pyproject.toml --extra tests
+        python -m uv pip install -r requirements/runtime.txt


@Erotemic is this the place (and only place) I needed to add this?

You also need it in the test_purepy_wheels section after python -m uv pip install --prerelease=allow "aiq-magnet[$INSTALL_EXTRAS]==$MOD_VERSION" -f wheelhouse. There are tow paths for testing the source dist and the wheel dist.

dmjoy · 2026-01-23T17:36:19Z

Alright so maybe the requirements/runtime.txt bit is working now with CI, but running into a different error with the tests (in that it looks like the latest symlink isn't being generated for the demodata so the _coerce_from_patterned_paths piece isn't passing). Running the tests locally it was passing, but then I cleared out my local cached demodata directory, and tried re-running the tests. It doesn't seem like any demo data is being downloaded or something, because a whole hosts of tests failed after clearing that local cache. My assumption was the demo data would get downloaded as a part of the tests. Am I missing something @Erotemic ?

Erotemic · 2026-01-23T20:50:14Z

You should be correct. I'm taking a look. Tests that rely on network items will always have some sort of issue like this. I've got the issue reproduced locally.

Erotemic · 2026-01-23T20:53:42Z

Ah, I see the issue. Upstream HELM removed the "latest" symlink, which honestly is probably a good design decision. See: 130e41ae10c305fa83df6d3158e3153b188b74c8

That does invalidate some tests, but it just means we need to update the "num_expect" in that string. I'll make the change and push it up.

Erotemic · 2026-01-23T23:08:01Z

@dmjoy The tests are passing. Note that the original run had failures to resolve the IPFS url, but rerunning the tests passed. This is part of the tradeoff when using a distributed system, it can be a bit slow to warm up. What probably happened is we hit the IPFS gateway with the URL, it didn't have the data cached, so it started searching for the content, but timed and returned an error. But it kept trying to find that content behind the scenes so when we tried again, it found it and everything worked.

I was hoping it would be a bit more seamless, but it looks like there are still some rough edges.

dmjoy · 2026-01-26T15:20:30Z

Roger that, thanks for pushing up fixes. Any objections to merging this as-is now?

Erotemic · 2026-01-26T15:33:01Z

Let's merge.

We might need to add in a longer timeout, but let's see what it looks like for other PRs.

Add entry_point plugin support for user defined HELM run_specs

50be6d5

dmjoy closed this Jan 22, 2026

Point at kitware-helm fork; remove helm-run wrapper

f81ce56

dmjoy reopened this Jan 22, 2026

Add requirements/runtime.txt for URL kitware-helm package

deac4ba

dmjoy commented Jan 23, 2026

View reviewed changes

Add kitware-helm URL requirement to CI tests

88398a9

Erotemic added 2 commits January 23, 2026 15:56

Fixes for upstream HELM

a9ac8d3

Fix test

082b50f

dmjoy merged commit 6c0731b into main Jan 26, 2026
39 of 45 checks passed

Add entry_point plugin support for user defined HELM run_specs #42

Add entry_point plugin support for user defined HELM run_specs #42

Uh oh!

Conversation

dmjoy commented Jan 21, 2026

Uh oh!

dmjoy commented Jan 21, 2026

Uh oh!

Erotemic commented Jan 21, 2026

Uh oh!

dmjoy commented Jan 22, 2026

Uh oh!

dmjoy commented Jan 22, 2026

Uh oh!

dmjoy commented Jan 22, 2026

Uh oh!

Erotemic commented Jan 23, 2026

Uh oh!

dmjoy commented Jan 23, 2026

Uh oh!

dmjoy Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

Erotemic Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

dmjoy commented Jan 23, 2026

Uh oh!

Erotemic commented Jan 23, 2026

Uh oh!

Erotemic commented Jan 23, 2026

Uh oh!

Erotemic commented Jan 23, 2026

Uh oh!

dmjoy commented Jan 26, 2026

Uh oh!

Erotemic commented Jan 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants