Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Updating apt repositories fails on Slurm images based on Ubuntu 20.04 #3254

Open
tpdownes opened this issue Nov 13, 2024 · 2 comments
Open
Labels
bug Something isn't working stale

Comments

@tpdownes
Copy link
Member

tpdownes commented Nov 13, 2024

Describe the bug

Cannot update apt repository metadata on VMs using the Slurm image based upon Ubuntu 20.04.

Steps to reproduce

Steps to reproduce the behavior:

  1. apt-get update (or apt update) fails with a message similar to

E:Failed to fetch https://downloads.whamcloud.com/public/lustre/latest-feature-release/ubuntu2004/client/Packages 404 Not Found [IP: 138.197.209.17 443], E:Some index files failed to download. They have been ignored, or old ones used instead

Expected behavior

apt-get update should succeed without error

Workaround

Run sudo rm -f /etc/apt/sources.list.d/lustre-client.list and re-run apt-get update.

Planned fix

We will update the Whamcloud (DDN) lustre repository URL in future builds of Slurm Ubuntu 20.04 images. We are also planning a migration for the a3-highgpu-8g blueprint away from Ubuntu 20.04 to a more recent Ubuntu release.

@tpdownes tpdownes added the bug Something isn't working label Nov 13, 2024
@tpdownes
Copy link
Member Author

tpdownes commented Nov 13, 2024

This runner in as the first runner in startup script can implement this:

      - type: shell
        destination: workaround_apt.sh
        content: |
          #!/bin/bash
          # this script is no longer necessary on the most recent TCPX A3
          # images, however it is included for backwards compatibility
          set -e -o pipefail
          rm -f /etc/apt/sources.list.d/lustre-client.list
          apt-get update

See these links for usage of startup scripts on Slurm infrastructure:

Copy link

This issue is stale because it has been open for 30 days with no activity.

@github-actions github-actions bot added the stale label Dec 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working stale
Projects
None yet
Development

No branches or pull requests

1 participant