Use pre-commit hooks in tox configuration #2057

waxlamp · 2024-10-23T15:05:33Z

This extends #2045 to use the same pre-commit hooks to perform linting and formatting via tox.ini. In particular, this "refactors" the specification of what versions of codespell and ruff to use to just one place: the pre-commit config file.

The variants of the pre-commit hooks that are needed to replace the tox.ini commands are listed in the pre-commit config using the manual stage specifier with appropriate aliases, meaning they will not be invoked as a git hook ever, and enabling them to be invoked by name in the tox config.

This PR also modifies the dev Dockerfile to build it to run as the user from the host system. That is, instead of running as root inside the container, it will instead run as the same (non-root) user who is doing development work (including building this container, etc.). That enables the use of pre-commit both inside and outside the container, since pre-commit requires examining Git status, etc. Without this modification, you get errors about "dubious ownership" of the Git repository.

I am totally willing to peel off the Dockerfile changes to a separate PR if we require further discussion, especially if some of these ideas are suitable for general use with Docker containers. Let me know what you think, @mvandenburgh.

yarikoptic · 2024-10-23T15:50:51Z

There are some quirks w.r.t. running this via Docker:

Because the permissions for the .git directory are not those of root,

IMHO nobody should run dev tools as root -- should be mapped to the real UID via --user option etc. (in https://github.com/ReproNim/containers/blob/master/scripts/singularity_cmd#L198C1-L199C19 we passed UID and GID env vars) -- what container we are talking about to try on?

mvandenburgh · 2024-10-23T16:32:19Z

Should the difficulties around running linting inside docker be documented in DEVELOPMENT.md?

yarikoptic · 2024-10-24T01:46:01Z

when do git and tox need to run inside docker environment?

waxlamp · 2024-10-25T14:32:01Z

IMHO nobody should run dev tools as root -- should be mapped to the real UID via --user option etc. (in https://github.com/ReproNim/containers/blob/master/scripts/singularity_cmd#L198C1-L199C19 we passed UID and GID env vars) -- what container we are talking about to try on?

Nobody is running dev tools as root 🙂. Docker containers, by default, run as root inside the container (even when launched by a non-root user on the host system). Your example is using Singularity, which was designed in part to work with non-root users by default.

when do git and tox need to run inside docker environment?

Tox needs to run inside the container precisely when the dev wants it to. For example, I often run the command docker compose run --rm django tox -e lint etc. to perform linting in my dev environment. Git does not need to run inside the container, though pre-commit uses git to determine how to run its hooks, which is where I was running into trouble.

Resonant applications provides this container as a convenience. Adopting pre-commit in dandi-archive caused some of the "rev matching" problems to surface, so I'm looking into how to fix it to provide a generally better experience across Resonant apps.

In any case, I am working on a way to build the container so that it runs as the same user as the dev who built it. If that works, then it will provide parity between host OS and Docker container. Stay tuned for updates.

yarikoptic · 2024-10-28T16:20:15Z

There are some quirks w.r.t. running this via Docker:

Because the permissions for the .git directory are not those of root,

IMHO nobody should run dev tools as root -- should be mapped to the real UID via --user option etc. (in https://github.com/ReproNim/containers/blob/master/scripts/singularity_cmd#L198C1-L199C19 we passed UID and GID env vars) -- what container we are talking about to try on?

FWIW -- checked that it works:

❯ docker compose run --rm --user "$(id -u)" django touch 123
WARN[0000] /home/yoh/proj/dandi/dandi-archive/docker-compose.yml: the attribute `version` is obsolete, it will be ignored, please remove it to avoid potential confusion 
WARN[0000] /home/yoh/proj/dandi/dandi-archive/docker-compose.override.yml: the attribute `version` is obsolete, it will be ignored, please remove it to avoid potential confusion 
[+] Creating 3/0
 ✔ Container dandi-archive-minio-1     Running                                                                                                                                 0.0s 
 ✔ Container dandi-archive-rabbitmq-1  Running                                                                                                                                 0.0s 
 ✔ Container dandi-archive-postgres-1  Running                                                                                                                                 0.0s 

❯ ls -ld 123
-rw-r--r-- 1 yoh root 0 Oct 28 12:18 123

so just add --user "$(id -u)" to your invocation of tox within container. Depending on scripts, you might also benefit from passing env vars as I pointed our so more complete would be --user "$(id -u)" -e "UID=$(id -u)" -e "GID=$(id -g)" (potentially improved to your liking)

yarikoptic · 2024-10-28T16:22:56Z

note: might need to clean up "root owned" artifacts from prior runs of tox first, and then it seems to work just fine

❯ sudo rm -rf .tox/lint/log .ruff_cache
❯ docker compose run --rm --user "$(id -u)" -e "UID=$(id -u)" -e "GID=$(id -g)" django tox -e lint
WARN[0000] /home/yoh/proj/dandi/dandi-archive/docker-compose.yml: the attribute `version` is obsolete, it will be ignored, please remove it to avoid potential confusion 
WARN[0000] /home/yoh/proj/dandi/dandi-archive/docker-compose.override.yml: the attribute `version` is obsolete, it will be ignored, please remove it to avoid potential confusion 
[+] Creating 3/0
 ✔ Container dandi-archive-postgres-1  Running                                                                                                                                 0.0s 
 ✔ Container dandi-archive-minio-1     Running                                                                                                                                 0.0s 
 ✔ Container dandi-archive-rabbitmq-1  Running                                                                                                                                 0.0s 
lint: commands[0]> ruff --version
ruff 0.6.9
lint: commands[1]> ruff check
All checks passed!
lint: commands[2]> ruff format --check
155 files already formatted
lint: commands[3]> codespell .
  lint: OK (0.34=setup[0.06]+cmd[0.01,0.02,0.02,0.23] seconds)
  congratulations :) (0.44 seconds)

waxlamp · 2024-10-31T14:31:14Z

so just add --user "$(id -u)" to your invocation of tox within container

This mostly works--depending on how the container image was built you can still run into problems. Plus, having to add the --user flag on every invocation is annoying and error-prone (you might forget to do it, etc.)

note: might need to clean up "root owned" artifacts from prior runs of tox first, and then it seems to work just fine

This is really the core of the issue--having to remember the --user flag, plus the fact that your root user might be "polluting" your dev directory makes for an unergonomic dev experience.

But I'm experimenting with moving these things to the build phase so that the regular commands "just work". Stand by, I have an incoming fix for all this (I hope).

yarikoptic · 2024-10-31T18:31:50Z

note: might need to clean up "root owned" artifacts from prior runs of tox first, and then it seems to work just fine

This is really the core of the issue--having to remember the --user flag, plus the fact that your root user might be "polluting" your dev directory makes for an unergonomic dev experience.

The core issue is :

IMHO nobody should run dev tools as root -- should be mapped to the real UID via --user option etc. (in https://github.com/ReproNim/containers/blob/master/scripts/singularity_cmd#L198C1-L199C19 we passed UID and GID env vars) -- what container we are talking about to try on?

Nobody is running dev tools as root 🙂. Docker containers, by default, run as root inside the container (even when launched by a non-root user on the host system).

and the fact that "Docker containers" are not animate, they do not run themselves. They are executed by people who run them and as a result running dev tools as root. It is just a fact.

Tox needs to run inside the container precisely when the dev wants it to.

It is something to remember about and to not execute docker with any dev tools/scripts without precautions being taken -- ideally to avoid that generally. Same advice for sudo - too many people often did sudo pip install causing all kinds of pains later on -- all of that is IMHO just a bad practice.

In the longer run it might be better to look to switch to podman and having fake root inside the container space so touched bind-mounted files would still be fine.

I have expressed my opinion ("never run dev tools as root" - whether through docker or sudo or ...), and will quiet down on this particular issue.

waxlamp · 2024-10-31T18:35:56Z

note: might need to clean up "root owned" artifacts from prior runs of tox first, and then it seems to work just fine

This is really the core of the issue--having to remember the --user flag, plus the fact that your root user might be "polluting" your dev directory makes for an unergonomic dev experience.

The core issue is :

IMHO nobody should run dev tools as root -- should be mapped to the real UID via --user option etc. (in https://github.com/ReproNim/containers/blob/master/scripts/singularity_cmd#L198C1-L199C19 we passed UID and GID env vars) -- what container we are talking about to try on?

Nobody is running dev tools as root 🙂. Docker containers, by default, run as root inside the container (even when launched by a non-root user on the host system).

and the fact that "Docker containers" are not animate, they do not run themselves. They are executed by people who run them and as a result running dev tools as root. It is just a fact.

Tox needs to run inside the container precisely when the dev wants it to.

It is something to remember about and to not execute docker with any dev tools/scripts without precautions being taken -- ideally to avoid that generally. Same advice for sudo - too many people often did sudo pip install causing all kinds of pains later on -- all of that is IMHO just a bad practice.

In the longer run it might be better to look to switch to podman and having fake root inside the container space so touched bind-mounted files would still be fine.

I have expressed my opinion ("never run dev tools as root" - whether through docker or sudo or ...), and will quiet down on this particular issue.

I did not mean to be dismissive--I was just pointing out a reality of how most people run Docker containers. As I said, I'm working on a solution that addresses this issue head on (it is similar to what you suggested, but even better).

DEVELOPMENT.md

mvandenburgh

I get this when I run the docker compose build command. I think the adduser command doesn't support the . character in usernames (my username on my ubuntu system is michael.vandenburgh).

[+] Building 1.2s (11/33)                                                                                                                                          docker:default
 => [celery internal] load build definition from django.Dockerfile                                                                                                           0.0s
 => => transferring dockerfile: 2.63kB                                                                                                                                       0.0s
 => [django internal] load build definition from django.Dockerfile                                                                                                           0.0s
 => => transferring dockerfile: 2.63kB                                                                                                                                       0.0s
 => [celery internal] load metadata for docker.io/library/python:3.11-slim                                                                                                   0.6s
 => [celery internal] load .dockerignore                                                                                                                                     0.0s
 => => transferring context: 2B                                                                                                                                              0.0s
 => [django internal] load .dockerignore                                                                                                                                     0.0s
 => => transferring context: 2B                                                                                                                                              0.0s
 => [celery internal] load build context                                                                                                                                     0.4s
 => => transferring context: 657.14kB                                                                                                                                        0.4s
 => [django  1/16] FROM docker.io/library/python:3.11-slim@sha256:5148c0e4bbb64271bca1d3322360ebf4bfb7564507ae32dd639322e4952a6b16                                           0.0s
 => CACHED [django  2/16] RUN apt-get update &&     apt-get install --no-install-recommends --yes     libpq-dev gcc libc6-dev git &&     rm -rf /var/lib/apt/lists/*         0.0s
 => CACHED [django  3/16] RUN pip install pre-commit                                                                                                                         0.0s
 => ERROR [django  4/16] RUN if [ "692589655" != 0 ]; then adduser --uid 692589655 --gid 692584961 --home /home/michael.vandenburgh michael.vandenburgh; fi                  0.5s
 => [django internal] load build context                                                                                                                                     0.4s
 => => transferring context: 657.14kB                                                                                                                                        0.4s
------
 > [django  4/16] RUN if [ "692589655" != 0 ]; then adduser --uid 692589655 --gid 692584961 --home /home/michael.vandenburgh michael.vandenburgh; fi:
0.489 adduser: Please enter a username matching the regular expression
0.489             configured via the NAME_REGEX configuration variable.  Use the
0.489             `--allow-bad-names' option to relax this check or reconfigure
0.489             NAME_REGEX in configuration.
------
failed to solve: process "/bin/sh -c if [ \"${USERID}\" != 0 ]; then adduser --uid ${USERID} --gid ${GROUPID} --home /home/${LOGIN} $LOGIN; fi" did not complete successfully: exit code: 1

The man page for adduser says you can use --allow-bad-names to override this, but then I get this error:

 > [celery  4/16] RUN if [ "692589655" != 0 ]; then adduser --uid 692589655 --gid 692584961 --home /home/michael.vandenburgh michael.vandenburgh --allow-bad-names; fi:
0.361 adduser: The GID 692584961 does not exist.

dev/django.Dockerfile

waxlamp · 2024-11-11T17:20:27Z

I get this when I run the docker compose build command. I think the adduser command doesn't support the . character in usernames (my username on my ubuntu system is michael.vandenburgh).

[+] Building 1.2s (11/33)                                                                                                                                          docker:default
 => [celery internal] load build definition from django.Dockerfile                                                                                                           0.0s
 => => transferring dockerfile: 2.63kB                                                                                                                                       0.0s
 => [django internal] load build definition from django.Dockerfile                                                                                                           0.0s
 => => transferring dockerfile: 2.63kB                                                                                                                                       0.0s
 => [celery internal] load metadata for docker.io/library/python:3.11-slim                                                                                                   0.6s
 => [celery internal] load .dockerignore                                                                                                                                     0.0s
 => => transferring context: 2B                                                                                                                                              0.0s
 => [django internal] load .dockerignore                                                                                                                                     0.0s
 => => transferring context: 2B                                                                                                                                              0.0s
 => [celery internal] load build context                                                                                                                                     0.4s
 => => transferring context: 657.14kB                                                                                                                                        0.4s
 => [django  1/16] FROM docker.io/library/python:3.11-slim@sha256:5148c0e4bbb64271bca1d3322360ebf4bfb7564507ae32dd639322e4952a6b16                                           0.0s
 => CACHED [django  2/16] RUN apt-get update &&     apt-get install --no-install-recommends --yes     libpq-dev gcc libc6-dev git &&     rm -rf /var/lib/apt/lists/*         0.0s
 => CACHED [django  3/16] RUN pip install pre-commit                                                                                                                         0.0s
 => ERROR [django  4/16] RUN if [ "692589655" != 0 ]; then adduser --uid 692589655 --gid 692584961 --home /home/michael.vandenburgh michael.vandenburgh; fi                  0.5s
 => [django internal] load build context                                                                                                                                     0.4s
 => => transferring context: 657.14kB                                                                                                                                        0.4s
------
 > [django  4/16] RUN if [ "692589655" != 0 ]; then adduser --uid 692589655 --gid 692584961 --home /home/michael.vandenburgh michael.vandenburgh; fi:
0.489 adduser: Please enter a username matching the regular expression
0.489             configured via the NAME_REGEX configuration variable.  Use the
0.489             `--allow-bad-names' option to relax this check or reconfigure
0.489             NAME_REGEX in configuration.
------
failed to solve: process "/bin/sh -c if [ \"${USERID}\" != 0 ]; then adduser --uid ${USERID} --gid ${GROUPID} --home /home/${LOGIN} $LOGIN; fi" did not complete successfully: exit code: 1

The man page for adduser says you can use --allow-bad-names to override this, but then I get this error:

 > [celery  4/16] RUN if [ "692589655" != 0 ]; then adduser --uid 692589655 --gid 692584961 --home /home/michael.vandenburgh michael.vandenburgh --allow-bad-names; fi:
0.361 adduser: The GID 692584961 does not exist.

Thanks for the edge cases (regarding username nonconformance and group existence). Commits d7026cc and 2ada467 should help.

DEVELOPMENT.md

dev/django.Dockerfile

Co-authored-by: Mike VanDenburgh <[email protected]>

mvandenburgh

This works for me now. Just left one suggestion

dev/django.Dockerfile

Co-authored-by: Mike VanDenburgh <[email protected]>

dandibot · 2024-12-16T16:36:11Z

🚀 PR was released in v0.4.0 🚀

waxlamp requested review from danlamanna and mvandenburgh October 23, 2024 15:05

waxlamp mentioned this pull request Oct 23, 2024

Add a CI job that runs linting and formatting against latest ruff #2055

Closed

waxlamp force-pushed the detox branch 3 times, most recently from 58fb6cb to 5e90b4c Compare November 5, 2024 15:27

waxlamp added 3 commits November 5, 2024 10:42

Use pre-commit in tox configuration

6e181d1

Build dev container as normal user

ffe0b1b

Add documentation explaining how to build the container image properly

3a9a3c6

waxlamp force-pushed the detox branch from 5e90b4c to 3a9a3c6 Compare November 5, 2024 15:43

yarikoptic reviewed Nov 6, 2024

View reviewed changes

DEVELOPMENT.md Outdated Show resolved Hide resolved

yarikoptic reviewed Nov 6, 2024

View reviewed changes

DEVELOPMENT.md Outdated Show resolved Hide resolved

Include instruction to run docker compose pull

6acf816

mvandenburgh reviewed Nov 8, 2024

View reviewed changes

dev/django.Dockerfile Outdated Show resolved Hide resolved

dev/django.Dockerfile Outdated Show resolved Hide resolved

waxlamp added 2 commits November 11, 2024 09:54

Revert move of ENV instructions

463109c

Install pre-commit through setup.py

22c8fdf

waxlamp force-pushed the detox branch from ee7230e to ba5dcc8 Compare November 11, 2024 17:00

waxlamp added 3 commits November 11, 2024 12:18

Create the group if it doesn't already exist

2ada467

Allow usernames with . in them

d7026cc

Use getent to control conditional creation of user

3eeeeb9

waxlamp force-pushed the detox branch from ba5dcc8 to 3eeeeb9 Compare November 11, 2024 17:18

waxlamp requested a review from mvandenburgh November 11, 2024 17:20

asmacdo reviewed Nov 13, 2024

View reviewed changes

DEVELOPMENT.md Outdated Show resolved Hide resolved

Include GROUPNAME argument in second iteration of build command

026dc13

mvandenburgh reviewed Nov 18, 2024

View reviewed changes

dev/django.Dockerfile Outdated Show resolved Hide resolved

Allow all group names

54c2dfd

Co-authored-by: Mike VanDenburgh <[email protected]>

waxlamp requested a review from mvandenburgh December 12, 2024 17:42

mvandenburgh reviewed Dec 12, 2024

View reviewed changes

dev/django.Dockerfile Outdated Show resolved Hide resolved

Simplify the pre-commit cache generation command

c7ce1ef

Co-authored-by: Mike VanDenburgh <[email protected]>

mvandenburgh approved these changes Dec 13, 2024

View reviewed changes

waxlamp merged commit b0fa9e0 into master Dec 13, 2024
11 checks passed

waxlamp deleted the detox branch December 13, 2024 19:46

dandibot added the released This issue/pull request has been released. label Dec 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use pre-commit hooks in tox configuration #2057

Use pre-commit hooks in tox configuration #2057

waxlamp commented Oct 23, 2024 •

edited

Loading

yarikoptic commented Oct 23, 2024

mvandenburgh commented Oct 23, 2024

yarikoptic commented Oct 24, 2024

waxlamp commented Oct 25, 2024

yarikoptic commented Oct 28, 2024

yarikoptic commented Oct 28, 2024

waxlamp commented Oct 31, 2024

yarikoptic commented Oct 31, 2024 •

edited

Loading

waxlamp commented Oct 31, 2024

mvandenburgh left a comment

waxlamp commented Nov 11, 2024

mvandenburgh left a comment

dandibot commented Dec 16, 2024

Use pre-commit hooks in tox configuration #2057

Use pre-commit hooks in tox configuration #2057

Conversation

waxlamp commented Oct 23, 2024 • edited Loading

yarikoptic commented Oct 23, 2024

mvandenburgh commented Oct 23, 2024

yarikoptic commented Oct 24, 2024

waxlamp commented Oct 25, 2024

yarikoptic commented Oct 28, 2024

yarikoptic commented Oct 28, 2024

waxlamp commented Oct 31, 2024

yarikoptic commented Oct 31, 2024 • edited Loading

waxlamp commented Oct 31, 2024

mvandenburgh left a comment

Choose a reason for hiding this comment

waxlamp commented Nov 11, 2024

mvandenburgh left a comment

Choose a reason for hiding this comment

dandibot commented Dec 16, 2024

waxlamp commented Oct 23, 2024 •

edited

Loading

yarikoptic commented Oct 31, 2024 •

edited

Loading