Disallow periods at the start and end of new usernames #9336

robertknight · 2025-02-10T13:14:44Z

Work towards resolving an ambiguity when parsing mentions out of text like "I like this @bob." by disallowing periods at the start and end of usernames.

There are a small number (~2000 or <0.1%) of existing accounts which do contain periods at the start or end of usernames. We need to continue to allow those accounts to be used, but we can get away with not supporting extracting mentions for those users from text.

Fixes #9335.

Testing:

Try to create a new user account that starts or ends with a period. This should be rejected, with a helpful validation error.
Try to log in to an existing user account that starts or ends with a period. This should work as before.
Try to rename a user account to start or end with a period. This should be rejected.

@bob

Work towards resolving an ambiguity when parsing mentions out of text like "I like this @bob." by disallowing periods at the start and end of usernames. There are a small number (~2000 or <0.1%) of existing accounts which do contain periods at the start or end of usernames. We need to continue to allow those accounts to be used, but we can get away with not supporting extracting mentions for those users from text. Fixes #9335

robertknight · 2025-02-10T13:36:37Z

tests/unit/h/accounts/schemas_test.py

-            schema.deserialize({"username": "a"})
-        assert exc.value.asdict()["username"] == ("Must be 3 characters or more.")
+            schema.deserialize({"username": "ab"})
+        assert "Must be 3 characters or more." in exc.value.asdict()["username"]


The USERNAME_PATTERN regex now implicitly sets a minimum length of 3 (at least one start, middle and end char) so it also fails. I think that's fine, but for this test only check for the length message specifically.

robertknight · 2025-02-10T13:37:51Z

Requested a review from @seanh in case there are any issues with changing the username pattern that I've missed.

seanh

👍

seanh · 2025-02-11T09:38:58Z

h/models/user.py

+# nb. This pattern is used in Python code, JSON schemas and HTML forms, so it
+# needs to use portable syntax.
+USERNAME_PATTERN = "^[A-Za-z0-9_][A-Za-z0-9._]+[A-Za-z0-9_]$"


Will this affect the user-update API, for those users whose existing usernames start or end with a .?

The user update API does not validate the format of the username that I can see, which makes it compatible with old usernames. I have added a note about the change in the comment for USERNAME_PATTERN to make future readers aware.

seanh · 2025-02-11T09:47:15Z

tests/unit/h/views/admin/users_test.py

@@ -35,6 +36,7 @@ def test_users_index(pyramid_request):
        "username": None,
        "authority": None,
        "user": None,
+        "username_pattern": Any.string(),


FYI we're trying to stop using h-matchers nowadays as there's various problems with it (it's implemented in a very clever way that makes it impossible to maintain, it has a lot of features that we never use, and it ruins pytest's assertion failure messages). I'm not sure what the plain Python / pytest alternative to usages like this would be, however. You could use mock.ANY but that doesn't assert that it's a string. Alternatively a simple classic-style matcher could be implemented.

(mock.ANY is implemented using the classic-style matchers approach. The original request for h-matchers was just to cut-paste existing classic-style matchers from app tests into a shared library, but we got h-matchers instead.)

Looks like we used to have an instance_of matcher for just this purpose:

h/tests/common/matchers.py

Lines 52 to 62 in c480243

class instance_of(Matcher): # noqa: N801

"""An object __eq__ to any object which is an instance of `type_`."""

def __init__(self, type_):

self.type = type_

def __eq__(self, other):

return isinstance(other, self.type)

def __repr__(self):

return '<instance of {!r}>'.format(self.type)

. Could build some alias matchers on top of that, like any_string() etc.

FYI we're trying to stop using h-matchers nowadays as there's various problems with it (it's implemented in a very clever way that makes it impossible to maintain, it has a lot of features that we never use, and it ruins pytest's assertion failure messages).

Could we perhaps distill h-matchers into a more limited subset of the functionality that covers the main use cases? Preferably in such a way that the API migration is easy. The "legacy" API could be moved into a submodule in h-matchers to facilitate incremental migration.

I have swapped Any.string() for mock.ANY here, since that is fine for the current use case.

Could we perhaps distill h-matchers into a more limited subset of the functionality that covers the main use cases? Preferably in such a way that the API migration is easy. The "legacy" API could be moved into a submodule in h-matchers to facilitate incremental migration.

I think unfortunately not easily, given the way h-matchers is implemented.

There's only a handful of types of assertion that h-matchers tends to be used for across our tests, and each of those types of assertion can be replaced with a simpler approach, e.g. just a plain assertion, or mock.ANY, or a simple matcher class, etc. In that way I think we can phase out h-matchers.

See #9336 (comment)

acelaya approved these changes Feb 10, 2025

View reviewed changes

robertknight commented Feb 10, 2025

View reviewed changes

robertknight requested a review from seanh February 10, 2025 13:37

seanh approved these changes Feb 11, 2025

View reviewed changes

robertknight added 2 commits February 11, 2025 10:59

Add a note about the username pattern change and existing usernames

8e20725

Prefer mock.ANY over Any.string()

f177feb

See #9336 (comment)

robertknight merged commit 89c82e5 into main Feb 11, 2025
11 checks passed

robertknight deleted the disallow-username-start-end-period branch February 11, 2025 11:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disallow periods at the start and end of new usernames #9336

Disallow periods at the start and end of new usernames #9336

robertknight commented Feb 10, 2025 •

edited

Loading

robertknight Feb 10, 2025

robertknight commented Feb 10, 2025

seanh left a comment

seanh Feb 11, 2025

robertknight Feb 11, 2025 •

edited

Loading

seanh Feb 11, 2025

seanh Feb 11, 2025

robertknight Feb 11, 2025

robertknight Feb 11, 2025

seanh Feb 11, 2025

	class instance_of(Matcher): # noqa: N801
	"""An object __eq__ to any object which is an instance of `type_`."""

	def __init__(self, type_):
	self.type = type_

	def __eq__(self, other):
	return isinstance(other, self.type)

	def __repr__(self):
	return '<instance of {!r}>'.format(self.type)

Disallow periods at the start and end of new usernames #9336

Disallow periods at the start and end of new usernames #9336

Conversation

robertknight commented Feb 10, 2025 • edited Loading

robertknight Feb 10, 2025

Choose a reason for hiding this comment

robertknight commented Feb 10, 2025

seanh left a comment

Choose a reason for hiding this comment

seanh Feb 11, 2025

Choose a reason for hiding this comment

robertknight Feb 11, 2025 • edited Loading

Choose a reason for hiding this comment

seanh Feb 11, 2025

Choose a reason for hiding this comment

seanh Feb 11, 2025

Choose a reason for hiding this comment

robertknight Feb 11, 2025

Choose a reason for hiding this comment

robertknight Feb 11, 2025

Choose a reason for hiding this comment

seanh Feb 11, 2025

Choose a reason for hiding this comment

robertknight commented Feb 10, 2025 •

edited

Loading

robertknight Feb 11, 2025 •

edited

Loading