gh-130453: pygettext: Extend support for specifying custom keywords #130463

tomasr8 · 2025-02-22T17:37:16Z

This addresses the first point in #130453

It is now possible to use the full keyword spec syntax (except for t, that will be added later) to specify keywords:

./python Tools/i18n/pygettext.py --keyword=foo:1
./python Tools/i18n/pygettext.py --keyword=foo:1,2
./python Tools/i18n/pygettext.py --keyword=foo:1c,2
./python Tools/i18n/pygettext.py --keyword=foo:1c,2,3

I tried to match the behaviour of xgettext and babel but neither seem to do much validation for the keyword specs.
xgettext, for instance, does not allow foo:1c,2c (context specified twice) nor foo:1,1c (msgid and msgctxt have the same index) but it does (weirdly) allow foo:1,1 (same index for msgid and msgid_plural), whereas it outright crashes with a double free for foo:1,1,2c.

This PR properly validates the keyword specs in order to be consistent and provide helpful error messages to the user.

Feedback welcome!

Issue: pygettext: Extend support for specifying custom keywords #130453

serhiy-storchaka

I do not think that it was necessary to go so far with detecting errors and generating error reports. Garbage in -- garbage out. The parsing code could be 2 or 3 times smaller without this. But if you already implemented this, it is fine.

LGTM.

Tools/i18n/pygettext.py

serhiy-storchaka · 2025-02-23T10:41:04Z

Tools/i18n/pygettext.py

+        raise ValueError(f'Invalid keyword spec {spec!r}: '
+                         'msgctxt cannot appear without msgid')
+
+    return name, {v: k for k, v in result.items()}


Would it be simpler to build result in that form from the beginning?

I did that in d861c84, let me know if you like it better like that

It was just a question. I am fine with both variants.

I just wanted to let you see the difference :) I don't have a strong preference either, let's stick with the current version, then?

Actually, I tried implementing some followup work on top of this PR (support for the t specifier, multiple keywords with the same funcname) and it's better to use the original representation because the diff in the followup PRs will be smaller.

So I did the right thing by letting the PR lie down for two days. 😄

Yes, good call 🙂 And thanks for your super thorough reviews! It's really appreciated

serhiy-storchaka · 2025-02-23T10:47:29Z

Tools/i18n/pygettext.py

+    try:
+        options.keywords = dict(parse_spec(spec) for spec in options.keywords)
+    except ValueError as e:
+        raise SystemExit(e)


Other errors cause print() + sys.exit(1).

Indeed, though I believe raise SystemExit(s) is functionally equivalent to print(..., file=sys.stderr) + sys.exit(1) and since it's shorter and the intent is clearer, I thought I'd start using that instead.

Though if you prefer to be consistent, I can change it to print+sys.exit?

Yes, it just for consistency (also, sys.exit() allows to set different return codes, but this is not used here).

Updated in 18d29cb to use print+sys.exit

Tools/i18n/pygettext.py

tomasr8 · 2025-02-23T20:22:48Z

I do not think that it was necessary to go so far with detecting errors and generating error reports. Garbage in -- garbage out. The parsing code could be 2 or 3 times smaller without this. But if you already implemented this, it is fine.

Honestly, if you prefer it without the detailed error messages, I am fine with removing them. Just let me know!

My thinking for adding them was that most people using this will not be that familiar with the syntax and for them, it's better to show a descriptive error rather than fail silently, but as I said, if you prefer to have simpler code, that's also ok!

serhiy-storchaka · 2025-02-23T20:30:22Z

Tools/i18n/pygettext.py

+    try:
+        options.keywords = dict(parse_spec(spec) for spec in options.keywords)
+    except ValueError as e:
+        raise SystemExit(e)


Yes, it just for consistency (also, sys.exit() allows to set different return codes, but this is not used here).

serhiy-storchaka · 2025-02-23T20:32:04Z

Tools/i18n/pygettext.py

+        raise ValueError(f'Invalid keyword spec {spec!r}: '
+                         'msgctxt cannot appear without msgid')
+
+    return name, {v: k for k, v in result.items()}


It was just a question. I am fine with both variants.

tomasr8 added 3 commits February 22, 2025 11:40

Support custom keywords

74b0235

Add news entry

619cad5

PEP8

184232e

tomasr8 requested a review from serhiy-storchaka February 22, 2025 17:37

bedevere-app bot added the awaiting review label Feb 22, 2025

bedevere-app bot mentioned this pull request Feb 22, 2025

pygettext: Extend support for specifying custom keywords #130453

Open

tomasr8 added 2 commits February 22, 2025 18:42

Remove whitespace

4dd889b

Remove trailing whitespace

6a46ce7

serhiy-storchaka approved these changes Feb 23, 2025

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting review labels Feb 23, 2025

tomasr8 added 3 commits February 23, 2025 21:04

Simplify code

bb50cfe

Revert making keywords a set

83a21e0

Simplify 'parse_spec'

d861c84

tomasr8 requested a review from serhiy-storchaka February 23, 2025 20:24

serhiy-storchaka approved these changes Feb 23, 2025

View reviewed changes

Use print+sys.exit for consistency

18d29cb

tomasr8 requested a review from serhiy-storchaka February 23, 2025 20:40

Revert changes to parse_spec

a3ef55b

serhiy-storchaka merged commit 44213bc into python:main Feb 25, 2025
39 checks passed

bedevere-app bot removed the awaiting merge label Feb 25, 2025

tomasr8 deleted the pygettext-keywordspec branch February 25, 2025 10:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-130453: pygettext: Extend support for specifying custom keywords #130463

gh-130453: pygettext: Extend support for specifying custom keywords #130463

tomasr8 commented Feb 22, 2025 •

edited by bedevere-app bot

Loading

serhiy-storchaka left a comment

serhiy-storchaka Feb 23, 2025

tomasr8 Feb 23, 2025

serhiy-storchaka Feb 23, 2025

tomasr8 Feb 23, 2025

tomasr8 Feb 24, 2025

serhiy-storchaka Feb 25, 2025

tomasr8 Feb 25, 2025

serhiy-storchaka Feb 23, 2025

tomasr8 Feb 23, 2025

serhiy-storchaka Feb 23, 2025

tomasr8 Feb 23, 2025

tomasr8 commented Feb 23, 2025

serhiy-storchaka Feb 23, 2025

serhiy-storchaka Feb 23, 2025

gh-130453: pygettext: Extend support for specifying custom keywords #130463

gh-130453: pygettext: Extend support for specifying custom keywords #130463

Conversation

tomasr8 commented Feb 22, 2025 • edited by bedevere-app bot Loading

serhiy-storchaka left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tomasr8 commented Feb 23, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tomasr8 commented Feb 22, 2025 •

edited by bedevere-app bot

Loading