ENH: With extract-annotated-pages command #98

wolfram77 · 2025-02-06T12:46:33Z

This pull request addresses the following issue:

ENH: Add command to extract annotated pages #97

@Lucas-C

even links (internal or external) are annotations!

This was the issue.

Lucas-C · 2025-02-07T15:03:28Z

Thank you for your contribution @wolfram77 👍

Could you please address the following points, and I'll be happy to merge your PR:

please run black pdfly/extract_annotated_pages.py so that the GitHub Actions CI pipeline passes
please add a mention of your addition to CHANGELOG.md as part of this PR
please include at lease one basic unit test in test/test_extract_annotations.py. You could take inspiration from the code snippet below to create a PDF file with annotations "on the fly" in this unit test:

from fpdf import FPDF

pdf = FPDF()
pdf.set_font("Helvetica", size=12)

pdf.add_page()
text = "Link set over an arbitrary area with FPDF.link()"
x, y = 20, 150
pdf.text(x=x, y=y, text=text)
width = pdf.get_string_width(text)
pdf.link(
    x=x,
    y=y - pdf.font_size,
    w=width,
    h=pdf.font_size,
    link="https://github.com/py-pdf/fpdf2/discussions",
)

pdf.add_page()
pdf.text_annotation(
    x=20,
    y=150,
    text=f"This is a default text annotation.",
)
pdf.output("pdfly_pr_98.pdf")

PS: I'll be on holiday for a few days, so I'll get back to you only mid-february.

Lucas-C · 2025-02-07T15:03:47Z

@all-contributors please add @wolfram77 for code

allcontributors · 2025-02-07T15:03:57Z

@Lucas-C

I've put up a pull request to add @wolfram77! 🎉

wolfram77 · 2025-02-07T15:55:40Z

@Lucas-C Thanks for reviewing the PR. To simplify the test, I added a yellow highlight to page 7 of resources/input8.pdf. The test case now looks for one annotated page in it.

CHANGELOG.md

Lucas-C · 2025-02-07T16:02:39Z

Nice, good job with the unit test 👍

I think there are issues with the unit tests, on the main branch, regardless of this PR.

I won't have the time to fix them today, so you can either have a look at it based on the GitHub Actions logs, or else I'll fix that when I'll be back from holiday.

wolfram77 · 2025-02-07T17:09:10Z

@Lucas-C From the first failed test, I see the following:

        # Assert
        captured = capsys.readouterr()
>       assert exit_code == 0, captured
E       AssertionError: CaptureResult(out='', err="\x1b[33mUsage: \x1b[0mpytest update-offsets [OPTIONS] FILE_IN
E         \x1b[2mTry \x1b[0m\x1b[2;34m'pytest update-offsets \x1b[0m\x1b[1;2;34m-\x1b[0m\x1b[1;2;34m-help\x1b[0m\x1b[2;34m'\x1b[0m\x1b[2m for help.\x1b[0m
E         \x1b[31m╭─\x1b[0m\x1b[31m Error \x1b[0m\x1b[31m─────────────────────────────────────────────────────────────────────\x1b[0m\x1b[31m─╮\x1b[0m
E         \x1b[31m│\x1b[0m Got unexpected extra argument                                                \x1b[31m│\x1b[0m
E         \x1b[31m│\x1b[0m (/tmp/pytest-of-runner/pytest-0/test_update_offsets0/file-with-offsets-out.p \x1b[31m│\x1b[0m
E         \x1b[31m│\x1b[0m df)                                                                          \x1b[31m│\x1b[0m
E         \x1b[31m╰──────────────────────────────────────────────────────────────────────────────╯\x1b[0m
E         ")
E       assert 2 == 0

I am not sure why it seems to expect update-offsets to have only FILE_IN as an argument (no FILE_OUT), so it is failing with Got unexpected extra argument error. Could it be a pytest / typer version issue? Would be best you take a look at it when you are back.

Lucas-C · 2025-02-07T18:18:53Z

I fixed the main branch.

mypy reports some minor issues with your PR:

pdfly/extract_annotated_pages.py:20: error: "Path" has no attribute "with_stem"  [attr-defined]
pdfly/extract_annotated_pages.py:28: error: "PdfObject" has no attribute "__iter__" (not iterable)  [attr-defined]
pdfly/cli.py:348: error: Argument 2 to "main" has incompatible type "Optional[Path]"; expected "Path"  [arg-type]

wolfram77 · 2025-02-17T02:25:56Z

@Lucas-C please update me when you are back. I have (hopefully) addressed the issues you mentioned above.

pdfly/extract_annotated_pages.py

Lucas-C · 2025-02-17T15:22:53Z

@wolfram77 I updated your branch to rebase it and fix the last issue with ruff

Seems that the `| None` syntax is not valid with Python 3.8 Co-authored-by: Lucas Cimon <[email protected]>

Lucas-C · 2025-02-17T15:48:15Z

Merged!

Thank you @wolfram77 👍 🙂

wolfram77 · 2025-02-17T19:07:48Z

Co-authored-by: Lucas Cimon <[email protected]>

wolfram77 changed the title ~~With extract-annotated-pages command~~ ENH: With extract-annotated-pages command Feb 6, 2025

allcontributors bot mentioned this pull request Feb 7, 2025

Docs: Add wolfram77 as a contributor for code #99

Merged

wolfram77 force-pushed the main branch from 575f137 to 369c413 Compare February 7, 2025 15:49

Lucas-C reviewed Feb 7, 2025

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

Lucas-C approved these changes Feb 7, 2025

View reviewed changes

wolfram77 force-pushed the main branch from 369c413 to 12b28ba Compare February 7, 2025 17:04

wolfram77 force-pushed the main branch from f8cb61b to 2e4bf78 Compare February 7, 2025 19:30

Lucas-C reviewed Feb 17, 2025

View reviewed changes

pdfly/extract_annotated_pages.py Outdated Show resolved Hide resolved

Lucas-C approved these changes Feb 17, 2025

View reviewed changes

wolfram77 force-pushed the main branch from f1a9663 to debab52 Compare February 17, 2025 13:17

🐛 with extract-annotated-pages command

1e0ecdd

Lucas-C force-pushed the main branch from debab52 to 00fb53d Compare February 17, 2025 15:22

Lucas-C force-pushed the main branch 2 times, most recently from ca9000d to 505ab93 Compare February 17, 2025 15:40

pdate pdfly/extract_annotated_pages.py

e74fb10

Seems that the `| None` syntax is not valid with Python 3.8 Co-authored-by: Lucas Cimon <[email protected]>

Lucas-C force-pushed the main branch from 505ab93 to e74fb10 Compare February 17, 2025 15:46

Lucas-C merged commit 407e6cd into py-pdf:main Feb 17, 2025
10 checks passed

Lucas-C added a commit that referenced this pull request Feb 19, 2025

ENH: With extract-annotated-pages command (#98)

c11b31b

Co-authored-by: Lucas Cimon <[email protected]>

Lucas-C added a commit that referenced this pull request Feb 19, 2025

ENH: With extract-annotated-pages command (#98)

b9b0ff0

Co-authored-by: Lucas Cimon <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: With extract-annotated-pages command #98

ENH: With extract-annotated-pages command #98

wolfram77 commented Feb 6, 2025

Lucas-C commented Feb 7, 2025

Lucas-C commented Feb 7, 2025

allcontributors bot commented Feb 7, 2025

wolfram77 commented Feb 7, 2025

Lucas-C commented Feb 7, 2025

wolfram77 commented Feb 7, 2025 •

edited

Loading

Lucas-C commented Feb 7, 2025

wolfram77 commented Feb 17, 2025

Lucas-C commented Feb 17, 2025

Lucas-C commented Feb 17, 2025

wolfram77 commented Feb 17, 2025

ENH: With extract-annotated-pages command #98

ENH: With extract-annotated-pages command #98

Conversation

wolfram77 commented Feb 6, 2025

Lucas-C commented Feb 7, 2025

Lucas-C commented Feb 7, 2025

allcontributors bot commented Feb 7, 2025

wolfram77 commented Feb 7, 2025

Lucas-C commented Feb 7, 2025

wolfram77 commented Feb 7, 2025 • edited Loading

Lucas-C commented Feb 7, 2025

wolfram77 commented Feb 17, 2025

Lucas-C commented Feb 17, 2025

Lucas-C commented Feb 17, 2025

wolfram77 commented Feb 17, 2025

wolfram77 commented Feb 7, 2025 •

edited

Loading