[IMP] account_statement_import_sheet_file: XLS[X] file can actually be HTML by Daemo00 · Pull Request #763 · OCA/bank-statement-import

Daemo00 · 2025-01-30T18:41:09Z

I recently downloaded an XLS file that was actually an HTML file (they exist, check the file in tests/fixtures!) and the module account_statement_import_sheet_file wasn't able to import it.
With this PR, it is possible to import such files.

Most of the README edits were generated automatically by running pre-commit.

OCA-git-bot · 2025-01-30T18:41:13Z

Hi @alexey-pelykh,
some modules you are maintaining are being modified, check this out!

github-actions · 2025-11-30T12:28:17Z

There hasn't been any activity on this pull request in the past 4 months, so it has been marked as stale and it will be closed automatically if no further activity occurs in the next 30 days.
If you want this PR to never become stale, please ask a PSC member to apply the "no stale" label.

alexey-pelykh

Thanks for tackling this — banks exporting XLS files that are actually HTML is a common pain point. The detection via lstrip().startswith("<html>") and conversion through lxml is a pragmatic solution.

One minor thing: is_HTML doesn't follow Python naming — should be is_html. Not blocking.

Code review LGTM.
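
To illustrate the conversion step mentioned in this review, here is a minimal sketch of turning an HTML table into rows of cell text. It is not the PR's implementation: the PR goes through lxml, while this sketch swaps in the standard library's `html.parser` so it runs without third-party packages. The names `TableRows` and `html_table_to_rows` and the sample statement are hypothetical.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # row currently being built, or None outside <tr>
        self._cell = None   # text fragments of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only text inside a cell is kept; whitespace between tags is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            if self._cell is not None and self._row is not None:
                self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr":
            if self._row is not None:
                self.rows.append(self._row)
            self._row = None
            self._cell = None


def html_table_to_rows(decoded_file):
    """Return the table rows found in an HTML document as lists of strings."""
    parser = TableRows()
    parser.feed(decoded_file)
    parser.close()
    return parser.rows


# Made-up sample of an "XLS" export that is really HTML.
sample = """
<html><body><table>
<tr><th>Date</th><th>Label</th><th>Amount</th></tr>
<tr><td>2025-01-30</td><td>Coffee</td><td>-3.50</td></tr>
</table></body></html>
"""
rows = html_table_to_rows(sample)
print(rows)
# [['Date', 'Label', 'Amount'], ['2025-01-30', 'Coffee', '-3.50']]
```

Once the rows are plain lists of strings, they can be handed to the same column-mapping logic used for CSV-like input, which is presumably why a table-to-rows conversion is enough here.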

alexey-pelykh · 2026-02-28T20:53:53Z

                        _("No valid encoding was found for the attached file")
                    ) from None
                decoded_file = data_file.decode(detected_encoding)
+            is_HTML = decoded_file.lower().lstrip().startswith("<html>")


Nit: is_HTML → is_html per PEP 8 (snake_case for local variables).
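
As a small sketch of the suggested rename, the check from the diff can be wrapped in a function with a snake_case local variable. The function name `looks_like_html` and the sample strings are hypothetical; only the expression itself comes from the diff.

```python
def looks_like_html(decoded_file: str) -> bool:
    """Return True when the decoded text starts with a bare <html> tag."""
    # Same expression as in the diff, with the variable renamed per PEP 8.
    is_html = decoded_file.lower().lstrip().startswith("<html>")
    return is_html


print(looks_like_html("  <HTML><body><table></table></body></html>"))  # True
print(looks_like_html("Date;Amount\n2025-01-30;10.00"))                # False
print(looks_like_html('<html lang="en"><body></body></html>'))         # False
```

Note that the last call returns False: the prefix test only matches a bare `<html>` tag, so a document starting with `<html lang="en">` or `<!DOCTYPE html>` would not be detected by this check as written.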

Daemo00 · 2026-03-06T15:21:32Z

I'm no longer working on this one because I have moved to 18.0 (#871); anyone who wants to can supersede it.

Daemo00 marked this pull request as ready for review January 30, 2025 18:44

Daemo00 force-pushed the 17.0-imp-account_statement_import_sheet_file-import_HTML branch from 316b3cd to eca64c3 Compare February 24, 2025 15:11

Daemo00 force-pushed the 17.0-imp-account_statement_import_sheet_file-import_HTML branch from eca64c3 to 7a4d63f Compare March 4, 2025 09:30

Daemo00 force-pushed the 17.0-imp-account_statement_import_sheet_file-import_HTML branch from 7a4d63f to c5f7aa5 Compare April 6, 2025 16:03

Daemo00 force-pushed the 17.0-imp-account_statement_import_sheet_file-import_HTML branch from c5f7aa5 to 6f148f3 Compare July 27, 2025 14:49

github-actions Bot added the stale label Nov 30, 2025

Daemo00 force-pushed the 17.0-imp-account_statement_import_sheet_file-import_HTML branch 2 times, most recently from 276af41 to 6f148f3 Compare November 30, 2025 21:38

[IMP] statement_import_sheet_file: XLS[X] file can actually be HTML

a914254

Daemo00 force-pushed the 17.0-imp-account_statement_import_sheet_file-import_HTML branch from 6f148f3 to a914254 Compare November 30, 2025 21:56

github-actions Bot removed the stale label Dec 7, 2025

Daemo00 mentioned this pull request Feb 2, 2026

[18.0][ADD] statement_import_sheet_html_file #871

Open

alexey-pelykh approved these changes Feb 28, 2026

Daemo00 closed this Mar 6, 2026

Daemo00 wants to merge 1 commit into OCA:17.0 from Daemo00:17.0-imp-account_statement_import_sheet_file-import_HTML
