evaluation: review data, update packages, add magic_html (#731)
* evaluation: review data, update packages, add magic_html

* update data and remove empty files
adbar authored Nov 7, 2024
1 parent 418f807 commit 623b0ef
Showing 30 changed files with 2,231 additions and 16,223 deletions.
11 changes: 6 additions & 5 deletions tests/eval-requirements.txt
@@ -1,9 +1,9 @@
-pandas==2.2.2
+pandas==2.2.3
 tabulate==0.9.0
-tqdm==4.66.4
+tqdm==4.66.5
 # rouge-score==0.1.2

-trafilatura==1.10.0
+trafilatura==1.12.2

 # alternatives
 beautifulsoup4==4.12.3
@@ -16,7 +16,8 @@ inscriptis==2.5.0
 justext==3.0.1
 newspaper3k==0.2.8
 # newspaper4k==0.9.3.1 # replaces newspaper3k if installed
-news-please==1.5.44
+news-please==1.6.13
 # readabilipy==0.2.0 # unmaintained!
 readability-lxml==0.8.1
-resiliparse==0.14.7
+resiliparse==0.14.9
+# magic_html @ git+https://github.com/opendatalab/magic-html
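
Review note: the pin changes in tests/eval-requirements.txt can be checked mechanically. The following is only a minimal sketch using the standard library; the helper names `parse_pins` and `version_bumps` are hypothetical and not part of this repository, and the two strings simply copy the changed pins from the hunks above.

```python
import re

# Changed pins copied from the diff above (old side and new side).
OLD = """pandas==2.2.2
tqdm==4.66.4
trafilatura==1.10.0
news-please==1.5.44
resiliparse==0.14.7
"""
NEW = """pandas==2.2.3
tqdm==4.66.5
trafilatura==1.12.2
news-please==1.6.13
resiliparse==0.14.9
"""

# Matches simple "name==version" pins only; other specifier forms are ignored.
PIN = re.compile(r"^([A-Za-z0-9_.\-]+)==([^\s#]+)")


def parse_pins(text):
    """Return {package: version} for every uncommented 'name==version' line."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blanks and commented-out requirements
        match = PIN.match(line)
        if match:
            pins[match.group(1).lower()] = match.group(2)
    return pins


def version_bumps(old_text, new_text):
    """Return {package: (old_version, new_version)} for pins that changed."""
    old, new = parse_pins(old_text), parse_pins(new_text)
    return {
        name: (old[name], new[name])
        for name in old
        if name in new and old[name] != new[name]
    }


if __name__ == "__main__":
    for name, (before, after) in sorted(version_bumps(OLD, NEW).items()):
        print(f"{name}: {before} -> {after}")
```

Commented-out entries such as the magic_html line are deliberately skipped, since they are not installed by the requirements file as written.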
1,072 changes: 0 additions & 1,072 deletions tests/eval/autohaus.de-mueller.html

This file was deleted.

2,010 changes: 0 additions & 2,010 deletions tests/eval/bmbf.de-forschungsprojekt.html

This file was deleted.

File renamed without changes.
773 changes: 0 additions & 773 deletions tests/eval/bund.net-marode.html

This file was deleted.

781 changes: 0 additions & 781 deletions tests/eval/bund.net.akw.html

This file was deleted.

49 changes: 0 additions & 49 deletions tests/eval/bundeswehrkarriere.de.Laura.html

This file was deleted.

1 change: 0 additions & 1 deletion tests/eval/cecil.de.lieblingsfarbe.html

This file was deleted.

726 changes: 719 additions & 7 deletions tests/eval/changenow.de.loibl.html

Large diffs are not rendered by default.
