[ENH] Preprocess - Use language from Corpus #963

Merged
merged 1 commit into from
Oct 11, 2024

Collaborator

@PrimozGodec PrimozGodec commented Apr 14, 2023

Description of changes

Set the language combo boxes to the language of the corpus.

Additionally, this PR changes the UDPipe combo box so that it keeps its current selection when the requested language is unavailable in UDPipe. Previously, it was always reset to English.
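The selection rule described above can be sketched roughly as follows. This is only an illustration, not the widget's actual code: the function name `resolve_language`, its parameters, and the language list are hypothetical and do not come from the PR.

```python
from typing import Optional, Sequence


def resolve_language(
    corpus_language: Optional[str],
    supported: Sequence[str],
    current: str,
) -> str:
    """Return the language a combo box should show (hypothetical helper).

    If the corpus language is offered by the method, switch to it.
    Otherwise keep the current selection instead of resetting to a default.
    """
    if corpus_language is not None and corpus_language in supported:
        return corpus_language
    return current


# Hypothetical subset of languages offered by one method.
udpipe_languages = ["English", "Slovenian", "German"]

# Corpus language is available: the combo box follows the corpus.
print(resolve_language("German", udpipe_languages, current="English"))

# Corpus language is not available: the existing selection is kept.
print(resolve_language("Klingon", udpipe_languages, current="Slovenian"))
```

Under the earlier behaviour described in this PR, the second call would have produced English regardless of what was selected before.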

Includes
  • Code changes
  • Tests
  • Documentation

@PrimozGodec PrimozGodec force-pushed the language-preprocess branch 2 times, most recently from 4342212 to 1d676a7 Compare April 14, 2023 08:16
@PrimozGodec PrimozGodec force-pushed the language-preprocess branch from 00d9e54 to 4206b70 Compare April 21, 2023 14:21
@PrimozGodec PrimozGodec force-pushed the language-preprocess branch 9 times, most recently from d7ed75d to 56268bc Compare June 1, 2023 09:00
@PrimozGodec PrimozGodec marked this pull request as ready for review June 1, 2023 09:06
@PrimozGodec PrimozGodec force-pushed the language-preprocess branch from 56268bc to 7190a98 Compare June 1, 2023 09:16
@codecov-commenter

codecov-commenter commented Jun 1, 2023

Codecov Report

Merging #963 (d22d1c5) into master (64eee89) will decrease coverage by 0.06%.
The diff coverage is 87.50%.

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #963      +/-   ##
==========================================
- Coverage   82.29%   82.23%   -0.06%     
==========================================
  Files          93       93              
  Lines       12448    12481      +33     
  Branches     1699     1710      +11     
==========================================
+ Hits        10244    10264      +20     
- Misses       1893     1902       +9     
- Partials      311      315       +4     
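For readers who want to check the figures in the table, the percentages can be reproduced from the Hits and Lines rows. The sketch below assumes the percentage is truncated to two decimals rather than rounded; that is inferred from the numbers in this report (rounding would give 82.24% for the new value), and the helper name `coverage_percent` is made up for illustration.

```python
def coverage_percent(hits: int, lines: int, precision: int = 2) -> float:
    """Coverage as a percentage, truncated (not rounded) to `precision` decimals.

    Truncation is an assumption inferred from the figures in this report.
    """
    scale = 10 ** precision
    # Integer arithmetic avoids floating-point surprises before truncation.
    return (hits * 100 * scale // lines) / scale


base = coverage_percent(10244, 12448)  # master
head = coverage_percent(10264, 12481)  # this PR
print(base, head, round(head - base, 2))
```

The 33 added lines split into 20 hits, 9 misses, and 4 partials, which is why the total drops slightly even though most of the new code is covered.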

@PrimozGodec
Collaborator Author

/rebase

@PrimozGodec
Collaborator Author

/rebase

@PrimozGodec PrimozGodec force-pushed the language-preprocess branch 2 times, most recently from f90e4ea to 723e44b Compare December 18, 2023 14:52
@PrimozGodec PrimozGodec force-pushed the language-preprocess branch from 723e44b to 392232c Compare March 8, 2024 09:31
@PrimozGodec PrimozGodec force-pushed the language-preprocess branch 4 times, most recently from dbd7af3 to d22d1c5 Compare March 29, 2024 11:47
@PrimozGodec PrimozGodec marked this pull request as ready for review March 29, 2024 11:47
Contributor

@VesnaT VesnaT left a comment

I guess Lemmagen is trying to find the Danish language.
image

@PrimozGodec PrimozGodec force-pushed the language-preprocess branch 2 times, most recently from 9a47c0c to 3be4d30 Compare August 27, 2024 11:54
@PrimozGodec PrimozGodec marked this pull request as draft August 27, 2024 11:54
@PrimozGodec
Collaborator Author

This draft is waiting for #1080 to be merged.

@PrimozGodec PrimozGodec marked this pull request as ready for review August 29, 2024 14:34
@PrimozGodec
Collaborator Author

@VesnaT, I reimplemented the logic. It is ready for review now.

@VesnaT VesnaT merged commit 8b717fe into biolab:master Oct 11, 2024
9 of 12 checks passed
@PrimozGodec PrimozGodec deleted the language-preprocess branch October 11, 2024 10:39
3 participants