Skip to content
This repository has been archived by the owner on Aug 26, 2024. It is now read-only.

process.dedupe() gives IndexError: list index out of range because of bug in process.extractWithoutOrder() #307

Open
Thijsvandepoll opened this issue Apr 2, 2021 · 0 comments

Comments

@Thijsvandepoll
Copy link

Hi all,

I found a bug in process.extractWithoutOrder() which causes process.dedupe() to fail unexpectedly. The example:

process.dedupe(["BRITT JEFFREY S", "BRITT JEFFREY S.", "WIEDEMAN SCOTT", "WIEDERMANN SCOTT", "斯科特·维德曼", "杰弗里·S·布里特"])

which results in:

IndexError: list index out of range

The expected result here is:

dict_keys(['BRITT JEFFREY S.', 'WIEDERMANN SCOTT', '斯科特·维德曼', '杰弗里·S·布里特'])

I looked into the source code and I believe I found a bug in process.extractWithoutOrder() which sets the used (pre)processor different for the query then for the choices. I will create a merge request to fix this issue.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant