Dependencies: Add khmer-nltk; Utils: Add khmer-nltk's Khmer sentence tokenizer, word tokenizer, and part-of-speech tagger
BLKSerene committed Jul 23, 2023
1 parent b1a72d4 commit e47a52f
Showing 22 changed files with 175 additions and 89 deletions.
53 changes: 27 additions & 26 deletions ACKNOWLEDGMENTS.md
@@ -27,29 +27,30 @@ As Wordless stands on the shoulders of giants, I hereby extend my sincere gratit
2 |[Botok](https://github.com/OpenPecha/Botok) |0.8.12|Hélios Drupchen Hildt|[Apache-2.0](https://github.com/OpenPecha/Botok/blob/master/LICENSE)
3 |[Charset Normalizer](https://github.com/Ousret/charset_normalizer) |3.2.0 |TAHRI Ahmed R.|[MIT](https://github.com/Ousret/charset_normalizer/blob/master/LICENSE)
4 |[jieba](https://github.com/fxsjy/jieba) |0.42.1|Sun Junyi (孙君意)|[MIT](https://github.com/fxsjy/jieba/blob/master/LICENSE)
5 |[Lingua](https://github.com/pemistahl/lingua-py) |1.3.2 |Peter M. Stahl|[Apache-2.0](https://github.com/pemistahl/lingua-py/blob/main/LICENSE.txt)
6 |[Matplotlib](https://matplotlib.org/) |3.7.1 |Matplotlib Development Team|[Matplotlib](https://matplotlib.org/stable/users/project/license.html)
7 |[NetworkX](https://networkx.org/) |3.0 |NetworkX Developers, Aric Hagberg, Dan Schult,<br>Pieter Swart|[BSD-3-Clause](https://github.com/networkx/networkx/blob/main/LICENSE.txt)
8 |[NLTK](https://www.nltk.org/) |3.8.1 |Steven Bird, Edward Loper, Ewan Klein|[Apache-2.0](https://github.com/nltk/nltk/blob/develop/LICENSE.txt)
9 |[NumPy](https://www.numpy.org/) |1.24.2|NumPy Developers|[BSD-3-Clause](https://github.com/numpy/numpy/blob/main/LICENSE.txt)
10|[opencc-python](https://github.com/yichen0831/opencc-python) |0.1.7 |Carbo Kuo (郭家宝), Yicheng Huang|[Apache-2.0](https://github.com/yichen0831/opencc-python/blob/master/LICENSE.txt)
11|[openpyxl](https://foss.heptapod.net/openpyxl/openpyxl) |3.1.2 |Eric Gazoni, Charlie Clark|[MIT](https://foss.heptapod.net/openpyxl/openpyxl/-/blob/branch/3.1/LICENCE.rst)
12|[PyInstaller](http://www.pyinstaller.org/) |5.9.0 |Hartmut Goebel, Jasper Harrison, Bryan A. Jones,<br>Brénainn Woodsend, Rok Mandeljc|[Bootloader-exception](https://github.com/pyinstaller/pyinstaller/blob/develop/COPYING.txt)
13|[pymorphy3](https://github.com/no-plagiarism/pymorphy3) |1.2.0 |Mikhail Korobov, Danylo Halaiko|[MIT](https://github.com/no-plagiarism/pymorphy3/blob/master/LICENSE.txt)
14|[pypdf](https://github.com/py-pdf/pypdf) |3.6.0 |Mathieu Fenniak, Ashish Kulkarni, Steve Witham, Martin Thoma|[BSD-3-Clause](https://github.com/py-pdf/pypdf/blob/main/LICENSE)
15|[Pyphen](https://pyphen.org/) |0.14.0|Guillaume Ayoub|[GPL-2.0-or-later/LGPL-2.1-or-later/MPL-1.1](https://github.com/Kozea/Pyphen/blob/master/LICENSE)
16|[PyQt](https://riverbankcomputing.com/software/pyqt/) |5.15.9|Riverbank Computing|[Commercial-License/GPL-3.0-only](https://www.riverbankcomputing.com/static/Docs/PyQt5/introduction.html#license)
17|[PyThaiNLP](https://github.com/PyThaiNLP/pythainlp) |4.0.2 |Wannaphong Phatthiyaphaibun (วรรณพงษ์ ภัททิยไพบูลย์)|[Apache-2.0](https://github.com/PyThaiNLP/pythainlp/blob/dev/LICENSE)
18|[python-docx](https://github.com/python-openxml/python-docx) |0.8.11|Steve Canny|[MIT](https://github.com/python-openxml/python-docx/blob/master/LICENSE)
19|[python-mecab-ko](https://github.com/jonghwanhyeon/python-mecab-ko)|1.3.3 |Jonghwan Hyeon|[BSD-3-Clause](https://github.com/jonghwanhyeon/python-mecab-ko/blob/main/LICENSE)
20|[Requests](https://github.com/psf/requests) |2.31.0|Kenneth Reitz|[Apache-2.0](https://github.com/psf/requests/blob/main/LICENSE)
21|[Sacremoses](https://github.com/alvations/sacremoses) |0.0.53|Liling Tan|[MIT](https://github.com/alvations/sacremoses/blob/master/LICENSE)
22|[SciPy](https://scipy.org/scipylib/) |1.10.1|SciPy Developers|[BSD-3-Clause](https://github.com/scipy/scipy/blob/main/LICENSE.txt)
23|[simplemma](https://github.com/adbar/simplemma) |0.9.1 |Adrien Barbaresi|[MIT](https://github.com/adbar/simplemma/blob/main/LICENSE)
24|[spaCy](https://spacy.io/) |3.6.0 |Matthew Honnibal, Ines Montani, Sofie Van Landeghem,<br>Adriane Boyd, Paul O'Leary McCann|[MIT](https://github.com/explosion/spaCy/blob/master/LICENSE)
25|[spacy-pkuseg](https://github.com/explosion/spacy-pkuseg) |0.0.32|Ruixuan Luo (罗睿轩), Jingjing Xu (许晶晶),<br>Xuancheng Ren (任宣丞), Yi Zhang (张艺),<br>Zhiyuan Zhang (张之远), Bingzhen Wei (位冰镇),<br>Xu Sun (孙栩)<br>Adriane Boyd, Ines Montani|[MIT](https://github.com/explosion/spacy-pkuseg/blob/master/LICENSE)
26|[stopword](https://github.com/fergiemcdowall/stopword) |2.0.5 |Fergus McDowall|[MIT](https://github.com/fergiemcdowall/stopword/blob/master/LICENSE)
27|[SudachiPy](https://github.com/WorksApplications/sudachi.rs) |0.6.7 |Works Applications Co., Ltd.|[Apache-2.0](https://github.com/WorksApplications/sudachi.rs/blob/develop/LICENSE)
28|[TextBlob](https://github.com/sloria/TextBlob) |0.17.1|Steven Loria|[MIT](https://github.com/sloria/TextBlob/blob/dev/LICENSE)
29|[Underthesea](https://undertheseanlp.com/) |6.2.0 |Vu Anh|[GPL-3.0-or-later](https://github.com/undertheseanlp/underthesea/blob/main/LICENSE)
30|[wordcloud](https://github.com/amueller/word_cloud) |1.9.2 |Andreas Christian Müller|[MIT](https://github.com/amueller/word_cloud/blob/main/LICENSE)
5 |[khmer-nltk](https://github.com/VietHoang1512/khmer-nltk) |1.5 |Phan Viet Hoang|[Apache-2.0](https://github.com/VietHoang1512/khmer-nltk/blob/main/LICENSE)
6 |[Lingua](https://github.com/pemistahl/lingua-py) |1.3.2 |Peter M. Stahl|[Apache-2.0](https://github.com/pemistahl/lingua-py/blob/main/LICENSE.txt)
7 |[Matplotlib](https://matplotlib.org/) |3.7.1 |Matplotlib Development Team|[Matplotlib](https://matplotlib.org/stable/users/project/license.html)
8 |[NetworkX](https://networkx.org/) |3.0 |NetworkX Developers, Aric Hagberg, Dan Schult,<br>Pieter Swart|[BSD-3-Clause](https://github.com/networkx/networkx/blob/main/LICENSE.txt)
9 |[NLTK](https://www.nltk.org/) |3.8.1 |Steven Bird, Edward Loper, Ewan Klein|[Apache-2.0](https://github.com/nltk/nltk/blob/develop/LICENSE.txt)
10|[NumPy](https://www.numpy.org/) |1.24.2|NumPy Developers|[BSD-3-Clause](https://github.com/numpy/numpy/blob/main/LICENSE.txt)
11|[opencc-python](https://github.com/yichen0831/opencc-python) |0.1.7 |Carbo Kuo (郭家宝), Yicheng Huang|[Apache-2.0](https://github.com/yichen0831/opencc-python/blob/master/LICENSE.txt)
12|[openpyxl](https://foss.heptapod.net/openpyxl/openpyxl) |3.1.2 |Eric Gazoni, Charlie Clark|[MIT](https://foss.heptapod.net/openpyxl/openpyxl/-/blob/branch/3.1/LICENCE.rst)
13|[PyInstaller](http://www.pyinstaller.org/) |5.9.0 |Hartmut Goebel, Jasper Harrison, Bryan A. Jones,<br>Brénainn Woodsend, Rok Mandeljc|[Bootloader-exception](https://github.com/pyinstaller/pyinstaller/blob/develop/COPYING.txt)
14|[pymorphy3](https://github.com/no-plagiarism/pymorphy3) |1.2.0 |Mikhail Korobov, Danylo Halaiko|[MIT](https://github.com/no-plagiarism/pymorphy3/blob/master/LICENSE.txt)
15|[pypdf](https://github.com/py-pdf/pypdf) |3.6.0 |Mathieu Fenniak, Ashish Kulkarni, Steve Witham, Martin Thoma|[BSD-3-Clause](https://github.com/py-pdf/pypdf/blob/main/LICENSE)
16|[Pyphen](https://pyphen.org/) |0.14.0|Guillaume Ayoub|[GPL-2.0-or-later/LGPL-2.1-or-later/MPL-1.1](https://github.com/Kozea/Pyphen/blob/master/LICENSE)
17|[PyQt](https://riverbankcomputing.com/software/pyqt/) |5.15.9|Riverbank Computing|[Commercial-License/GPL-3.0-only](https://www.riverbankcomputing.com/static/Docs/PyQt5/introduction.html#license)
18|[PyThaiNLP](https://github.com/PyThaiNLP/pythainlp) |4.0.2 |Wannaphong Phatthiyaphaibun (วรรณพงษ์ ภัททิยไพบูลย์)|[Apache-2.0](https://github.com/PyThaiNLP/pythainlp/blob/dev/LICENSE)
19|[python-docx](https://github.com/python-openxml/python-docx) |0.8.11|Steve Canny|[MIT](https://github.com/python-openxml/python-docx/blob/master/LICENSE)
20|[python-mecab-ko](https://github.com/jonghwanhyeon/python-mecab-ko)|1.3.3 |Jonghwan Hyeon|[BSD-3-Clause](https://github.com/jonghwanhyeon/python-mecab-ko/blob/main/LICENSE)
21|[Requests](https://github.com/psf/requests) |2.31.0|Kenneth Reitz|[Apache-2.0](https://github.com/psf/requests/blob/main/LICENSE)
22|[Sacremoses](https://github.com/alvations/sacremoses) |0.0.53|Liling Tan|[MIT](https://github.com/alvations/sacremoses/blob/master/LICENSE)
23|[SciPy](https://scipy.org/scipylib/) |1.10.1|SciPy Developers|[BSD-3-Clause](https://github.com/scipy/scipy/blob/main/LICENSE.txt)
24|[simplemma](https://github.com/adbar/simplemma) |0.9.1 |Adrien Barbaresi|[MIT](https://github.com/adbar/simplemma/blob/main/LICENSE)
25|[spaCy](https://spacy.io/) |3.6.0 |Matthew Honnibal, Ines Montani, Sofie Van Landeghem,<br>Adriane Boyd, Paul O'Leary McCann|[MIT](https://github.com/explosion/spaCy/blob/master/LICENSE)
26|[spacy-pkuseg](https://github.com/explosion/spacy-pkuseg) |0.0.32|Ruixuan Luo (罗睿轩), Jingjing Xu (许晶晶),<br>Xuancheng Ren (任宣丞), Yi Zhang (张艺),<br>Zhiyuan Zhang (张之远), Bingzhen Wei (位冰镇),<br>Xu Sun (孙栩)<br>Adriane Boyd, Ines Montani|[MIT](https://github.com/explosion/spacy-pkuseg/blob/master/LICENSE)
27|[stopword](https://github.com/fergiemcdowall/stopword) |2.0.5 |Fergus McDowall|[MIT](https://github.com/fergiemcdowall/stopword/blob/master/LICENSE)
28|[SudachiPy](https://github.com/WorksApplications/sudachi.rs) |0.6.7 |Works Applications Co., Ltd.|[Apache-2.0](https://github.com/WorksApplications/sudachi.rs/blob/develop/LICENSE)
29|[TextBlob](https://github.com/sloria/TextBlob) |0.17.1|Steven Loria|[MIT](https://github.com/sloria/TextBlob/blob/dev/LICENSE)
30|[Underthesea](https://undertheseanlp.com/) |6.2.0 |Vu Anh|[GPL-3.0-or-later](https://github.com/undertheseanlp/underthesea/blob/main/LICENSE)
31|[wordcloud](https://github.com/amueller/word_cloud) |1.9.2 |Andreas Christian Müller|[MIT](https://github.com/amueller/word_cloud/blob/main/LICENSE)