Skip to content

Latest commit

 

History

History
52 lines (39 loc) · 2.16 KB

README.md

File metadata and controls

52 lines (39 loc) · 2.16 KB

REPOSITORY FOR

INSTALLATION

To use this project, following packages need to be installed:

  • scikit-learn (using version 0.21.0)
  • joblib (0.13.2)
  • Theano (1.0.4) or tensorflow (1.10.0)
  • Keras (2.2.4)
  • numpy (1.16.3)

Optional (for drawing plots):

  • matplotlib (used version 3.0.3)

If recollecting data from sources and rebuilding datasets following libraries are needed:

  • re
  • csv
  • xml
  • random

Using different version for joblib can render models unusable.

STRUCTURE

Package data contains source code used for extracting data from sources. Example usage is shown in script classifiers_main.py. One of datasets is in folder sources, others will be on Dropbox (link will be added in a day or two) because files were too big to upload on Github.

Package classifiers contains trained models in folder models and training and testing sets in folder datasets. It also contains source code used when searching for best parameters and saving trained models. Example usage is shown in script classifiers_main.py.

Package evaluation contains source code for evaluation of models. Originally used code is evaluate.py, corrected code is evaluate_corrected.py. Example usage is shown in script evaluation_main.py.

USAGE

To use modules, one can import them in terminal and use immediately or write to one of scripts that end with main.py and then execute script.

SOURCES

Extracting data again from sources is possible, but I do not provide source files.

Synonym sources:

Antonym sources:

Author

Jasmina Pegan

LICENSE

Licenca Creative Commons
To delo je objavljeno pod licenco Creative Commons Priznanje avtorstva-Deljenje pod enakimi pogoji 4.0 Mednarodna.