Skip to content

Commit

Permalink
added additional methods; updated version; added news item
Browse files Browse the repository at this point in the history
  • Loading branch information
fracpete committed Sep 6, 2024
1 parent bfe268d commit 685b5c5
Show file tree
Hide file tree
Showing 10 changed files with 546 additions and 56 deletions.
2 changes: 1 addition & 1 deletion docs/maven.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,6 @@ Add the following dependency to your `pom.xml`:
<dependency>
<groupId>net.sf.meka</groupId>
<artifactId>meka</artifactId>
<version>1.9.7</version>
<version>1.9.8</version>
</dependency>
```
5 changes: 5 additions & 0 deletions docs/meka.classifiers.multilabel.MULAN.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,11 @@ http://mulan.sourceforge.net
Method Name
default: RAkEL1

* `-no-rename`

Don't rename attributes
default: rename

* `-W <classifier name>`

Full name of base classifier.
Expand Down
4 changes: 4 additions & 0 deletions docs/meka.classifiers.multilabel.Maniac.md
Original file line number Diff line number Diff line change
Expand Up @@ -100,6 +100,10 @@ J"org Wicker, Andrey Tyukin, Stefan Kramer: A Nonlinear Label Compression and Tr

Output additional statistics.

* `-use-qr`

Use QR decomposition to find coefficients

* `-output-debug-info`

If set, classifier is run in debug mode and
Expand Down
4 changes: 4 additions & 0 deletions docs/meka.classifiers.multilabel.PLST.md
Original file line number Diff line number Diff line change
Expand Up @@ -97,6 +97,10 @@ For more information see:

Output additional statistics.

* `-use-qr`

Use QR decomposition to find coefficients

* `-output-debug-info`

If set, classifier is run in debug mode and
Expand Down
157 changes: 157 additions & 0 deletions docs/meka.classifiers.multilabel.meta.ERFH.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,157 @@
## Synopsis
Extremely Randomised Forest of HOMER trees.

## BibTeX
```
@inproceedings{Li2017,
address = {Cham},
author = {Li, Jinxia and Zheng, Yihan and Han, Chao and Wu, Qingyao and Chen, Jian},
booktitle = {Intelligence Science and Big Data Engineering},
editor = {Sun, Yi and Lu, Huchuan and Zhang, Lihe and Yang, Jian and Huang, Hua},
pages = {450--460},
publisher = {Springer International Publishing},
title = {Extremely Randomized Forest with Hierarchy of Multi-label Classifiers},
year = {2017},
ISBN = {978-3-319-67777-4}
}
```
## Options
* `-T threshold`

Prediction threshold

* `-I <num>`

Sets the number of models (default 10)

* `-P <size percentage>`

Size of each bag, as a percentage of total training size (default 67)

* `-S <seed>`

Random number seed for sampling (default 1)

* `-W <classifier name>`

Full name of base classifier.
(default: meka.classifiers.multilabel.BR)

* `-output-debug-info`

If set, classifier is run in debug mode and
may output additional info to the console

* `-do-not-check-capabilities`

If set, classifier capabilities are not checked before classifier is built
(use with caution).

* `-num-decimal-places`

The number of decimal places for the output of numbers in the model (default 2).

* `-batch-size`

The desired batch size for batch prediction (default 100).

**Options specific to classifier meka.classifiers.multilabel.BR:**

* `-W <classifier name>`

Full name of base classifier.
(default: weka.classifiers.trees.J48)

* `-output-debug-info`

If set, classifier is run in debug mode and
may output additional info to the console

* `-do-not-check-capabilities`

If set, classifier capabilities are not checked before classifier is built
(use with caution).

* `-num-decimal-places`

The number of decimal places for the output of numbers in the model (default 2).

* `-batch-size`

The desired batch size for batch prediction (default 100).

**Options specific to classifier weka.classifiers.trees.J48:**

* `-U`

Use unpruned tree.

* `-O`

Do not collapse tree.

* `-C <pruning confidence>`

Set confidence threshold for pruning.
(default 0.25)

* `-M <minimum number of instances>`

Set minimum number of instances per leaf.
(default 2)

* `-R`

Use reduced error pruning.

* `-N <number of folds>`

Set number of folds for reduced error
pruning. One fold is used as pruning set.
(default 3)

* `-B`

Use binary splits only.

* `-S`

Do not perform subtree raising.

* `-L`

Do not clean up after the tree has been built.

* `-A`

Laplace smoothing for predicted probabilities.

* `-J`

Do not use MDL correction for info gain on numeric attributes.

* `-Q <seed>`

Seed for random data shuffling (default 1).

* `-doNotMakeSplitPointActualValue`

Do not make split point actual value.

* `-output-debug-info`

If set, classifier is run in debug mode and
may output additional info to the console

* `-do-not-check-capabilities`

If set, classifier capabilities are not checked before classifier is built
(use with caution).

* `-num-decimal-places`

The number of decimal places for the output of numbers in the model (default 2).

* `-batch-size`

The desired batch size for batch prediction (default 100).
156 changes: 156 additions & 0 deletions docs/meka.classifiers.multilabel.meta.HOMER.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,156 @@
## Synopsis
HOMER tree algorithm. For more information see:
Tsoumakas, Grigorios, Katakis, Ioannis, Vlahavas, Ioannis: Effective and efficient multilabel classification in domains with large number of labels. In: Proc. ECML/PKDD 2008 Workshop on Mining Multidimensional Data (MMD’08), 53--59, 2008.

## BibTeX
```
@inproceedings{Tsoumakas2008,
author = {Tsoumakas, Grigorios and Katakis, Ioannis and Vlahavas, Ioannis},
booktitle = {Proc. ECML/PKDD 2008 Workshop on Mining Multidimensional Data (MMD’08)},
organization = {sn},
pages = {53--59},
title = {Effective and efficient multilabel classification in domains with large number of labels},
volume = {21},
year = {2008}
}
```
## Options
* `-k K`

The number of partitions per level.

* `-S seed`

The seed to set.

* `-ls class`

The label splitter class to use.

* `-t threshold`

The threshold for the multi-label classifier distribution

* `-W <classifier name>`

Full name of base classifier.
(default: meka.classifiers.multilabel.BR)

* `-output-debug-info`

If set, classifier is run in debug mode and
may output additional info to the console

* `-do-not-check-capabilities`

If set, classifier capabilities are not checked before classifier is built
(use with caution).

* `-num-decimal-places`

The number of decimal places for the output of numbers in the model (default 2).

* `-batch-size`

The desired batch size for batch prediction (default 100).

**Options specific to classifier meka.classifiers.multilabel.BR:**

* `-W <classifier name>`

Full name of base classifier.
(default: weka.classifiers.trees.J48)

* `-output-debug-info`

If set, classifier is run in debug mode and
may output additional info to the console

* `-do-not-check-capabilities`

If set, classifier capabilities are not checked before classifier is built
(use with caution).

* `-num-decimal-places`

The number of decimal places for the output of numbers in the model (default 2).

* `-batch-size`

The desired batch size for batch prediction (default 100).

**Options specific to classifier weka.classifiers.trees.J48:**

* `-U`

Use unpruned tree.

* `-O`

Do not collapse tree.

* `-C <pruning confidence>`

Set confidence threshold for pruning.
(default 0.25)

* `-M <minimum number of instances>`

Set minimum number of instances per leaf.
(default 2)

* `-R`

Use reduced error pruning.

* `-N <number of folds>`

Set number of folds for reduced error
pruning. One fold is used as pruning set.
(default 3)

* `-B`

Use binary splits only.

* `-S`

Do not perform subtree raising.

* `-L`

Do not clean up after the tree has been built.

* `-A`

Laplace smoothing for predicted probabilities.

* `-J`

Do not use MDL correction for info gain on numeric attributes.

* `-Q <seed>`

Seed for random data shuffling (default 1).

* `-doNotMakeSplitPointActualValue`

Do not make split point actual value.

* `-output-debug-info`

If set, classifier is run in debug mode and
may output additional info to the console

* `-do-not-check-capabilities`

If set, classifier capabilities are not checked before classifier is built
(use with caution).

* `-num-decimal-places`

The number of decimal places for the output of numbers in the model (default 2).

* `-batch-size`

The desired batch size for batch prediction (default 100).
Loading

0 comments on commit 685b5c5

Please sign in to comment.