added additional methods; updated version; added news item

Waikato · Sep 6, 2024 · 685b5c5 · 685b5c5
1 parent bfe268d
commit 685b5c5
Show file tree

Hide file tree

Showing 10 changed files with 546 additions and 56 deletions.
diff --git a/docs/maven.md b/docs/maven.md
@@ -3,6 +3,6 @@ Add the following dependency to your `pom.xml`:
   <dependency>
     <groupId>net.sf.meka</groupId>
     <artifactId>meka</artifactId>
-    <version>1.9.7</version>
+    <version>1.9.8</version>
   </dependency>
 ```
diff --git a/docs/meka.classifiers.multilabel.MULAN.md b/docs/meka.classifiers.multilabel.MULAN.md
@@ -8,6 +8,11 @@ http://mulan.sourceforge.net
     Method Name
     default: RAkEL1
 
+* `-no-rename`
+
+    Don't rename attributes
+    default: rename
+
 * `-W <classifier name>`
 
     Full name of base classifier.

diff --git a/docs/meka.classifiers.multilabel.Maniac.md b/docs/meka.classifiers.multilabel.Maniac.md
@@ -100,6 +100,10 @@ J"org Wicker, Andrey Tyukin, Stefan Kramer: A Nonlinear Label Compression and Tr
 
     Output additional statistics.
 
+* `-use-qr`
+
+    Use QR decomposition to find coefficients
+
 * `-output-debug-info`
 
     If set, classifier is run in debug mode and

diff --git a/docs/meka.classifiers.multilabel.PLST.md b/docs/meka.classifiers.multilabel.PLST.md
@@ -97,6 +97,10 @@ For more information see:
 
     Output additional statistics.
 
+* `-use-qr`
+
+    Use QR decomposition to find coefficients
+
 * `-output-debug-info`
 
     If set, classifier is run in debug mode and

diff --git a/docs/meka.classifiers.multilabel.meta.ERFH.md b/docs/meka.classifiers.multilabel.meta.ERFH.md
@@ -0,0 +1,157 @@
+## Synopsis
+Extremely Randomised Forest of HOMER trees.
+
+## BibTeX
+```
+@inproceedings{Li2017,
+   address = {Cham},
+   author = {Li, Jinxia and Zheng, Yihan and Han, Chao and Wu, Qingyao and Chen, Jian},
+   booktitle = {Intelligence Science and Big Data Engineering},
+   editor = {Sun, Yi and Lu, Huchuan and Zhang, Lihe and Yang, Jian and Huang, Hua},
+   pages = {450--460},
+   publisher = {Springer International Publishing},
+   title = {Extremely Randomized Forest with Hierarchy of Multi-label Classifiers},
+   year = {2017},
+   ISBN = {978-3-319-67777-4}
+}
+```
+## Options
+* `-T threshold`
+
+    Prediction threshold
+
+* `-I <num>`
+
+    Sets the number of models (default 10)
+
+* `-P <size percentage>`
+
+    Size of each bag, as a percentage of total training size (default 67)
+
+* `-S <seed>`
+
+    Random number seed for sampling (default 1)
+
+* `-W <classifier name>`
+
+    Full name of base classifier.
+    (default: meka.classifiers.multilabel.BR)
+
+* `-output-debug-info`
+
+    If set, classifier is run in debug mode and
+    may output additional info to the console
+
+* `-do-not-check-capabilities`
+
+    If set, classifier capabilities are not checked before classifier is built
+    (use with caution).
+
+* `-num-decimal-places`
+
+    The number of decimal places for the output of numbers in the model (default 2).
+
+* `-batch-size`
+
+    The desired batch size for batch prediction  (default 100).
+
+**Options specific to classifier meka.classifiers.multilabel.BR:**
+
+* `-W <classifier name>`
+
+    Full name of base classifier.
+    (default: weka.classifiers.trees.J48)
+
+* `-output-debug-info`
+
+    If set, classifier is run in debug mode and
+    may output additional info to the console
+
+* `-do-not-check-capabilities`
+
+    If set, classifier capabilities are not checked before classifier is built
+    (use with caution).
+
+* `-num-decimal-places`
+
+    The number of decimal places for the output of numbers in the model (default 2).
+
+* `-batch-size`
+
+    The desired batch size for batch prediction  (default 100).
+
+**Options specific to classifier weka.classifiers.trees.J48:**
+
+* `-U`
+
+    Use unpruned tree.
+
+* `-O`
+
+    Do not collapse tree.
+
+* `-C <pruning confidence>`
+
+    Set confidence threshold for pruning.
+    (default 0.25)
+
+* `-M <minimum number of instances>`
+
+    Set minimum number of instances per leaf.
+    (default 2)
+
+* `-R`
+
+    Use reduced error pruning.
+
+* `-N <number of folds>`
+
+    Set number of folds for reduced error
+    pruning. One fold is used as pruning set.
+    (default 3)
+
+* `-B`
+
+    Use binary splits only.
+
+* `-S`
+
+    Do not perform subtree raising.
+
+* `-L`
+
+    Do not clean up after the tree has been built.
+
+* `-A`
+
+    Laplace smoothing for predicted probabilities.
+
+* `-J`
+
+    Do not use MDL correction for info gain on numeric attributes.
+
+* `-Q <seed>`
+
+    Seed for random data shuffling (default 1).
+
+* `-doNotMakeSplitPointActualValue`
+
+    Do not make split point actual value.
+
+* `-output-debug-info`
+
+    If set, classifier is run in debug mode and
+    may output additional info to the console
+
+* `-do-not-check-capabilities`
+
+    If set, classifier capabilities are not checked before classifier is built
+    (use with caution).
+
+* `-num-decimal-places`
+
+    The number of decimal places for the output of numbers in the model (default 2).
+
+* `-batch-size`
+
+    The desired batch size for batch prediction  (default 100).
diff --git a/docs/meka.classifiers.multilabel.meta.HOMER.md b/docs/meka.classifiers.multilabel.meta.HOMER.md
@@ -0,0 +1,156 @@
+## Synopsis
+HOMER tree algorithm. For more information see:
+Tsoumakas, Grigorios, Katakis, Ioannis, Vlahavas, Ioannis: Effective and efficient multilabel classification in domains with large number of labels. In: Proc. ECML/PKDD 2008 Workshop on Mining Multidimensional Data (MMD’08), 53--59, 2008.
+
+## BibTeX
+```
+@inproceedings{Tsoumakas2008,
+   author = {Tsoumakas, Grigorios and Katakis, Ioannis and Vlahavas, Ioannis},
+   booktitle = {Proc. ECML/PKDD 2008 Workshop on Mining Multidimensional Data (MMD’08)},
+   organization = {sn},
+   pages = {53--59},
+   title = {Effective and efficient multilabel classification in domains with large number of labels},
+   volume = {21},
+   year = {2008}
+}
+```
+## Options
+* `-k K`
+
+    The number of partitions per level.
+
+* `-S seed`
+
+    The seed to set.
+
+* `-ls class`
+
+    The label splitter class to use.
+
+* `-t threshold`
+
+    The threshold for the multi-label classifier distribution
+
+* `-W <classifier name>`
+
+    Full name of base classifier.
+    (default: meka.classifiers.multilabel.BR)
+
+* `-output-debug-info`
+
+    If set, classifier is run in debug mode and
+    may output additional info to the console
+
+* `-do-not-check-capabilities`
+
+    If set, classifier capabilities are not checked before classifier is built
+    (use with caution).
+
+* `-num-decimal-places`
+
+    The number of decimal places for the output of numbers in the model (default 2).
+
+* `-batch-size`
+
+    The desired batch size for batch prediction  (default 100).
+
+**Options specific to classifier meka.classifiers.multilabel.BR:**
+
+* `-W <classifier name>`
+
+    Full name of base classifier.
+    (default: weka.classifiers.trees.J48)
+
+* `-output-debug-info`
+
+    If set, classifier is run in debug mode and
+    may output additional info to the console
+
+* `-do-not-check-capabilities`
+
+    If set, classifier capabilities are not checked before classifier is built
+    (use with caution).
+
+* `-num-decimal-places`
+
+    The number of decimal places for the output of numbers in the model (default 2).
+
+* `-batch-size`
+
+    The desired batch size for batch prediction  (default 100).
+
+**Options specific to classifier weka.classifiers.trees.J48:**
+
+* `-U`
+
+    Use unpruned tree.
+
+* `-O`
+
+    Do not collapse tree.
+
+* `-C <pruning confidence>`
+
+    Set confidence threshold for pruning.
+    (default 0.25)
+
+* `-M <minimum number of instances>`
+
+    Set minimum number of instances per leaf.
+    (default 2)
+
+* `-R`
+
+    Use reduced error pruning.
+
+* `-N <number of folds>`
+
+    Set number of folds for reduced error
+    pruning. One fold is used as pruning set.
+    (default 3)
+
+* `-B`
+
+    Use binary splits only.
+
+* `-S`
+
+    Do not perform subtree raising.
+
+* `-L`
+
+    Do not clean up after the tree has been built.
+
+* `-A`
+
+    Laplace smoothing for predicted probabilities.
+
+* `-J`
+
+    Do not use MDL correction for info gain on numeric attributes.
+
+* `-Q <seed>`
+
+    Seed for random data shuffling (default 1).
+
+* `-doNotMakeSplitPointActualValue`
+
+    Do not make split point actual value.
+
+* `-output-debug-info`
+
+    If set, classifier is run in debug mode and
+    may output additional info to the console
+
+* `-do-not-check-capabilities`
+
+    If set, classifier capabilities are not checked before classifier is built
+    (use with caution).
+
+* `-num-decimal-places`
+
+    The number of decimal places for the output of numbers in the model (default 2).
+
+* `-batch-size`
+
+    The desired batch size for batch prediction  (default 100).