- Install the package: `pip install --upgrade git+https://github.com/MakiResearchTeam/MultiClassLabelBalancing.git`
- Perform dataset analysis using HCScanner:
  - Generate a `.npy` file containing information about labelsets (labelsets, alphas).
- Perform balancing using available algorithms:
  - Generate a new `.npy` file containing the same information about labelsets (labelsets, alphas), but with new alphas.
- Construct a new dataset:
  - Copy existing data according to the new alphas, then sample uniformly from the resulting dataset.
  - Or build a generator that samples pictures containing particular labelsets according to a given distribution over labelsets.
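
The two construction options in the last step can be sketched in plain NumPy. This is only an illustration under stated assumptions: it treats alphas as relative target weights per labelset and assumes each sample has already been assigned the index of its labelset. Neither assumption is confirmed by the package, and the function names below are hypothetical, not part of its API.

```python
from itertools import islice

import numpy as np


def _group_by_labelset(sample_labelset_ids, n_labelsets):
    """Return, for each labelset index, the indices of the samples carrying it."""
    ids = np.asarray(sample_labelset_ids)
    groups = [np.flatnonzero(ids == k) for k in range(n_labelsets)]
    if any(len(g) == 0 for g in groups):
        raise ValueError("every labelset needs at least one sample")
    return groups


def oversampled_indices(sample_labelset_ids, alphas, total):
    """Option 1 (hypothetical helper): copy samples so that labelset k
    fills roughly alphas[k] / sum(alphas) of a dataset of size `total`."""
    probs = np.asarray(alphas, dtype=float)
    probs = probs / probs.sum()
    groups = _group_by_labelset(sample_labelset_ids, len(probs))
    counts = np.rint(probs * total).astype(int)
    # np.resize repeats a group cyclically (or truncates it) to the target length.
    return np.concatenate([np.resize(g, int(n)) for g, n in zip(groups, counts)])


def labelset_sampler(sample_labelset_ids, alphas, rng):
    """Option 2 (hypothetical helper): endlessly yield sample indices by
    drawing a labelset with probability proportional to its alpha, then
    drawing one of its samples uniformly."""
    probs = np.asarray(alphas, dtype=float)
    probs = probs / probs.sum()
    groups = _group_by_labelset(sample_labelset_ids, len(probs))
    while True:
        k = rng.choice(len(probs), p=probs)
        yield int(rng.choice(groups[k]))


# Toy data: 10 samples, 3 labelsets; labelset 0 dominates the original data.
ids = np.array([0, 0, 0, 0, 0, 0, 1, 1, 2, 2])
new_alphas = [1.0, 1.0, 2.0]  # assumed meaning: relative target weights

balanced = oversampled_indices(ids, new_alphas, total=8)
rng = np.random.default_rng(0)
batch = list(islice(labelset_sampler(ids, new_alphas, rng), 4))
```

With the first option, uniform sampling from `balanced` (for example via `rng.permutation(balanced)`) gives the rebalanced dataset. The second option avoids materializing copies, which may matter when the samples are large images.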