Skip to content

active learning data pruning #2

@anhhuyalex

Description

@anhhuyalex

jan21_2321_pruning: filter out 8500 images / 9000 in cache

jan22_2221_pruning_1000:

  • random 8000 sets
  • pruned from best 8500 images: start with
    if [int(i) for i in samples_list[1:100].split(" ") if i != ""] != [70420, 2598, 63069, 32343, 37942, 25934, 32988, 15835, 50918, 48882, 28161, 13467, 6373, 1247, 41268, 66508, 31]: continue

generally jobs look like this

jan22_2221_pruning_${NUM_SAMPLES}from${TOTAL_SAMPLES}
jan22_2221_pruning_1500_from_8000 vs. jan22_2221_pruning_1500_from_9000
jan22_2221_pruning_2000_from_7500 vs. jan22_2221_pruning_2000_from_9000
jan22_2221_pruning_2500_from_7000 vs. jan22_2221_pruning_2500_from_9000
jan22_2221_pruning_3000_from_6500 vs. jan22_2221_pruning_3000_from_9000
jan22_2221_pruning_3500_from_6000 vs. jan22_2221_pruning_3500_from_9000
vs. jan22_2221_pruning_8500_from_9000

forward pruning (adding data samples from small runs instead of taking out data samples)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions