Add mined-vs-random negatives experiment with measured results by DaoyuanLi2816 · Pull Request #10 · DaoyuanLi2816/labelbank

DaoyuanLi2816 · 2026-06-11T05:07:39Z

Measures the library's central claim end to end through its public API on a public dataset (banking77, 77-intent closed bank; Qwen2.5-0.5B-Instruct + LoRA; 2,000 train / 1,000 held-out; pools of 8; identical budgets per arm; one RTX 4080, ~1 h):

arm	MAP@25	R@1
zero-shot	0.069	1.9%
random negatives (bootstrap)	0.788	67.6%
self-mined, round 1	0.838	76.2%
self-mined, round 2	0.839	75.7%

+5.0 MAP / +8.6 R@1 for mined pools over random at the same budget, concentrated at top-1 where sibling labels collide (R@10 saturated for both arms).
--cold-start ablation: mining round 1 from the zero-shot model instead of the bootstrap model collapses to MAP 0.430 — far below random. Hard negatives are only as good as their miner; the experiment mirrors the competition protocol (bootstrap → mine from the previous round's model, fresh adapter per round), and the README now states this caveat with numbers.
README gains a "Measured: do mined negatives beat random ones?" section with the full table and reproduce commands; results JSON archived under the script's output dir.

🤖 Generated with Claude Code

banking77, Qwen2.5-0.5B + LoRA, identical per-arm budgets, RTX 4080: self-mined pools beat random negatives by +5.0 MAP@25 / +8.6 R@1, with the gain concentrated at top-1 where sibling labels collide. The --cold-start ablation shows mining from an untrained model collapses to MAP 0.430 (below random) - hard negatives are only as good as their miner, which is why the pipeline bootstraps on random negatives first (the competition protocol).

DaoyuanLi2816 merged commit 5c25654 into main Jun 11, 2026
2 checks passed

DaoyuanLi2816 deleted the feat/mined-negatives-experiment branch June 11, 2026 05:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add mined-vs-random negatives experiment with measured results#10

Add mined-vs-random negatives experiment with measured results#10
DaoyuanLi2816 merged 1 commit into
mainfrom
feat/mined-negatives-experiment

DaoyuanLi2816 commented Jun 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

DaoyuanLi2816 commented Jun 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant