Skip to content

Add mined-vs-random negatives experiment with measured results#10

Merged
DaoyuanLi2816 merged 1 commit into
mainfrom
feat/mined-negatives-experiment
Jun 11, 2026
Merged

Add mined-vs-random negatives experiment with measured results#10
DaoyuanLi2816 merged 1 commit into
mainfrom
feat/mined-negatives-experiment

Conversation

@DaoyuanLi2816

Copy link
Copy Markdown
Owner

Measures the library's central claim end to end through its public API on a public dataset (banking77, 77-intent closed bank; Qwen2.5-0.5B-Instruct + LoRA; 2,000 train / 1,000 held-out; pools of 8; identical budgets per arm; one RTX 4080, ~1 h):

arm MAP@25 R@1
zero-shot 0.069 1.9%
random negatives (bootstrap) 0.788 67.6%
self-mined, round 1 0.838 76.2%
self-mined, round 2 0.839 75.7%
  • +5.0 MAP / +8.6 R@1 for mined pools over random at the same budget, concentrated at top-1 where sibling labels collide (R@10 saturated for both arms).
  • --cold-start ablation: mining round 1 from the zero-shot model instead of the bootstrap model collapses to MAP 0.430 — far below random. Hard negatives are only as good as their miner; the experiment mirrors the competition protocol (bootstrap → mine from the previous round's model, fresh adapter per round), and the README now states this caveat with numbers.
  • README gains a "Measured: do mined negatives beat random ones?" section with the full table and reproduce commands; results JSON archived under the script's output dir.

🤖 Generated with Claude Code

banking77, Qwen2.5-0.5B + LoRA, identical per-arm budgets, RTX 4080:
self-mined pools beat random negatives by +5.0 MAP@25 / +8.6 R@1, with
the gain concentrated at top-1 where sibling labels collide. The
--cold-start ablation shows mining from an untrained model collapses to
MAP 0.430 (below random) - hard negatives are only as good as their
miner, which is why the pipeline bootstraps on random negatives first
(the competition protocol).
@DaoyuanLi2816 DaoyuanLi2816 merged commit 5c25654 into main Jun 11, 2026
2 checks passed
@DaoyuanLi2816 DaoyuanLi2816 deleted the feat/mined-negatives-experiment branch June 11, 2026 05:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant