luoli830/IR
IR - final project report

The implementation is based on the source code of the following projects:

  1. DRhard -- https://github.com/jingtaozhan/DRhard
  2. JTR -- https://github.com/CSHaitao/JTR/tree/main

Run the following command to preprocess the biomedical dataset:
python preprocess_bio.py --data_type 0

STAR: use the provided STAR model to compute query and passage embeddings, then run similarity search over the biomedical dataset:
python inference.py --data_type doc --max_doc_length 512 --mode bio-train
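
For readers unfamiliar with the similarity-search step, here is a minimal, dependency-free sketch of what it amounts to once embeddings exist. It assumes inner-product scoring over dense vectors; the function names (`inner_product`, `top_k`) and the toy vectors are hypothetical and are not taken from inference.py, which may use an optimized index instead of a brute-force scan.

```python
import heapq


def inner_product(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def top_k(query_emb, passage_embs, k=2):
    """Return the k highest-scoring (score, passage_index) pairs.

    Brute-force scan; assumes inner-product similarity (an assumption,
    not confirmed by this repository).
    """
    scored = (
        (inner_product(query_emb, p), idx)
        for idx, p in enumerate(passage_embs)
    )
    return heapq.nlargest(k, scored)


# Toy embeddings, for illustration only.
passages = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
query = [1.0, 0.2]

results = top_k(query, passages, k=2)
ranked_ids = [idx for _, idx in results]
print(ranked_ids)
```

A real run would replace the toy lists with the embeddings written by the inference step.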

Tree Initialization: after embedding the documents and queries, initialize the tree using recursive k-means.
Run the following command in the JTR repo:
python construct_tree.py
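
The idea behind recursive k-means initialization can be sketched as follows. This is an illustrative, dependency-free version and not the code in construct_tree.py: the helper names (`kmeans`, `build_tree`, `leaves`), the `branch` and `leaf_size` parameters, and the toy embeddings are all assumptions made for the example. Documents are clustered into `branch` groups, and each group is split again until it holds at most `leaf_size` documents.

```python
import random


def sq_dist(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def kmeans(points, k, iters=10, rng=None):
    """Plain Lloyd's k-means; returns one cluster index per point."""
    rng = rng or random.Random(0)
    centers = rng.sample(points, k)
    assign = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center.
        assign = [
            min(range(k), key=lambda c: sq_dist(p, centers[c]))
            for p in points
        ]
        # Move each center to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:
                centers[c] = mean(members)
    return assign


def build_tree(doc_ids, embs, branch=2, leaf_size=2, rng=None):
    """Recursively split doc_ids until each leaf has <= leaf_size docs."""
    rng = rng or random.Random(0)
    if len(doc_ids) <= leaf_size:
        return {"docs": list(doc_ids)}
    points = [embs[d] for d in doc_ids]
    assign = kmeans(points, min(branch, len(points)), rng=rng)
    groups = [
        [d for d, a in zip(doc_ids, assign) if a == c]
        for c in range(branch)
    ]
    groups = [g for g in groups if g]
    if len(groups) < 2:
        # Degenerate split (e.g. duplicate points): stop recursing.
        return {"docs": list(doc_ids)}
    return {
        "children": [
            build_tree(g, embs, branch, leaf_size, rng) for g in groups
        ]
    }


def leaves(node):
    """Collect the document lists stored at the leaves."""
    if "docs" in node:
        return [node["docs"]]
    return [leaf for child in node["children"] for leaf in leaves(child)]


# Toy document embeddings: two well-separated groups.
embs = {0: [0.0, 0.0], 1: [0.1, 0.0], 2: [5.0, 5.0], 3: [5.1, 5.0]}
tree = build_tree(list(embs), embs, branch=2, leaf_size=2)
leaf_sets = sorted(sorted(leaf) for leaf in leaves(tree))
print(leaf_sets)
```

In this sketch each leaf holds a small set of document ids, so nearby documents in embedding space end up under the same subtree.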
