IR - final project report

The implementation is based on the following projects:

DRhard -- https://github.com/jingtaozhan/DRhard
JTR -- https://github.com/CSHaitao/JTR/tree/main

Run the following command to preprocess the biomedical dataset:
python preprocess_bio.py --data_type 0

STAR: use the provided STAR model to compute query and passage embeddings, then perform similarity search on the biomedical dataset:
python inference.py --data_type doc --max_doc_length 512 --mode bio-train
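To illustrate the similarity-search step, here is a minimal sketch that ranks passages by inner product with each query embedding. It is not the code in inference.py: the function name `top_k_inner_product` and the toy arrays are hypothetical, and brute-force NumPy scoring is assumed here in place of whatever index the actual pipeline uses.

```python
import numpy as np

def top_k_inner_product(query_emb, doc_emb, k):
    """Return (indices, scores) of the k highest inner-product passages per query.

    query_emb: array of shape (num_queries, dim)
    doc_emb:   array of shape (num_docs, dim)
    """
    query_emb = np.asarray(query_emb, dtype=np.float32)
    doc_emb = np.asarray(doc_emb, dtype=np.float32)
    k = min(k, doc_emb.shape[0])
    # Score every (query, passage) pair: shape (num_queries, num_docs).
    scores = query_emb @ doc_emb.T
    # Stable sort on negated scores gives descending order with ties broken by index.
    order = np.argsort(-scores, axis=1, kind="stable")[:, :k]
    return order, np.take_along_axis(scores, order, axis=1)

# Toy data (hypothetical): three passages and one query in a 2-d embedding space.
docs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
queries = [[1.0, 0.0]]
indices, scores = top_k_inner_product(queries, docs, k=2)
```

Brute-force scoring is only practical for small collections; for a full corpus an approximate or exact nearest-neighbor index would normally replace the matrix product.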

Tree Initialization: after embedding the documents and queries, initialize the tree using recursive k-means.
Run the following command in the JTR repo:
python construct_tree.py
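The idea of recursive k-means can be sketched as follows: cluster the document embeddings into a fixed number of groups, then repeat on each group until it is small enough to become a leaf. This is an illustrative sketch only, not the logic of construct_tree.py; the names `kmeans`, `build_tree`, `leaves`, `branching`, and `leaf_size` are assumptions, and a plain NumPy Lloyd's iteration stands in for whichever k-means implementation JTR uses.

```python
import numpy as np

def kmeans(points, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    rng = np.random.default_rng(seed)
    k = min(k, len(points))
    # Initialize centers from k distinct points (fancy indexing makes a copy).
    centers = points[rng.choice(len(points), size=k, replace=False)]
    labels = np.zeros(len(points), dtype=int)
    for _ in range(n_iter):
        # Squared Euclidean distance from every point to every center.
        dists = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for c in range(k):
            members = points[labels == c]
            if len(members) > 0:
                centers[c] = members.mean(axis=0)
    return labels

def build_tree(embeddings, doc_ids, branching=2, leaf_size=2, seed=0):
    """Recursively split documents with k-means until nodes hold <= leaf_size docs."""
    embeddings = np.asarray(embeddings, dtype=np.float64)
    node = {"doc_ids": list(doc_ids), "children": []}
    if len(doc_ids) <= leaf_size:
        return node
    labels = kmeans(embeddings, branching, seed=seed)
    groups = [np.flatnonzero(labels == c) for c in range(branching)]
    groups = [g for g in groups if len(g) > 0]
    if len(groups) < 2:
        # k-means could not separate the points (e.g. duplicates): keep as a leaf.
        return node
    node["children"] = [
        build_tree(embeddings[g], [doc_ids[i] for i in g], branching, leaf_size, seed)
        for g in groups
    ]
    return node

def leaves(node):
    """Collect the document ids stored at each leaf of the tree."""
    if not node["children"]:
        return [node["doc_ids"]]
    return [ids for child in node["children"] for ids in leaves(child)]

# Toy data (hypothetical): two well-separated groups of 2-d embeddings.
emb = [[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]]
tree = build_tree(emb, doc_ids=[0, 1, 2, 3], branching=2, leaf_size=2)
```

Calling `leaves(tree)` lists the documents grouped under each leaf, which is a quick way to check that nearby embeddings ended up together.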
