The paper has been accepted at ISWC 2025. A preprint that has not undergone peer review is available at https://arxiv.org/abs/2508.16983.
We present ReFactX, a scalable method that enables LLMs to access external knowledge without depending on retrievers or auxiliary models. Our approach uses constrained generation with a pre-built prefix-tree index: triples from Wikidata are verbalized into 800 million textual facts, tokenized, and indexed in a prefix tree for efficient access. During inference, the LLM acquires external knowledge by generating facts under constrained generation, which allows only sequences of tokens that form an existing fact.
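To illustrate the idea, here is a minimal sketch of such an index: each fact's token-id sequence is stored in a nested-dict prefix tree, so that walking the tree with a partial sequence yields exactly the tokens that can legally come next. The tokenizer choice, fact strings, and tree layout below are illustrative assumptions, not the index format ReFactX actually ships.

```python
# Minimal prefix-tree index sketch (illustrative; not ReFactX's on-disk format).
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # any tokenizer works for the sketch

facts = [
    "Rome is the capital of Italy.",
    "Rome is a city in Italy.",
]

END = -1  # sentinel marking the end of a complete fact
trie: dict = {}
for fact in facts:
    node = trie
    for tok in tokenizer.encode(fact) + [END]:
        node = node.setdefault(tok, {})

# Walking the tree with the tokens generated so far gives the allowed next tokens.
node = trie
for tok in tokenizer.encode("Rome is"):
    node = node.get(tok, {})
print([tokenizer.decode([t]) for t in node if t != END])  # e.g. [' the', ' a']
```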
This repository contains the source code for using ReFactX and reproducing our work accepted at ISWC 2025.
- Install the requirements: `pip install -r requirements.txt`
- Prepare the `.env` file: `cp env-sample.txt .env`, then edit `.env` (can be skipped if using the simple index in the `try_refactx` notebook)
To quickly try ReFactX with an in-memory prefix tree (derived from a 31k-fact knowledge base), use the notebook `try_refactx.ipynb`.
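For a standalone taste of the same mechanism outside the notebook, the sketch below wires a tiny in-memory prefix tree into Hugging Face's `prefix_allowed_tokens_fn`. It is a hypothetical example, not the notebook's actual code: the facts, model choice, and tree layout are made up for illustration.

```python
# Hypothetical standalone sketch of constrained fact generation; the notebook's
# real index and API differ.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

facts = ["Rome is the capital of Italy.", "Paris is the capital of France."]
trie: dict = {}
for fact in facts:
    node = trie
    # Terminate each fact with EOS so generation stops at a complete fact.
    for tok in tokenizer.encode(fact) + [tokenizer.eos_token_id]:
        node = node.setdefault(tok, {})

input_ids = torch.tensor([[tokenizer.bos_token_id]])
prompt_len = input_ids.shape[1]

def allowed_tokens(batch_id, sent):
    # Walk the trie with the tokens generated so far; only continuations that
    # stay inside an indexed fact are permitted.
    node = trie
    for tok in sent[prompt_len:].tolist():
        node = node.get(tok, {})
    return list(node) or [tokenizer.eos_token_id]

out = model.generate(
    input_ids,
    prefix_allowed_tokens_fn=allowed_tokens,
    max_new_tokens=20,
    do_sample=False,
)
print(tokenizer.decode(out[0, prompt_len:], skip_special_tokens=True))
```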
Refer to `PrefixTree.md` for creating the Wikidata prefix tree we used in our work.
To reproduce our experiments, use the `eval.py` script, replacing INDEX, MODEL, and DATASET according to your needs (each of them is a Python file to import):

```
python eval.py --index INDEX --model MODEL --dataset DATASET
```
For the throughput experiment, run:

```
python throughput.py --model MODEL --index INDEX --max-tokens 4001 --output out.json [--unconstrained-generation]
```
```bibtex
@misc{pozzi2025refactxscalablereasoningreliable,
  title={ReFactX: Scalable Reasoning with Reliable Facts via Constrained Generation},
  author={Riccardo Pozzi and Matteo Palmonari and Andrea Coletta and Luigi Bellomarini and Jens Lehmann and Sahar Vahdati},
  year={2025},
  eprint={2508.16983},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2508.16983},
  doi={10.48550/arXiv.2508.16983},
}
```