Loki: Open-source solution designed to automate the process of verifying factuality
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
RefChecker provides an automatic checking pipeline and a benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.
😎 A curated list of awesome LMM hallucination papers, methods & resources.
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strategy.
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
An up-to-date curated list of state-of-the-art research work, papers & resources on hallucinations in large vision-language models.
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without a custom rubric or reference answer, in absolute or relative mode, and more. It also lists available tools, methods, repos, and code for hallucination detection, LLM evaluation, and grading.
"Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" by Jiarui Li and Ye Yuan and Zehua Zhang
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
Official Implementation of 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
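
Several of the entries above, most directly the RAG case study and RefChecker, revolve around the same retrieve-then-check idea: gather evidence passages, then score each generated claim against them and flag the ones that lack support. The sketch below illustrates that pattern in a minimal, dependency-free form; the token-overlap scorer, the 0.6 threshold, and the toy corpus are illustrative assumptions, not the API or method of any project listed here.

```python
# Minimal sketch of a retrieve-then-check hallucination filter.
# The corpus, scoring function, and threshold are illustrative only.
from dataclasses import dataclass


@dataclass
class Claim:
    text: str
    supported: bool = False
    evidence: str = ""


def token_overlap(claim: str, passage: str) -> float:
    """Crude support score: fraction of claim tokens that appear in the passage."""
    claim_tokens = set(claim.lower().split())
    passage_tokens = set(passage.lower().split())
    if not claim_tokens:
        return 0.0
    return len(claim_tokens & passage_tokens) / len(claim_tokens)


def check_claims(claims: list[str], corpus: list[str], threshold: float = 0.6) -> list[Claim]:
    """Mark each claim as supported if some passage exceeds the overlap threshold."""
    results = []
    for text in claims:
        best_passage, best_score = "", 0.0
        for passage in corpus:
            score = token_overlap(text, passage)
            if score > best_score:
                best_passage, best_score = passage, score
        results.append(Claim(text, supported=best_score >= threshold, evidence=best_passage))
    return results


if __name__ == "__main__":
    corpus = ["The Eiffel Tower is located in Paris and opened in 1889."]
    claims = [
        "The Eiffel Tower opened in 1889.",
        "The Eiffel Tower was designed by Antoni Gaudí.",
    ]
    for c in check_claims(claims, corpus):
        label = "SUPPORTED  " if c.supported else "UNSUPPORTED"
        print(f"{label} | {c.text}")
```

Real pipelines replace the overlap score with an entailment or verification model and extract claims from responses automatically, but the control flow is the same: retrieve evidence, score each claim against it, and surface the unsupported ones.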