KERMIT4NLI

Project Datasets

MultiNLI (https://www.aclweb.org/anthology/N18-1101.pdf)

This paper introduces the Multi-Genre Natural Language Inference (MultiNLI) corpus, a dataset designed for use in the development and evaluation of machine learning models for sentence understanding. At 433k examples, this resource is one of the largest corpora available for natural language inference (a.k.a. recognizing textual entailment), improving upon available resources in both its coverage and difficulty. MultiNLI accomplishes this by offering data from ten distinct genres of written and spoken English, making it possible to evaluate systems on nearly the full complexity of the language, while supplying an explicit setting for evaluating cross-genre domain adaptation. In addition, an evaluation using existing machine learning models designed for the Stanford NLI corpus shows that it represents a substantially more difficult task than does that corpus, despite the two showing similar levels of inter-annotator agreement.
HANS (https://www.aclweb.org/anthology/P19-1334.pdf)

A machine learning system can score well on a given test set by relying on heuristics that are effective for frequent example types but break down in more challenging cases. We study this issue within natural language inference (NLI), the task of determining whether one sentence entails another. We hypothesize that statistical NLI models may adopt three fallible syntactic heuristics: the lexical overlap heuristic, the subsequence heuristic, and the constituent heuristic. To determine whether models have adopted these heuristics, we introduce a controlled evaluation set called HANS (Heuristic Analysis for NLI Systems), which contains many examples where the heuristics fail. We find that models trained on MNLI, including BERT, a state-of-the-art model, perform very poorly on HANS, suggesting that they have indeed adopted these heuristics. We conclude that there is substantial room for improvement in NLI systems, and that the HANS dataset can motivate and measure progress in this area.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
BERT+Kermit_NLI_model.ipynb		BERT+Kermit_NLI_model.ipynb
BERT_ONLY_NLI_model.ipynb		BERT_ONLY_NLI_model.ipynb
README.md		README.md
RelevanceTrees-DatasetGenerator.ipynb		RelevanceTrees-DatasetGenerator.ipynb
collins_head_rules_SH.xml		collins_head_rules_SH.xml
generate_tree_KERMIT-sentence.ipynb		generate_tree_KERMIT-sentence.ipynb
json_rules.json		json_rules.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KERMIT4NLI

Project Datasets

Models

Bert Only

Bert + Kermit

About

Releases

Packages

Contributors 2

Languages

ART-Group-it/KERMIT4NLI

Folders and files

Latest commit

History

Repository files navigation

KERMIT4NLI

Project Datasets

Models

Bert Only

Bert + Kermit

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages