Implementation of the paper submitted to ACS.
Transformer-based models have remarkably improved the performance of many natural language processing tasks in recent years. However, their input length is a major limitation, mainly due to the computational complexity of the attention mechanism. Inspired by the human tendency to ignore many words during reading comprehension, we study the effect of removing tokens from a sequence on prediction performance in sentiment analysis. In this preliminary paper, we analyze a length-reduction system based on layer-level attention scores to determine what fraction of the input length is required to obtain reasonable accuracy on a sentiment analysis task. We show that a filtering system based on BERT allows us to reduce sequence lengths by up to 99% on a sentiment analysis task while still obtaining 70% accuracy.
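Below is a minimal sketch of the general idea: rank tokens by attention scores taken from one BERT layer, keep only the most-attended fraction, and classify the shortened sequence. It assumes the Hugging Face transformers library; the model name, the choice of layer, and the keep ratio are illustrative assumptions, not the exact configuration used in the paper.

```python
# Sketch: attention-based token filtering before sentiment classification.
# Assumed model name, layer choice, and keep_ratio are illustrative only.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "textattack/bert-base-uncased-SST-2"  # assumed sentiment model
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(
    MODEL_NAME, output_attentions=True
)
model.eval()


def filter_by_attention(text: str, keep_ratio: float = 0.1, layer: int = -1) -> str:
    """Keep only the tokens that receive the most attention in a given layer."""
    enc = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        out = model(**enc)
    # Attention tensor shape: (batch, heads, seq_len, seq_len).
    # Average over heads and over the query dimension -> one score per token.
    attn = out.attentions[layer].mean(dim=1).mean(dim=1).squeeze(0)
    ids = enc["input_ids"].squeeze(0)
    k = max(1, int(keep_ratio * ids.size(0)))
    keep = torch.topk(attn, k).indices.sort().values  # preserve original token order
    kept_tokens = tokenizer.convert_ids_to_tokens(ids[keep].tolist())
    return tokenizer.convert_tokens_to_string(kept_tokens)


def classify(text: str) -> int:
    """Return the predicted sentiment label index for a (possibly shortened) text."""
    enc = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**enc).logits
    return int(logits.argmax(dim=-1))


shortened = filter_by_attention("The movie was long, but the acting was wonderful.")
print(shortened, classify(shortened))
```

In this sketch the filtering and classification share one model for simplicity; the scores could equally be taken from a separate BERT pass, and different layers or head-aggregation schemes will yield different token rankings.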