JayaswalVivek/Transformer_Encoder_Based_Regression

An encoder-only Transformer for performing an NLP task.

Data Source: Kaggle
URL: https://www.kaggle.com/competitions/commonlitreadabilityprize
Problem Definition: Rate the complexity of literary passages for grades 3-12 classroom use
Problem Type: Regression using unstructured data

This code implements a Transformer-Encoder model for Kaggle's "CommonLit Readability Prize" challenge; the source datasets can be downloaded from Kaggle's website. While BERT (Devlin et al., 2018) is also a Transformer-Encoder model, this implementation is not another example of BERT pre-training or fine-tuning because --

  1. it does not learn token embeddings; instead, it uses pre-trained GloVe embeddings (https://nlp.stanford.edu/projects/glove/);
  2. it does not make use of masking (or another form of self-supervised learning) during the training process; and
  3. it makes position-based encoding optional (a minimal model sketch follows this list).
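
The sketch below is a minimal, illustrative PyTorch implementation of these three points, not the repository's exact code: frozen GloVe vectors in place of learned token embeddings, no masked-token objective, and a flag that switches sinusoidal position-based encoding on or off. All names and default hyperparameters (e.g. EncoderRegressor, d_model=300) are assumptions for illustration.

```python
# Minimal sketch (assumed names and hyperparameters, not the repository's code)
# of an encoder-only Transformer regressor with frozen GloVe embeddings and
# optional position-based encoding.
import math
import torch
import torch.nn as nn


class EncoderRegressor(nn.Module):
    def __init__(self, glove_weights, d_model=300, n_heads=6,
                 dim_feedforward=512, n_layers=2,
                 use_positional_encoding=True, max_len=512):
        super().__init__()
        # Frozen pre-trained GloVe vectors instead of learned token embeddings;
        # d_model must equal the GloVe vector dimension (300 here).
        self.embedding = nn.Embedding.from_pretrained(glove_weights, freeze=True)
        self.use_positional_encoding = use_positional_encoding

        # Standard sinusoidal positional encoding, added only if requested.
        pe = torch.zeros(max_len, d_model)
        position = torch.arange(0, max_len, dtype=torch.float).unsqueeze(1)
        div_term = torch.exp(torch.arange(0, d_model, 2).float()
                             * (-math.log(10000.0) / d_model))
        pe[:, 0::2] = torch.sin(position * div_term)
        pe[:, 1::2] = torch.cos(position * div_term)
        self.register_buffer("pe", pe.unsqueeze(0))

        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=n_heads,
            dim_feedforward=dim_feedforward, batch_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=n_layers)
        self.regressor = nn.Linear(d_model, 1)  # single readability score

    def forward(self, token_ids):
        x = self.embedding(token_ids)                 # (batch, seq, d_model)
        if self.use_positional_encoding:
            x = x + self.pe[:, :x.size(1), :]
        x = self.encoder(x)
        x = x.mean(dim=1)                             # mean-pool over tokens
        return self.regressor(x).squeeze(-1)          # (batch,)
```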

Consequently, the current implementation aims to facilitate an understanding of the encoder-based transformer architecture for predictive modelling. It can be used to explore the change in RMSE owing to --

  1. an inclusion/exclusion of position-based encoding; and
  2. a change in the number of (a) input dimensions in an attention layer; (b) attention heads; (c) dimensions in the feedforward network; or (d) encoder layers (an illustrative sweep follows this list).
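
The snippet below is an assumed setup, not the repository's training script: it reuses the EncoderRegressor sketch above with placeholder tensors to show how validation RMSE could be compared across the options listed.

```python
# Illustrative RMSE comparison across positional-encoding and encoder-depth
# settings; dummy tensors stand in for the real GloVe matrix and CommonLit data.
import torch

glove_weights = torch.randn(5000, 300)         # placeholder GloVe matrix
val_ids = torch.randint(0, 5000, (64, 128))    # placeholder validation batch
val_targets = torch.randn(64)                  # placeholder readability scores

def rmse(preds, targets):
    return torch.sqrt(torch.mean((preds - targets) ** 2)).item()

for use_pe in (True, False):
    for n_layers in (1, 2, 4):
        model = EncoderRegressor(glove_weights, n_layers=n_layers,
                                 use_positional_encoding=use_pe)
        # ... training on the CommonLit training split omitted ...
        model.eval()
        with torch.no_grad():
            score = rmse(model(val_ids), val_targets)
        print(f"pos_enc={use_pe} layers={n_layers} RMSE={score:.4f}")
```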

An example model evaluation is available at https://github.com/JayaswalVivek/Transformer_Encoder/wiki/Model-Evaluation

References

  1. Devlin, Jacob, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding." arXiv preprint arXiv:1810.04805 (2018).
