Multiclass text classification using two models : Keras Custom Embeddings and Google BERT
Change the file paths according to your directory. Code can be run directly in google colab.
Two models have been tested:
The model was trained from scratch for word embeddings; further propogated to RNN architecture resulting in 77% train accuracy and 67% validation accuracy.
The model was trained using pre-trained Google BERT word embeddings giving better results with 83% train accuracy and 74% validation accuracy.
Results can probably be futher improved by removing stopwords and then feeding it to Google BERT architecture.