This question uses following dataset of Urdu sentiment analysis. The class labels are P(positive) and N (Negative)
https://github.com/MuhammadYaseenKhan/Urdu-Sentiment-Corpus/blob/master/urdu- sentiment-corpus-v1.tsv
Implement following sequence based deep learning models for the same task of sentiment analysis. Perform binary text classification. RNN GRU LSTM BiLSTM You can implement these models in Keras or Pytorch. Split the data into train and test set. Use 75% for training and 25% for testing. For each of these models, try following hyper parameters and report the best results with parameter values. Number of layers = 2 or 3. Dropout rate, 0.3 or 0.7 So you will have 2 *2 = 4 different sets of parameters. Calculate accuracy, Precision, Recall and F-score for all classifiers and report the results in table. Also report parameter values which were used to get the results.
This question uses same dataset of Q 1. Perform the task of binary classification on the dataset. Choose one classifier from deep learning models implemented in Question 1 based on best results on F-measure for binary classification. Use following embedding for vector representation and report the results. You need to train the embeddings yourself on the given Urdu dataset.
- WordToVec
- Fasttext
- Elmo