The main form of interaction in social systems is through posts, whether public or private, which can cover a wide range of subjects. Some of these posts reflect what the user thinks about the system while using it, for example: "It’s so boring to have only 140 characters to post on twitter :(". We call these Usage Related Posts (URP).
This repository contains an automatic method for identifying URPs in Portuguese-language Twitter posts using a Convolutional Neural Network (CNN).
Before training, we remove all stop words, punctuation, and emojis from the posts in the database.
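The cleaning step described above could look roughly like the sketch below. It is not the repository's actual code: the `clean_post` helper, the tiny `STOP_WORDS` set, and the emoji ranges in `EMOJI_RE` are all assumptions made for illustration. In practice the stop word list would more likely come from NLTK, for example `nltk.corpus.stopwords.words("portuguese")`.

```python
import re
import string

# Hypothetical, tiny stop word list used only for this illustration.
# NLTK's Portuguese stop word corpus would be a more complete choice.
STOP_WORDS = {"a", "o", "de", "que", "e", "para", "no", "em", "um", "uma"}

# Rough emoji ranges (an assumption; this does not cover every emoji).
EMOJI_RE = re.compile(
    "[\U0001F300-\U0001FAFF\U00002600-\U000027BF\U0001F1E6-\U0001F1FF\uFE0F]"
)

# Translation table that deletes every ASCII punctuation character.
PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def clean_post(text, stop_words=STOP_WORDS):
    """Lowercase a post, drop emojis and punctuation, then drop stop words."""
    text = EMOJI_RE.sub(" ", text.lower())
    text = text.translate(PUNCT_TABLE)
    return [tok for tok in text.split() if tok not in stop_words]


tokens = clean_post("Que chato ter apenas 140 caracteres no twitter :( 😡")
print(tokens)
```

Note that removing punctuation also removes text emoticons such as `:(`, which may or may not be desirable for this task.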
The model applies convolution filters to the sentences, and a dense layer is responsible for identifying the class given the output of the convolution layer. The process is shown in the figure below.
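As a rough illustration of this kind of architecture, here is a minimal Keras sketch of a convolution layer over word embeddings followed by a dense classifier. It is not the model shipped in this repository: the `build_model` helper, the vocabulary size, sequence length, embedding size, filter count, kernel size, and the binary (URP vs. non-URP) output are all assumed values chosen for the example.

```python
import numpy as np
from keras.models import Sequential
from keras.layers import Embedding, Conv1D, GlobalMaxPooling1D, Dense

# All hyperparameters below are assumptions for illustration.
VOCAB_SIZE = 5000   # number of distinct token ids
MAX_LEN = 30        # posts padded or truncated to this many tokens


def build_model(vocab_size=VOCAB_SIZE, embed_dim=50, filters=64, kernel_size=3):
    """Embedding -> 1D convolution -> max pooling -> dense classifier."""
    model = Sequential([
        # Map each token id to a dense vector.
        Embedding(vocab_size, embed_dim),
        # Slide convolution filters over windows of consecutive tokens.
        Conv1D(filters, kernel_size, activation="relu"),
        # Keep the strongest response of each filter over the sentence.
        GlobalMaxPooling1D(),
        # Dense layer that decides the class from the convolution output.
        Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model


model = build_model()

# Dummy batch of 4 already-tokenized, padded posts (random token ids).
x = np.random.randint(0, VOCAB_SIZE, size=(4, MAX_LEN))
probs = model.predict(x, verbose=0)
print(probs.shape)
```

Each row of `probs` is the predicted probability that the corresponding post is a URP. With an untrained model these values are meaningless; training would use `model.fit` on the cleaned, tokenized posts and their labels.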
The model achieves 91.5% accuracy with a loss of 12.3%.
Dependencies:
- Python 3.x or Python 2.7
- Keras 2.x
- TensorFlow or Theano
- NLTK
- scikit-learn (sklearn)