Build a neural-network-based model that predicts the next word in a sequence of words or sentences.
The text generation problem took time to solve well. Unlike images, text and natural language require some form of temporal memory, because human languages are full of subtleties. A simple example is the use of pronouns such as "he", "she", or "it" in place of nouns like a person's name. Humans resolve these references effortlessly, but a machine must learn which name "she" can stand for and when. This is handled with RNNs (recurrent neural networks), a class of neural networks used for many language tasks; their strength is a mechanism that lets them retain information from earlier in a sequence and use it later. The LSTM (Long Short-Term Memory) unit is a type of RNN cell and was one of the most popular tools for natural language processing before transformers emerged.
The dataset is taken from Project Gutenberg: The Adventures of Sherlock Holmes by Arthur Conan Doyle is used to train the model.
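A minimal sketch of loading the text is shown below. The file name `sherlock_holmes.txt` is an assumption; point it at your local copy of the Gutenberg plain-text file.

```python
# Load the Project Gutenberg text (file name is an assumption; adjust to your copy).
with open("sherlock_holmes.txt", "r", encoding="utf-8") as f:
    text = f.read().lower()

# Split the book into non-empty lines to build training sequences from later.
corpus = [line for line in text.split("\n") if line.strip()]
print(f"{len(corpus)} non-empty lines loaded")
```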
The model is built with the Keras Sequential API, using categorical_crossentropy as the loss function and Adam as the optimizer. It uses Embedding, Bidirectional LSTM, and Dense layers. EarlyStopping is used to halt training once accuracy stops improving meaningfully. The text is converted into padded integer sequences with the Keras Tokenizer and pad_sequences, as sketched below.
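The following is a hedged sketch of the preprocessing and model described above, reusing the `corpus` list from the loading snippet. The hyperparameters (embedding size, LSTM units, epochs, patience) are illustrative assumptions, not the exact values used in this project.

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, Bidirectional, LSTM, Dense
from tensorflow.keras.callbacks import EarlyStopping

# Fit the tokenizer on the corpus and build n-gram input sequences.
tokenizer = Tokenizer()
tokenizer.fit_on_texts(corpus)
total_words = len(tokenizer.word_index) + 1

input_sequences = []
for line in corpus:
    token_list = tokenizer.texts_to_sequences([line])[0]
    for i in range(1, len(token_list)):
        input_sequences.append(token_list[: i + 1])

# Pad every sequence to the same length; the last token of each is the prediction target.
max_len = max(len(seq) for seq in input_sequences)
input_sequences = pad_sequences(input_sequences, maxlen=max_len, padding="pre")
X, y = input_sequences[:, :-1], input_sequences[:, -1]
y = tf.keras.utils.to_categorical(y, num_classes=total_words)

# Sequential model: Embedding -> Bidirectional LSTM -> Dense softmax over the vocabulary.
model = Sequential([
    Embedding(total_words, 100, input_length=max_len - 1),  # embedding size is an assumption
    Bidirectional(LSTM(150)),                                # LSTM width is an assumption
    Dense(total_words, activation="softmax"),
])
model.compile(loss="categorical_crossentropy", optimizer="adam", metrics=["accuracy"])

# Stop training when accuracy stops improving.
early_stop = EarlyStopping(monitor="accuracy", patience=3, restore_best_weights=True)
model.fit(X, y, epochs=50, callbacks=[early_stop], verbose=1)
```

To generate a next word with a trained model of this shape, tokenize a seed phrase, pad it to length `max_len - 1`, take the argmax of `model.predict`, and map the resulting index back to a word through `tokenizer.word_index`.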
You need to have the following software and libraries installed on your machine before running this project.
- Python 3
- Anaconda: installs Jupyter Notebook and most of the required libraries, such as scikit-learn, pandas, seaborn, matplotlib, NumPy, and PIL.
- TensorFlow
- Keras
- Python 3: https://www.python.org/downloads/
- Anaconda: https://www.anaconda.com/download/
- TensorFlow: `pip install tensorflow`
- Keras: `pip install keras`