Text_Summarizer_On_Patents

NLP Project on summarizing patent abstract into single sentences using Encoder-Decoder architecture in Recurrent Neural Networks.

Abstract

The idea behind the project is to generate summary of any short documentation. It can be applied to various fields and for our case, we generated Patent titles using the Patent Abstract. We have used encoder-decoder Recurrent Neural Networks, experimenting with majorly LSTM and GRU cell units. The dataset contains patents from various domains which makes the problem more challenging. The project report is attached for your reference.

Dataset

The dataset was generated using BigQuery on Google Patents website. The patents were extracted from various domains including but not limited to Physics, Biology, Algebra. The details of the dataset are present in the project report.

Word Embeddings

We used 300 dimensional Glove Embeddings trained on 2.2M vocabulary and 840B tokens. (https://nlp.stanford.edu/projects/glove/)

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
models		models
project_report		project_report
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text_Summarizer_On_Patents

Abstract

Dataset

Word Embeddings

About

Releases

Packages

Languages

ajindal1/Text_Summarizer_On_Patents

Folders and files

Latest commit

History

Repository files navigation

Text_Summarizer_On_Patents

Abstract

Dataset

Word Embeddings

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages