Medium Summarizer App

This is a web app that scrapes a Medium post from a URL you supply, summarizes it into 5-10 sentences using extractive summarization, picks 5 keywords from the article ranked by importance (not raw frequency) using TF-IDF, and then outputs the summary and the keywords.
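To make "extractive summarization" concrete: the summary is built from sentences copied out of the article, not newly written ones. The sketch below is a rough illustration of that idea using only the standard library. It scores sentences by average word frequency, which is a much simpler stand-in for the LSA model in Sumy that the app relies on. The function names, the tiny stop-word list, and the naive sentence splitter are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"a", "an", "and", "are", "as", "at", "by", "for", "in",
              "is", "it", "of", "on", "the", "to", "with"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase words with stop words removed."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower())
            if w not in STOP_WORDS]


def summarize(text, n_sentences=5):
    """Return the n highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    freqs = Counter(w for s in sentences for w in tokenize(s))

    def score(sentence):
        words = tokenize(sentence)
        # Average word frequency, so long sentences are not favored by length.
        return sum(freqs[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:n_sentences])  # restore reading order
    return [sentences[i] for i in keep]


if __name__ == "__main__":
    article = ("Cats sleep a lot. Cats and dogs are pets. "
               "The weather is nice today. Dogs bark at cats.")
    for line in summarize(article, n_sentences=2):
        print(line)
```

Selected sentences are returned in reading order, not score order, so the summary still reads like the source text.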

Some of the tools used

App framework and hosting tools like Streamlit (first attempt) and Heroku
Extractive summarization tool like Sumy (LSA model)
Web scraping libraries like BeautifulSoup and Requests
URL verification library like tldextract
Text preprocessing libraries like spaCy (heavy duty) and Sumy
Text vectorization classes like TfidfVectorizer and CountVectorizer in scikit-learn
Others like HTML, etc.
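As a rough illustration of why TF-IDF picks keywords by importance and not by raw frequency, here is a minimal standard-library sketch. It is not the app's code, which uses scikit-learn's vectorizers. It assumes each sentence of the article is treated as one "document" and uses the classic idf = log(N / df) form; the function name `tfidf_keywords` is hypothetical.

```python
import math
import re
from collections import Counter


def tfidf_keywords(sentences, top_k=5):
    """Rank terms by summed TF-IDF, treating each sentence as a document."""
    docs = [re.findall(r"[a-z]+", s.lower()) for s in sentences]
    n_docs = len(docs)

    # Document frequency: the number of sentences each term appears in.
    df = Counter(term for doc in docs for term in set(doc))

    scores = Counter()
    for doc in docs:
        if not doc:
            continue
        for term, count in Counter(doc).items():
            tf = count / len(doc)              # term frequency in this sentence
            idf = math.log(n_docs / df[term])  # 0 for terms found everywhere
            scores[term] += tf * idf

    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_k]]


if __name__ == "__main__":
    sample = ["the cat sat", "the dog sat", "the bird sang loudly"]
    print(tfidf_keywords(sample, top_k=3))
```

In the sample, "the" is the most frequent word but appears in every sentence, so its idf is zero and it never ranks as a keyword.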

The most difficult part was trying to understand the algorithm behind the summarization model and using Streamlit to wrap all the components together.

Here is a demo of the app

You can find the link to the app here

Further improvements to be done are

Moving from extractive summarizer to abstractive summarizer
Building a good summarizer from scratch
Improving the scraping step as well as the interface

Feel free to go through my draft_work and jupyter_notebooks


anitaokoh/Medium_Summarizer
