Junction 2016 Project on Hatespeech

(Hacktathon Helsinki - 25-27 nov. 2016)

Inspiration

The last elections have shown that politicians are getting away with posting hate speech and false information on their social media feeds.

What it does

We want to provide a tool that allows to analyse politicians social media feeds in regard to hate, information credibility and sentiment analysis. Moreover, it's not only a search page, but also a chrome plugin to directly extract and check content from websites.

How we built it

We have used several tools:

main language is python, pandas and nltk for sentiment analysis
the application is served with django
hatebase database for detecting hate speeches and words
our own database and duck duck go search for checking content credibility
javascript for interactive content on the webpage and for the chrome extension
twitter, facebook and reddit api for retrieving content
wikidata and sparql for fetching information about politicians

Challenges we ran into

Facts checking is tremendously hard
website credibility
hate speech analysis is not just single words checking
wikidata querying is not easy and can take a while
parsing and scrapping websites and search engines results
Not enough time to work on our own classifier, even though we have annotated data

Accomplishments that we're proud of

Good looking web application that works
the chrome extension is simple but effective
working hate speech detection

What we learned

generating and using apis
using and combining multiple content sources
generating meaningful visualizations
building chrome extension

What's next for HackHate

Getting a large dataset from reddit data and training our own machine learning classifier
Extending our database for credibility checking
annotating the web pages with chrome extension to give live credibility checking

Built With python, django, twitter, facebook-graph, machine-learning, nltk, sparql, wikidata, chrome

Try it out

http://hackhate.huguesverlin.fr

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
chrome_extension		chrome_extension
nltk		nltk
webapp		webapp
.gitignore		.gitignore
README.md		README.md
RedditProvider.py		RedditProvider.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Junction 2016 Project on Hatespeech

Inspiration

What it does

How we built it

Challenges we ran into

Accomplishments that we're proud of

What we learned

What's next for HackHate

Try it out

About

Releases

Packages

Languages

hverlin/hackhate

Folders and files

Latest commit

History

Repository files navigation

Junction 2016 Project on Hatespeech

Inspiration

What it does

How we built it

Challenges we ran into

Accomplishments that we're proud of

What we learned

What's next for HackHate

Try it out

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages