Skip to content

Nourahussein/Movie-classfiction-pased-on-it-s-Arabic-subtitle

Folders and files

NameName
Last commit message
Last commit date

Latest commit

b1566dd · Jun 9, 2018

History

41 Commits
Jul 10, 2017
Jul 10, 2017
Jul 10, 2017
Apr 25, 2017
Jun 9, 2018
Apr 19, 2017
Apr 30, 2017
Apr 18, 2017
Apr 25, 2017
Jun 16, 2017
Jun 16, 2017
Apr 17, 2017
Apr 23, 2017
Apr 19, 2017
Apr 18, 2017

Repository files navigation

Movie Genre Classification from Subtitles

Domain Background

In this project, aim is to categorise movies into genres by analysing Arabic subtitles with machine learning techniques.

main stages:

1- cleaning data: Pre-process arabic text (remove diacritics, punctuations and repeating characters.

2- text extraction:

3- classification.

4- testing.

install:

  • [NumPy]
  • [Pandas]
  • [NLTK]
  • [Matplotlib]

You will also need to have software installed to run and execute jupyter notebook.

Data:

download Arabic subtitles from http://subscene.com/ it contains 20 subtitle for each genre. you can find it in subtitles dirctory.

How to Contribute:

git clone https://github.com/Nourahussein/Movie-classfiction-pased-on-it-s-Arabic-subtitle

cd Movie-classfiction-pased-on-it-s-Arabic-subtitle

About

classify English movies by using its Arabic subtitle

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published