YOLO-CNN-LSTM Model for Image Classification

1.项目标题与内容概述

标题：深度学习用于检测电影和连续剧中的暴力和虐待行为： OTT 流媒体平台概述：本项目利用深度学习和计算机视觉技术检测并移除电影视频中的暴力和不当内容。与目前常见的仅识别是否为暴力的检测方法不同的是，该框架还对识别到的暴力行为进行初级暴力和高级暴力的分类。它结合了YOLO和CNN-LSTM模型，能够以70%的准确率检测和分类暴力内容。随着人们对好莱坞影片暴力内容的关注增加，该框架填补了当前流媒体平台内容审核的空白，减少了对人工内容审核的依赖，提高了效率。它在不同视频类型上的可扩展性使其成为数字平台内容管理的有力工具。

Project title and content overview

Title: Deep learning for violence and abuse detection in cinema and series: OTT streaming platforms Content: This project uses deep learning and computer vision techniques to detect and remove violent and inappropriate content from movie videos. Unlike current common detection methods that only identify whether the violence is violent or not, the framework also categorizes the identified violence in terms of lowand high levels in movies. It combines YOLO and CNN-LSTM models to detect and categorize violent content with 70% accuracy. With the increased concern about violent content in Hollywood movies, the framework fills the current gap in content review for streaming platforms, reducing the reliance on manual content review and improving efficiency. Its scalability across different video types makes it a powerful tool for content management on digital platforms.

Project Structure

2.数据集，预处理及项目结构数据集AIRTLab：https://github.com/airtlab/A-Dataset-for-Automatic-Violence-Detection-in-Videos 首先，我们对上述数据集做了新的整理分类，划分为非暴力（包括手势，握手，击掌，拥抱，跳跃，行走，打招呼），初级暴力（包括推搡，扇耳光，窒息，打架，踢，拳击），高级暴力（包括枪击，刺伤，棍棒），数据结构如下： dataset/ train/ class1/ img1.jpg img2.jpg ... class2/ img1.jpg img2.jpg ... class3/ img1.jpg img2.jpg ... test/ class1/ img1.jpg img2.jpg ... class2/ img1.jpg img2.jpg ... class3/ img1.jpg img2.jpg ... 其次，对已分类的视频进行抽帧，每秒抽取三帧，每隔10帧保存1帧，即保存0,10,20，...帧。对于yolo部分，帧的形状为（640x640x3），并且需要用labeling进行动作标注；对于CNNLSTM部分，帧的形状为（64X64X3）。最后，yolo部分参照：yolo.ipynb；CNNLSTM参照：0712.ipynb，train_cnn_lstm.ipynb，test_cnn_lstm.ipynb。 Jupter文件包含了模型的训练，测试和评估。

2.Dataset, Preprocessing and Project Structure

Dataset AIRTLab: https://github.com/airtlab/A-Dataset-for-Automatic-Violence-Detection-in-Videos

First of all, we made a new organizing classification of the above dataset into nonviolent (including gesture, handshake, highfive, hug, jump, walk, and greet), low-violence (including push, slap, choking, stifle, kick, and punch), and high-violence (including shoot, stab, and club), and the data structure is as follows: dataset/ train/ class1/ img1.jpg img2.jpg ... class2/ img1.jpg img2.jpg ... class3/ img1.jpg img2.jpg ... test/ class1/ img1.jpg img2.jpg ... class2/ img1.jpg img2.jpg ... class3/ img1.jpg img2.jpg ... Secondly, the classified video is extracted by drawing three frames per second and saving 1 frame every 10 frames, i.e., saving 0,10,20,... frames. For the yolo part, the shape of the frame is (640x640x3), and it needs to be labeled with labeling for action; for the CNNLSTM part, the shape of the frame is (64X64X3). Finally, the yolo part refers to: yolo.ipynb; CNNLSTM refers to: 0712.ipynb, train_cnn_lstm.ipynb, test_cnn_lstm.ipynb. Jupyter Notebook containing the model training, testing and evaluation.

Requirements

运行代码需要你安装如下函数库：

Python 3.x
TensorFlow
Keras
NumPy
Matplotlib
OpenCV
Jupyter

To run the code in this repository, you need to have the following libraries installed:

Python 3.x
TensorFlow
Keras
NumPy
Matplotlib
OpenCV
Jupyter

You can install the required packages using the following command:

pip install tensorflow keras numpy matplotlib opencv-python-headless jupyter

Dataset The dataset used for this project should be structured in the following format:

dataset/
    train/
        class1/
            img1.jpg
            img2.jpg
            ...
        class2/
            img1.jpg
            img2.jpg
            ...
    test/
        class1/
            img1.jpg
            img2.jpg
            ...
        class2/
            img1.jpg
            img2.jpg
            ...

git clone https://github.com/yourusername/cnn-image-classification.git cd cnn-image-classification

jupyter notebook cnn.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
cnnlstm		cnnlstm
0712.ipynb		0712.ipynb
0721.html		0721.html
AIRTlab.ipynb		AIRTlab.ipynb
output0712.html		output0712.html
readme.md		readme.md
ucl101.ipynb		ucl101.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YOLO-CNN-LSTM Model for Image Classification

Project Structure

Requirements

About

Releases

Packages

Contributors 2

Languages

sydney-machine-learning/violence-movies

Folders and files

Latest commit

History

Repository files navigation

YOLO-CNN-LSTM Model for Image Classification

Project Structure

Requirements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages