Speech Eye Motion Dataset (SEMD)

This repository contains the scripts to build the Speech Eye Motion Dataset (SEMD). The scripts download videos and transcripts from YouTube and extract eye skeleton (landmark) data from the videos.

If you have any questions or comments, please feel free to contact me by email ([email protected]).

Requirements

  • python 3.4+
  • apiclient
  • youtube_dl
  • pandas
  • sklearn
  • tqdm
  • numpy
  • pickle
  • cv2
  • webvtt
  • shape_predictor_68_face_landmarks.dat
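Most of these can be installed with pip. The package names below are a best-guess mapping of the module names above (pickle ships with the standard library and needs no install); verify them against your environment:

pip install google-api-python-client youtube_dl pandas scikit-learn tqdm numpy opencv-python webvtt-py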

Usage

Before you run the Python commands below, please make sure you have the following folders in your directory:

/model, /videos, /facial_keypoints, /clips, /dataset, /filtered_clips
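For example, the folders can be created from the repository root with:

mkdir -p model videos facial_keypoints clips dataset filtered_clips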

To detect facial landmarks, we use shape_predictor_68_face_landmarks.dat. Please place the file in the /model directory.

1. Download videos from YouTube.

python download_video.py -video_path ./videos/ -youtube_ch_id UC_0NfufarVw04vDfWFm8z_Q -max_result 50 -lang en -dev_key YOUR_DEV_KEY -year_from 2018 -year_to 2019
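download_video.py queries the YouTube Data API with your developer key (-dev_key) to list a channel's uploads and fetches them with youtube_dl. The sketch below shows roughly what the download step amounts to; the options and output template are illustrative assumptions, not the script's exact settings:

import youtube_dl

# Illustrative options; the script's actual settings may differ.
ydl_opts = {
    'format': 'mp4',                        # download an mp4 stream
    'writesubtitles': True,                 # also save the transcript (.vtt)
    'subtitleslangs': ['en'],               # matches the -lang en flag
    'outtmpl': './videos/%(id)s.%(ext)s',   # hypothetical output path
}

with youtube_dl.YoutubeDL(ydl_opts) as ydl:
    ydl.download(['https://www.youtube.com/watch?v=VIDEO_ID'])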

2. Extract facial landmarks.

python run_facial_landmarks.py -vid_path ./videos/ -facial_keypoints ./facial_keypoints -model_path ./model/shape_predictor_68_face_landmarks.dat -width 960 -height 540 -frame_threshold 500 
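shape_predictor_68_face_landmarks.dat is dlib's standard 68-point landmark model, in which points 36-41 and 42-47 cover the two eyes, so the extraction presumably relies on dlib (not listed in the requirements above) together with cv2. A minimal per-frame sketch, assuming that setup, is given below; the repository's script adds its own resizing (-width/-height) and frame-count filtering (-frame_threshold):

import cv2
import dlib

# Assumes dlib is installed; the .dat file is dlib's 68-point predictor.
detector = dlib.get_frontal_face_detector()
predictor = dlib.shape_predictor('./model/shape_predictor_68_face_landmarks.dat')

cap = cv2.VideoCapture('./videos/example.mp4')  # hypothetical input video
while True:
    ok, frame = cap.read()
    if not ok:
        break
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    for face in detector(gray):
        shape = predictor(gray, face)
        # Points 36-47 cover both eyes in the 68-point layout.
        eyes = [(shape.part(i).x, shape.part(i).y) for i in range(36, 48)]
cap.release()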

3. Run SceneDetect

python run_scenedetect.py -clip_path ./clips -vid_path ./videos
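run_scenedetect.py splits the downloaded videos into shot-level clips. If it wraps PySceneDetect, which the name suggests but is an assumption here, the core call looks roughly like:

from scenedetect import detect, ContentDetector

# Illustrative only: detect cuts in one video and print the shot boundaries.
scenes = detect('./videos/example.mp4', ContentDetector())
for start, end in scenes:
    print(start.get_timecode(), end.get_timecode())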

4. Run Clip Filtering

python run_clip_filtering.py -clip_path ./clips -vid_path ./videos -landmarks_path ./facial_keypoints -clip_filter_path ./filtered_clips -threshold 30 -ratio 0.5

5. Generate Speech Eye Motion Dataset (SEMD)

python make_eye_motion_dataset.py -vid_path ./videos -facial_keypoints ./facial_keypoints -clip_filter_path ./filtered_clips -dataset_path ./dataset
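This step presumably pairs the speech transcripts with the extracted eye landmarks for the filtered clips. On the transcript side, the webvtt package from the requirements can read the downloaded .vtt subtitle files; a minimal sketch (the file name is illustrative):

import webvtt

# Each caption carries a start time, end time, and the spoken text,
# which can then be aligned with the per-frame eye landmarks.
for caption in webvtt.read('./videos/example.en.vtt'):
    print(caption.start, caption.end, caption.text)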

6. Preprocess SEMD

python run_preprocessing.py -dataset_path ./dataset -data_size -1 -fps 10 -n_components 7 -is_rotation_killed True
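The -n_components flag, together with sklearn in the requirements, suggests that preprocessing includes a dimensionality reduction step such as PCA over the eye motion features, alongside resampling to the target fps. That reading of the script is an assumption; a minimal PCA sketch with scikit-learn would be:

import numpy as np
from sklearn.decomposition import PCA

# X: (n_frames, n_features) array of flattened eye landmark coordinates.
# Hypothetical data; the real script builds this from the files in ./dataset.
X = np.random.rand(1000, 24)

pca = PCA(n_components=7)           # matches -n_components 7
X_reduced = pca.fit_transform(X)    # shape (1000, 7): compact motion features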
