Our model predicts 3D face animation for given Tamil speech audio, accounting for coarticulation effects. It achieved a Root Mean Square Error of 0.0648 on our test split and an overall subjective accuracy of 83%. Further, a Turing test confirmed that participants were unable to distinguish our predicted animation from the ground truth.
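For clarity, the RMSE above is the square root of the mean squared difference between predicted and ground-truth landmark coordinates. A minimal sketch with placeholder arrays (not our actual test data):

```python
import numpy as np

def rmse(pred: np.ndarray, truth: np.ndarray) -> float:
    """Root Mean Square Error over all coordinate entries."""
    return float(np.sqrt(np.mean((pred - truth) ** 2)))

# Placeholder landmark coordinates, purely illustrative
pred = np.array([[0.10, 0.20], [0.30, 0.40]])
truth = np.array([[0.12, 0.18], [0.33, 0.38]])
print(round(rmse(pred, truth), 4))  # → 0.0229
```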
SHORT SPEECH | LONG SPEECH | FAST SPEECH | SLOW SPEECH | MIXED LANGUAGE SPEECH
librosa 0.8.1
ffmpeg 4.3.1
opencv-python 4.5.2
scipy 1.6.2
tensorflow 2.6.0
sklearn 0.22.2
Blender 2.92.0
pickle 4.0
numpy 1.20.0
matplotlib 3.3.4
tqdm 4.59.0
keras-tuner 1.1.2
mediapipe 0.8.9.1
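Assuming a pip-based setup, the Python packages above can be pinned in a requirements file like the sketch below. Note that ffmpeg and Blender are system-level installs (not pip packages), pickle ships with the Python standard library, and sklearn is distributed on PyPI as scikit-learn:

```
librosa==0.8.1
opencv-python==4.5.2
scipy==1.6.2
tensorflow==2.6.0
scikit-learn==0.22.2
numpy==1.20.0
matplotlib==3.3.4
tqdm==4.59.0
keras-tuner==1.1.2
mediapipe==0.8.9.1
```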
- Place all recorded training videos inside the `VIDEOs` directory
- Run `PDM.py` and `Preprocess.py` in the specified order to perform feature extraction
- Run `HyperparameterTuning.py` to pick the best possible hyperparameter combination (optional)
- Set the best hyperparameter values in the `BLSTM_128_64.py` file (optional)
- Run `BLSTM_128_64.py` to train the deep learning model
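The file name BLSTM_128_64.py suggests a bidirectional LSTM with 128- and 64-unit layers. A minimal Keras sketch of that kind of regressor, under assumed (not the repo's actual) input and output dimensions, where 468 × 3 corresponds to MediaPipe's face-mesh landmarks:

```python
from tensorflow.keras import layers, models

# Illustrative sizes only: frames per clip, audio features per frame,
# and flattened 3D landmark coordinates per frame.
N_FRAMES, N_AUDIO_FEATS = 50, 40
N_OUTPUTS = 468 * 3

model = models.Sequential([
    # Two stacked bidirectional LSTMs capture context on both sides of a
    # frame, which is what modelling coarticulation requires.
    layers.Bidirectional(layers.LSTM(128, return_sequences=True),
                         input_shape=(N_FRAMES, N_AUDIO_FEATS)),
    layers.Bidirectional(layers.LSTM(64, return_sequences=True)),
    # One landmark-coordinate vector per input frame
    layers.TimeDistributed(layers.Dense(N_OUTPUTS)),
])
model.compile(optimizer="adam", loss="mse")
```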
- Record the input Tamil speech audio and place it inside the `VIDEOs/AUDIOs/` directory
- Set the Blender path in `Prediction.py` in the `cmd` variable
- Run the command `python Prediction.py <fileName> <audioFormat>` to generate the animation
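The `cmd` variable needs to point at your local Blender executable. As a rough sketch, a headless Blender invocation is typically assembled like this; the paths and script name below are placeholders, not the repo's actual values:

```python
import subprocess

# Placeholder paths: point blender_path at your Blender 2.92 binary.
blender_path = "/usr/bin/blender"
driver_script = "Animate.py"  # hypothetical Blender-side animation script

cmd = [
    blender_path,
    "--background",             # run Blender without its GUI
    "--python", driver_script,  # execute the script inside Blender
]
print(" ".join(cmd))
# subprocess.run(cmd, check=True)  # run once blender_path is valid
```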
To try our trained model, download the preprocessor and model weights from WeightsAndPreprocessor.zip and unzip them inside the `logs` directory.