Skip to content
#

torchaudio

Here are 51 public repositories matching this topic...

This project leverages Python, computer vision, and deep learning techniques, utilizing pre-trained models such as RetinaNet_ResNet-50 for image-based object detection. It is designed with a primary focus on enhancing security across various sectors. The RetinaNet_ResNet-50 model enables both image and video-based detection functionalities.

  • Updated Feb 28, 2024

In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.

  • Updated Jun 18, 2024
  • Jupyter Notebook

The core of my graduation project that uses convolutional neural networks to extract the vocal part from a song by removing the sound of musical instruments. The project is rather academic, it did not achieve too great real results, but this is expected. I'm not going to develop it further.

  • Updated Jun 13, 2020
  • Python

Improve this page

Add a description, image, and links to the torchaudio topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the torchaudio topic, visit your repo's landing page and select "manage topics."

Learn more