Skip to content
View fschmid56's full-sized avatar

Block or report fschmid56

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Prediction of sound event bounding boxes (SEBBs)

Python 25 2 Updated Aug 2, 2024

This project is for "Practical Work in AI"

Jupyter Notebook 4 Updated Dec 10, 2024

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 249 44 Updated Nov 20, 2024
Python 20 Updated Oct 17, 2024
Python 21 Updated Dec 22, 2024
Python 1 Updated Sep 1, 2024

A library built for easier audio self-supervised training, downstream tasks evaluation

Python 110 10 Updated Aug 27, 2024

Accurate and general beat tracker

Python 97 16 Updated Nov 5, 2024

This repository provides the code for "Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining", presented at DCASE 2024. The paper addresses the challenge of audio retri…

Python 1 Updated Oct 25, 2024

This repository provides an easy way to train your models on the datasets of DCASE task 1.

Python 12 1 Updated Jan 2, 2025

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

Jupyter Notebook 108 13 Updated Oct 15, 2024

Web application to record speech for an open data set

HTML 422 161 Updated Apr 29, 2020

Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desired target dataset.

162 6 Updated Mar 4, 2023

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,247 117 Updated Jul 11, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 2,769 226 Updated Dec 3, 2024

Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"

Jupyter Notebook 1,141 395 Updated Oct 31, 2020

Simple implementation of Mobile-Former on Pytorch

Python 108 17 Updated Sep 26, 2021
Python 1 Updated Jul 22, 2023

This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"

Python 23 4 Updated Sep 18, 2023

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,075 866 Updated Jul 6, 2024

Summary, Code for Deep Neural Network Quantization

Python 534 82 Updated Oct 13, 2024

A lightweight library for Frechet Audio Distance calculation.

Python 241 24 Updated Sep 4, 2024

View model summaries in PyTorch!

Python 2,649 124 Updated Dec 26, 2024

Ba3l

Python 3 1 Updated Dec 6, 2021

Collection of recent methods on (deep) neural network compression and acceleration.

932 132 Updated Dec 3, 2024
Jupyter Notebook 1 1 Updated Mar 14, 2023

Improving Recording Device Generalization using Impulse Response Augmentation

Python 11 Updated Apr 4, 2023

Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"

Python 30 4 Updated Apr 29, 2022

ESC-50: Dataset for Environmental Sound Classification

Python 1,444 291 Updated Mar 20, 2024

Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.

Python 25 3 Updated Jun 23, 2023
Next
Showing results