fschmid56

Follow

Florian Schmid fschmid56

Follow

University Assistant @CPJKU Linz Working on Audio Classification, Audio Tagging, Acoustic Scene Classification, Low-Complexity Models

41 followers · 12 following

Johannes Kepler University
Linz
in/florian-schmid-887224293
@Florian04130962
https://scholar.google.com/citations?user=BYQ5Sy8AAAAJ&hl=de

Achievements

Achievements

Stars

merlresearch / sebbs

Prediction of sound event bounding boxes (SEBBs)

Python 25 2 Updated Aug 2, 2024

thswlsgud0423 / Audio_Tagging_Jimmy

This project is for "Practical Work in AI"

Jupyter Notebook 4 Updated Dec 10, 2024

fschmid56 / EfficientAT

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 249 44 Updated Nov 20, 2024

CPJKU / cpjku_dcase24

Python 20 Updated Oct 17, 2024

fschmid56 / PretrainedSED

Python 21 Updated Dec 22, 2024

SchilcherPatrick / DCASE24_Task1

Python 1 Updated Sep 1, 2024

Audio-WestlakeU / audiossl

A library built for easier audio self-supervised training, downstream tasks evaluation

Python 110 10 Updated Aug 27, 2024

CPJKU / beat_this

Accurate and general beat tracker

Python 97 16 Updated Nov 5, 2024

Jonathan-Greif / QBV

This repository provides the code for "Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining", presented at DCASE 2024. The paper addresses the challenge of audio retri…

Python 1 Updated Oct 25, 2024

yqcai888 / easy_dcase_task1

This repository provides an easy way to train your models on the datasets of DCASE task 1.

Python 12 1 Updated Jan 2, 2025

Audio-WestlakeU / ATST-SED

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

Jupyter Notebook 108 13 Updated Oct 15, 2024

petewarden / open-speech-recording

Web application to record speech for an open data set

HTML 422 161 Updated Apr 29, 2020

internet-explorer-ssl / internet-explorer

Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desired target dataset.

162 6 Updated Mar 4, 2023

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,247 117 Updated Jul 11, 2024

lucidrains / vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Python 2,769 226 Updated Dec 3, 2024

musikalkemist / AudioSignalProcessingForML

Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"

Jupyter Notebook 1,141 395 Updated Oct 31, 2020

ACheun9 / Pytorch-implementation-of-Mobile-Former

Simple implementation of Mobile-Former on Pytorch

Python 108 17 Updated Sep 26, 2021

cwilldoner / practicalwork

Python 1 Updated Jul 22, 2023

fschmid56 / cpjku_dcase23

This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"

Python 23 4 Updated Sep 18, 2023

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,075 866 Updated Jul 6, 2024

csyhhu / Awesome-Deep-Neural-Network-Compression

Summary, Code for Deep Neural Network Quantization

Python 534 82 Updated Oct 13, 2024

gudgud96 / frechet-audio-distance

A lightweight library for Frechet Audio Distance calculation.

Python 241 24 Updated Sep 4, 2024

TylerYep / torchinfo

View model summaries in PyTorch!

Python 2,649 124 Updated Dec 26, 2024

kkoutini / ba3l

Ba3l

Python 3 1 Updated Dec 6, 2021

MingSun-Tse / Efficient-Deep-Learning

Collection of recent methods on (deep) neural network compression and acceleration.

932 132 Updated Dec 3, 2024

fschmid56 / malach23-pipeline

Jupyter Notebook 1 1 Updated Mar 14, 2023

theMoro / DIRAugmentation

Improving Recording Device Generalization using Impulse Response Augmentation

Python 11 Updated Apr 4, 2023

RicherMans / PSL

Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"

Python 30 4 Updated Apr 29, 2022

karolpiczak / ESC-50

ESC-50: Dataset for Environmental Sound Classification

Python 1,444 291 Updated Mar 20, 2024

fschmid56 / EfficientAT_HEAR

Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.

Python 25 3 Updated Jun 23, 2023