This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', which was accepted by IJCAI-ECAI2022 (Long oral)

Python 66 12 Updated Jun 7, 2022

dxyang / StyleTransfer

Implementation of "Perceptual Losses for Real-Time Style Transfer and Super-Resolution" in PyTorch

Python 297 70 Updated Sep 23, 2020

NVIDIA / partialconv

A New Padding Scheme: Partial Convolution based Padding

Python 1,235 213 Updated May 16, 2023

naoto0804 / pytorch-inpainting-with-partial-conv

Unofficial pytorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions' [Liu+, ECCV2018]

Python 595 135 Updated Jan 22, 2024

tanimutomo / partialconv

Re-Implementation of "Image Inpainting for Irregular Holes using Partial Convolution"

Python 71 16 Updated Oct 3, 2023

ryanwongsa / Image-Inpainting

Image Inpainting for Irregular Holes Using Partial Convolutions

HTML 25 5 Updated Jun 22, 2020

ZhaoRunning / Radio2Speech

official implementation of Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals

Python 8 3 Updated Nov 15, 2024

FENRlR / MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

Python 119 29 Updated Nov 19, 2024

TEN-framework / TEN-Agent

TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…

Python 3,732 360 Updated Dec 25, 2024

llvm / llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 29,819 12,311 Updated Dec 25, 2024

jhao104 / proxy_pool

Python ProxyPool for web spider

Python 21,787 5,218 Updated Sep 10, 2024

RobustSP / toolbox

This document contains the functions that are currently available in the RobustSP toolbox: a Matlab toolbox for robust signal processing. The toolbox can be freely used for non-commercial use only.…

MATLAB 16 10 Updated Nov 15, 2019

moodle / moodle

Moodle - the world's open source learning platform

PHP 5,850 6,685 Updated Dec 19, 2024

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 94,245 7,367 Updated Dec 23, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,238 117 Updated Jul 11, 2024

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,675 184 Updated Nov 14, 2024

kaituoxu / Conv-TasNet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Python 686 156 Updated Apr 6, 2023

nanless / universal-speech-enhancement

Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…

Python 42 4 Updated Jul 29, 2024

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,292 141 Updated Jun 6, 2024

NVIDIA / NeMo-text-processing

NeMo text processing for ASR and TTS

Python 288 91 Updated Dec 11, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 8,763 842 Updated Dec 18, 2024

QwenLM / vllm-gptq

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 127 3 Updated Dec 6, 2024

spotify / pedalboard

🎛 🔊 A Python library for audio.

C++ 5,297 269 Updated Nov 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ziyi Lin Ziyi6

Block or report Ziyi6

Stars

facebookresearch / ears_dataset

usefulsensors / moonshine

SamsungLabs / hifi_plusplus

BUTSpeechFIT / VBx

pyannote / pyannote-audio

ehabets / RIR-Generator

haoheliu / voicefixer

Andong-Li-speech / TaylorSENet