Skip to content
View Ziyi6's full-sized avatar
  • TU Darmstadt
  • Darmstadt
  • 15:06 (UTC +01:00)

Block or report Ziyi6

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Expressive Anechoic Recordings of Speech (EARS)

Python 140 7 Updated Jun 25, 2024

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,425 124 Updated Dec 17, 2024

HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)

Python 77 7 Updated Dec 2, 2023

Variational Bayes HMM over x-vectors diarization

Python 257 57 Updated Jan 15, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,561 800 Updated Dec 13, 2024

Generating room impulse responses

C++ 435 147 Updated Dec 20, 2023

General Speech Restoration

Python 1,062 132 Updated May 31, 2024

This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', which was accepted by IJCAI-ECAI2022 (Long oral)

Python 66 12 Updated Jun 7, 2022

Implementation of "Perceptual Losses for Real-Time Style Transfer and Super-Resolution" in PyTorch

Python 297 70 Updated Sep 23, 2020

A New Padding Scheme: Partial Convolution based Padding

Python 1,235 213 Updated May 16, 2023

Unofficial pytorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions' [Liu+, ECCV2018]

Python 595 135 Updated Jan 22, 2024

Re-Implementation of "Image Inpainting for Irregular Holes using Partial Convolution"

Python 71 16 Updated Oct 3, 2023

Image Inpainting for Irregular Holes Using Partial Convolutions

HTML 25 5 Updated Jun 22, 2020

official implementation of Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals

Python 8 3 Updated Nov 15, 2024

Application of MB-iSTFT-VITS components to vits2_pytorch

Python 119 29 Updated Nov 19, 2024

TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…

Python 3,732 360 Updated Dec 25, 2024

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 29,819 12,311 Updated Dec 25, 2024

Python ProxyPool for web spider

Python 21,787 5,218 Updated Sep 10, 2024

This document contains the functions that are currently available in the RobustSP toolbox: a Matlab toolbox for robust signal processing. The toolbox can be freely used for non-commercial use only.…

MATLAB 16 10 Updated Nov 15, 2019

Moodle - the world's open source learning platform

PHP 5,850 6,685 Updated Dec 19, 2024

A feature-rich command-line audio/video downloader

Python 94,245 7,367 Updated Dec 23, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,238 117 Updated Jul 11, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,675 184 Updated Nov 14, 2024

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Python 686 156 Updated Apr 6, 2023

Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…

Python 42 4 Updated Jul 29, 2024

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,292 141 Updated Jun 6, 2024

NeMo text processing for ASR and TTS

Python 288 91 Updated Dec 11, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 8,763 842 Updated Dec 18, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 127 3 Updated Dec 6, 2024

🎛 🔊 A Python library for audio.

C++ 5,297 269 Updated Nov 26, 2024
Next