Starred repositories

Psychoacoustic Calibration for Efficient Neural Audio Coding

Python 22 8 Updated Sep 26, 2023

Minimal Implementation of Visual Autoregressive Modelling (VAR)

Python 15 Updated Jan 7, 2025

A production-ready implementation of WaveRNN-based autoregressive waveform synthesis.

Python 8 Updated Oct 24, 2021

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

138 12 Updated Jan 8, 2025

PyTorch building blocks for OLMo

Python 46 5 Updated Jan 9, 2025

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 96 5 Updated Jan 3, 2025

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

Jupyter Notebook 451 42 Updated Jan 7, 2025
Python 31 1 Updated Jan 9, 2025

Taming Stable Diffusion for Lip Sync!

Python 1,364 131 Updated Jan 8, 2025

A Python package for calculating PESQ.

Python 367 70 Updated Apr 24, 2023

🔥 A minimal training framework for scaling FLA models

12 Updated Jan 6, 2025

[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"

Python 174 8 Updated Sep 30, 2024
Python 15 1 Updated Jan 8, 2025

PyTorch implementation of the NSGT/sliCQT

Python 15 1 Updated Nov 10, 2023

Music demixing with the sliCQ Transform and PyTorch

Python 28 6 Updated Nov 10, 2023

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,437 196 Updated Jan 8, 2025

Swiftly get tons of images from indexed tars on Huggingface

Python 37 1 Updated Dec 19, 2024

A Video Tokenizer Evaluation Dataset

Python 79 5 Updated Dec 30, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 3,665 189 Updated Jan 8, 2025

Clean and modernized implementation of FastSpeech2/LightSpeech using IPA

Python 2 Updated Aug 16, 2024
Python 9 Updated Sep 28, 2024

PageRank for LLMs

Jupyter Notebook 35 2 Updated Jan 9, 2025

StyleTTS 2 Optimized Training Fork

Python 8 Updated Jan 2, 2025

Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).

Python 23 1 Updated Aug 11, 2023

Converts text to speech in realtime

Python 2,226 218 Updated Jan 6, 2025

Tile primitives for speedy kernels

Cuda 9 Updated Nov 22, 2024

The official repository for the paper "Optimal Flow Matching: Learning Straight Trajectories in Just One Step" (NeurIPS 2024)

Jupyter Notebook 53 1 Updated Dec 19, 2024
Python 52 Updated Dec 22, 2024
Python 13 1 Updated Dec 30, 2024