HeCheng0625/README.md

Hi there 👋

I'm Yuancheng Wang (王远程), a PhD student in SDS at the Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), supervised by Prof. Zhizheng Wu. Before that, I received my B.S. degree from CUHK-Shenzhen. My research interests include multi-modal LLMs, generative AI for speech and audio, post-training, and representation learning. I am currently a research scientist intern at Meta Superintelligence Labs, working on enhancing the speech capabilities of Llama models. Previously, I interned at Microsoft Research Asia (MSRA) and ByteDance.

I have developed several advanced TTS models, including NaturalSpeech 3 and MaskGCT, and I am one of the main contributors to and leaders of the open-source Amphion toolkit. My work has been published at top international AI conferences such as NeurIPS, ICML, ICLR, ACL, and IEEE SLT. I am currently looking for a full-time position. Feel free to contact me if you are interested in my experience!

🔗 Homepages

Popular repositories

  1. Diffusion-Speech-Tokenizer (Public)

    This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Lan…

    Python 153 10

  2. AUDIT_v2 (Public)

    Python 4

  3. Amphion (Public)

    Forked from open-mmlab/Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

    Python 2 1

  4. DDA4230 (Public)

    Jupyter Notebook

  5. CSC4080-Project (Public)

    Python

  6. CS61B (Public)

    Java