Skip to content

Commit b3d67dc

Browse files
committed
Add reference of univnet implementation
1 parent 2c0d8d7 commit b3d67dc

File tree

2 files changed

+6
-1
lines changed

2 files changed

+6
-1
lines changed

Diff for: README.md

+1
Original file line numberDiff line numberDiff line change
@@ -271,6 +271,7 @@ credit a few of the amazing folks in the community that have helped make this ha
271271
- [Ramesh et al](https://arxiv.org/pdf/2102.12092.pdf) who authored the DALLE paper, which is the inspiration behind Tortoise.
272272
- [Nichol and Dhariwal](https://arxiv.org/pdf/2102.09672.pdf) who authored the (revision of) the code that drives the diffusion model.
273273
- [Jang et al](https://arxiv.org/pdf/2106.07889.pdf) who developed and open-sourced univnet, the vocoder this repo uses.
274+
- [Kim and Jung](https://github.com/mindslab-ai/univnet) who implemented univnet pytorch model.
274275
- [lucidrains](https://github.com/lucidrains) who writes awesome open source pytorch models, many of which are used here.
275276
- [Patrick von Platen](https://huggingface.co/patrickvonplaten) whose guides on setting up wav2vec were invaluable to building my dataset.
276277

Diff for: tortoise/models/vocoder.py

+5-1
Original file line numberDiff line numberDiff line change
@@ -223,7 +223,11 @@ def remove_weight_norm(self):
223223

224224

225225
class UnivNetGenerator(nn.Module):
226-
"""UnivNet Generator"""
226+
"""
227+
UnivNet Generator
228+
229+
Originally from https://github.com/mindslab-ai/univnet/blob/master/model/generator.py.
230+
"""
227231

228232
def __init__(self, noise_dim=64, channel_size=32, dilations=[1,3,9,27], strides=[8,8,4], lReLU_slope=.2, kpnet_conv_size=3,
229233
# Below are MEL configurations options that this generator requires.

0 commit comments

Comments
 (0)