You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
• [[Presentation on YouTube](https://www.youtube.com/watch?v=Bucb3nAa398)] ([Can't watch YouTube?](https://a3s.fi/swift/v1/AUTH_a235c0f452d648828f745589cde1219a/specvqgan_public/SpecVQGAN%20YouTube.mp4))
10
-
•
10
+
•
11
11
12
12
[](https://colab.research.google.com/drive/1pxTIMweAKApJZ3ZFqyBee3HtMqFpnwQ0?usp=sharing)
13
13
@@ -131,7 +131,7 @@ Each part holds ~3k clips and can be used independently as a subset of the whole
131
131
132
132
For `BN Inception` features, we employ the same procedure as [RegNet](https://github.com/PeihaoChen/regnet#data-preprocessing).
133
133
134
-
For `ResNet50` features, we rely on [video_features](https://v-iashin.github.io/video_features/models/resnet/)
134
+
For `ResNet50` features, we rely on [video_features (branch `specvqgan`)](https://github.com/v-iashin/video_features/tree/specvqgan)
135
135
repository and used these commands:
136
136
```bash
137
137
# VAS (few hours on three 2080Ti)
@@ -299,9 +299,9 @@ The setting **(a)**: the transformer is trained on *VGGSound* to sample from the
@@ -565,7 +565,7 @@ Please, use this bibtex if you would like to cite our work
565
565
# Acknowledgments
566
566
Funding for this research was provided by the Academy of Finland projects 327910 & 324346. The authors acknowledge CSC — IT Center for Science, Finland, for computational resources for our experimentation.
567
567
568
-
We also acknowledge the following codebases:
568
+
We also acknowledge the following work:
569
569
- The code base is built upon an amazing [taming-transformers](https://github.com/CompVis/taming-transformers) repo.
570
570
Check it out if you are into high-res image generation.
571
571
- The implementation of some evaluation metrics is partially borrowed and adapted from [torch-fidelity](https://github.com/toshas/torch-fidelity).
0 commit comments