Gemma 3 and Wan2.1 #357
DCVirtualCosmos
started this conversation in
Ideas
Replies: 1 comment
-
Bump. Gemma 3n 2B and 4B is more efficient and optimized model that can run on lower-end hardware as well. Currently not yet added to taggui. It's available via transformers library - https://huggingface.co/blog/gemma3n |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
With the release of those powerful models, one could start dreaming of a tool to caption videos to train Wan LoRAs. And Gemma 3 seems the perfect tool. It's pretty smart, follow instruction quite well, there are versions of it uncensored already on hugging face, and it can analyze perfectly a sequence of images to describe what is happening in a short video.
So, it would be nice if Taggui could:
I will try to do this myself when I got time, but perhaps you are faster!
Beta Was this translation helpful? Give feedback.
All reactions