Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

text-to-pose: Use more datasets #3

Open
1 of 6 tasks
AmitMY opened this issue May 16, 2022 · 0 comments
Open
1 of 6 tasks

text-to-pose: Use more datasets #3

AmitMY opened this issue May 16, 2022 · 0 comments
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@AmitMY
Copy link
Contributor

AmitMY commented May 16, 2022

The sign-language-datasets package includes datasets that could be used for this task:

  • dicta sign (dictionary - hamnosys / pose)
  • sign2mint (dictionary - signwriting / videos. poses can be easily extracted)
  • signtyp (signwriting / videos) - quite noisy, I don't think it is worth using.
  • swojs_glossario (signwiriting / videos) - very high quality, but long sentences rather than single words. I don't think it is worth using.

There are also more datasets, not currently available in sign-language-datasets, that would probably be good to add:

  • dgs lexicon (dictionary - hamnosys / videos) - The DGS Corpus includes a lexicon for all the glosses. When clicking on a gloss (skip all $s for ease) then we have this for example which includes hamnosys and a video. Sometimes, only hamnosys exists, but with references in the DGS corpus when this hamnosys was performed. This is very powerful to get multiple almost-aligned examples for the same sequence (could probably extract 50-200k+ samples), in a non-dictionary manner
  • The polish sign language corpus includes many dictionary videos (example) with hamnosys
@AmitMY AmitMY added enhancement New feature or request help wanted Extra attention is needed labels May 16, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant