Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Alyawarra (kinship) data dictionary is missing names for entities/relations #2

Open
cmeb45 opened this issue Nov 29, 2020 · 0 comments

Comments

@cmeb45
Copy link

cmeb45 commented Nov 29, 2020

Hello!

Thank you for putting together and releasing these exercises. I am currently going through Exercise 6 and I noticed that the data dictionary lookups for the Alyawarra (kinship) dataset (data/kinship/bin/idx2ent.npy and data/kinship/bin/idx2rel.npy) appear to only contain indices and not the names of the entities/relations. It would seem that this information is necessary for the interpretability of the t-SNE visualization and k-NN of the entity/relation embeddings.

I did find a dictionary for the original Alyawarra dataset from 1971 at Kinsources. However, the codes for the relation types there range from 1 to 29, while the indices in data/kinship/bin/idx2rel.npy range from 0 to 25, so I am not sure of the mapping between these two sets of values.

Is there anything that I am missing? Any insights would be very helpful.

Many thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant