Improving matchings for the Standards with SoWs #58

Open
asitang opened this issue Nov 4, 2019 · 0 comments


asitang commented Nov 4, 2019

Some initial exploratory paths:

Cosine similarity over n-gram TF-IDF vectors:

  • fast
  • interpretable (the matching n-grams can be inspected)
  • no word sense (synonyms and ambiguous words are not handled)
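A minimal sketch of this first path, using only the Python standard library. The example texts, the function names, and the smoothed IDF formula are assumptions made for illustration, not the project's actual pipeline. It also shows where the interpretability comes from: the shared n-grams behind a score can be listed directly.

```python
import math
from collections import Counter


def word_ngrams(text, n_max=2):
    """Return all word n-grams of length 1..n_max from a lowercased text."""
    tokens = text.lower().split()
    grams = []
    for n in range(1, n_max + 1):
        grams += [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    return grams


def tfidf_vectors(docs, n_max=2):
    """Build one sparse TF-IDF vector (dict: n-gram -> weight) per document."""
    counts = [Counter(word_ngrams(d, n_max)) for d in docs]
    n_docs = len(docs)
    # Document frequency: in how many documents each n-gram appears.
    df = Counter(g for c in counts for g in c)
    # Smoothed IDF (an assumption; other weighting variants exist).
    idf = {g: math.log((1 + n_docs) / (1 + df[g])) + 1.0 for g in df}
    return [{g: tf * idf[g] for g, tf in c.items()} for c in counts]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(g, 0.0) for g, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def shared_terms(u, v, top=3):
    """List the n-grams that contribute most to the dot product."""
    contrib = {g: u[g] * v[g] for g in u.keys() & v.keys()}
    return sorted(contrib, key=contrib.get, reverse=True)[:top]


# Hypothetical texts standing in for one standard and two SoWs.
standard = "welding procedure qualification for steel structures"
sow_a = "contractor shall follow the welding procedure qualification requirements"
sow_b = "deliver monthly financial status reports"

vecs = tfidf_vectors([standard, sow_a, sow_b])
sim_a = cosine(vecs[0], vecs[1])
sim_b = cosine(vecs[0], vecs[2])
print(sim_a, sim_b, shared_terms(vecs[0], vecs[1], top=10))
```

The "no word sense" limitation is visible here too: a SoW that says "soldering" instead of "welding" would share no n-grams with the standard and would score zero.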

Use paragraph2vec (paragraph vectors) to model each whole text to be matched:

  • captures whole-text context (word relationships) and word sense better
  • faster similarity computation, since each text is a single vector
  • no interpretability

Use contextual word embeddings to model each token, then use soft cosine to compute the similarity.
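A minimal sketch of the soft cosine measure, which replaces the plain dot product with a term-similarity-weighted one: soft_cos(a, b) = aᵀSb / sqrt(aᵀSa · bᵀSb). Everything concrete below is an assumption for illustration: the three-word vocabulary, the 2-d toy embeddings, and the choice to clip negative similarities to zero. Static toy vectors stand in for contextual embeddings here; with a contextual model, each token occurrence would have its own vector, so the matrix S would be built over occurrences rather than vocabulary entries.

```python
import math


def cosine(u, v):
    """Plain cosine similarity between two dense vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def similarity_matrix(embeddings):
    """Term-term similarity matrix S from embeddings (negatives clipped to 0)."""
    return [[max(0.0, cosine(a, b)) for b in embeddings] for a in embeddings]


def soft_cosine(a, b, S):
    """Soft cosine between bag-of-words vectors a and b under similarity S."""
    def form(x, y):
        # Bilinear form x^T S y.
        return sum(S[i][j] * x[i] * y[j]
                   for i in range(len(x)) for j in range(len(y)))
    denom = math.sqrt(form(a, a) * form(b, b))
    return form(a, b) / denom if denom else 0.0


# Hypothetical vocabulary and toy embeddings (not from any trained model).
vocab = ["weld", "solder", "invoice"]
embeddings = [[1.0, 0.1], [0.9, 0.2], [0.0, 1.0]]
S = similarity_matrix(embeddings)

# Bag-of-words counts over vocab for three one-word "documents".
doc_weld = [1, 0, 0]
doc_solder = [0, 1, 0]
doc_invoice = [0, 0, 1]

plain_ws = cosine(doc_weld, doc_solder)          # no shared terms
soft_ws = soft_cosine(doc_weld, doc_solder, S)   # related terms still count
soft_wi = soft_cosine(doc_weld, doc_invoice, S)  # unrelated terms count little
print(plain_ws, soft_ws, soft_wi)
```

When S is the identity matrix, soft cosine reduces to ordinary cosine similarity, so this path can be read as the TF-IDF approach with the exact-match requirement relaxed.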
