Skip to content
This repository has been archived by the owner on Dec 11, 2019. It is now read-only.

This experiment was conducted in collaboration with ARG-tech. Goal: develop score for context similarity between two documents. Results are impressive but performance is poor.

License

Notifications You must be signed in to change notification settings

digital-peace-talks/Eperiment-1-Synonymity-Scoring

 
 

Repository files navigation

About

Segments texts, extracts weighed keywords, conducts a synonym set walk for every possible keyword pair and stores the calculated distances.

Parts of this research project will happen inside Colaboratory as well.

Segmenting

We segment documents in order to increase the accuracy of the sysnonymity scoring.

A simple CLI tool cmd/cli/segmenter.go which is using github.com/jdkato/prose is user for the first steps of segmenting.

To run it simply provide your string as first argument:

go run cmd/segmenter/*.go "Hello darkness my old friend"

Argument Analysis API

We extract and weigh keywords, conduct sysnonym set walks and store the calculated disctances. For a detailed API documentation, please refer to the APIBlueprint located in apiary.apib or check out the interactive docs at Apiary.

License

General

Find the corresponding license attached in LICENSE. This project is meant for experimenting which means using and vendoring in foreign code where suitable. This license does only applies to code written by contributors, external libraries can differ and got their licenses stored in the respective vendor directory.

Code for ADW

The licensing from the original project applies without any changes.

ADW (Align, Disambiguate and Walk) -- A Unified Approach for Measuring Semantic Similarity.

Copyright (c) 2014 Sapienza University of Rome. All Rights Reserved.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

If you use this system, please cite the following paper:

M. T. Pilehvar, D. Jurgens and R. Navigli. Align, Disambiguate and Walk: A Unified Approach for Measuring Semantic Similarity. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), Sofia, Bulgaria, August 4-9, 2013, pp. 1341-1351.

Code for wrapping around ADW

The code for wrapping around the ADW implementation is licensed through the same LICENSE as the original library.

Thanks to the respective authors and developers for providing their work.

About

This experiment was conducted in collaboration with ARG-tech. Goal: develop score for context similarity between two documents. Results are impressive but performance is poor.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Go 35.1%
  • Makefile 25.8%
  • Python 24.7%
  • API Blueprint 12.6%
  • Kotlin 1.5%
  • Shell 0.3%