to calculate semantic similarity between two words such as happy and sad #33

Open
xinli2008 opened this issue Mar 29, 2022 · 1 comment

Comments

@xinli2008

Hello, can this tool be used to calculate the semantic similarity between two words such as "happy" and "sad"?
@MaartenGr
Owner

Yes! PolyFuzz is meant to find the distance between two sets of strings. Distance can mean string distance, i.e. how many edits it takes to turn one string into another, or it can mean distance in terms of semantic similarity.
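
To make the string-distance reading concrete, here is a minimal, standard-library sketch of Levenshtein edit distance (insertions, deletions, substitutions). It is only an illustration of the idea and not PolyFuzz's own implementation; the function name `levenshtein` is a hypothetical helper chosen for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn `a` into `b`.

    Illustrative helper only; not part of PolyFuzz.
    """
    # previous[j] holds the distance between the processed prefix of `a`
    # and the first j characters of `b`.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,         # delete ch_a
                    current[j - 1] + 1,      # insert ch_b
                    previous[j - 1] + cost,  # substitute, or keep on a match
                )
            )
        previous = current
    return previous[-1]


if __name__ == "__main__":
    print(levenshtein("happy", "sad"))
```

A measure like this only looks at characters, so it says nothing about whether two words mean similar or opposite things.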

Semantic similarity is typically computed with embedding techniques such as Word2Vec, FastText, or transformer models, which have shown large performance gains. To use one of these techniques, you can follow along with the guide here.
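
As a rough sketch of what an embedding-based comparison does: each word is mapped to a vector, and similarity is commonly measured as the cosine of the angle between the vectors. The example below uses only the standard library. The three-dimensional vectors in `toy_vectors` are made up for illustration and are not the output of any real model; in practice they would come from Word2Vec, FastText, or a transformer. The names `cosine_similarity` and `toy_vectors` are hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Made-up vectors for illustration only; real embeddings have hundreds of
# dimensions and are produced by a trained model.
toy_vectors = {
    "happy": [0.9, 0.1, 0.3],
    "joyful": [0.8, 0.2, 0.4],
    "sad": [-0.8, 0.2, 0.3],
}

if __name__ == "__main__":
    for word in ("joyful", "sad"):
        score = cosine_similarity(toy_vectors["happy"], toy_vectors[word])
        print(f"happy vs {word}: {score:.3f}")
```

With these toy values, "happy" scores higher against "joyful" than against "sad", but that is a property of the hand-picked numbers. How a real model scores a pair like "happy" and "sad" depends on the model.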
