
176 points by nxa | 1 comment | source

I've been playing with embeddings and wanted to see what results the embedding layer produces from word-by-word input plus simple addition / subtraction, beyond the examples many videos / papers mention (like the obvious king - man + woman = queen). So I built something that doesn't just give the first answer, but ranks the matches by distance / cosine similarity. I polished it a bit so that others can try it out, too.

For now, the dataset only contains nouns (and some proper nouns), and I pick the most common interpretation among homographs. Also, it's case-sensitive.
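The core idea described above can be sketched roughly as follows. This is a toy illustration with made-up 3-d vectors, not the author's actual implementation or real embedding data; the `rank` helper and vocabulary are hypothetical:

```python
# Sketch: embedding arithmetic with results ranked by cosine similarity.
# Toy 3-d vectors stand in for real embeddings (purely illustrative values).
import numpy as np

def cosine(a, b):
    # Cosine similarity between two vectors.
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def rank(query_vec, vocab, top_k=3):
    # Sort the whole vocabulary by similarity to the query vector,
    # most similar first, and return the top_k words.
    scored = sorted(vocab.items(),
                    key=lambda kv: cosine(query_vec, kv[1]),
                    reverse=True)
    return [word for word, _ in scored[:top_k]]

vocab = {
    "king":  np.array([0.9, 0.8, 0.1]),
    "queen": np.array([0.9, 0.1, 0.8]),
    "man":   np.array([0.1, 0.9, 0.1]),
    "woman": np.array([0.1, 0.1, 0.9]),
}

# The classic analogy: king - man + woman should land near queen.
q = vocab["king"] - vocab["man"] + vocab["woman"]
print(rank(q, vocab))  # "queen" ranks first with these toy vectors
```

Note that analogy tools built on real embeddings (e.g. gensim's `most_similar`) typically also exclude the input words from the results, since the inputs themselves often score near the top.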

Jimmc414 ◴[] No.43990190[source]
dog - cat = paleolith

paleolith + cat = Paleolithic Age

paleolith + dog = Paleolithic Age

paleolith - cat = neolith

paleolith - dog = hand ax

cat - dog = meow

Wonder if some of the math is off, or I'm not using this properly.

1. Glyptodon ◴[] No.43999589[source]
I figure the mathematically highest-scoring match must differ from the semantically most accurate one fairly often. (Because Car - Wheel = Touring Car doesn't make a lot of sense to me.)