176 points nxa | 5 comments

I've been playing with embeddings and wanted to see what the embedding layer produces from plain word-by-word input combined with addition and subtraction, beyond the examples most videos and papers mention (like the obvious king-man+woman=queen). So I built something that doesn't just give the top answer, but ranks the matches by distance / cosine similarity. I polished it a bit so that others can try it out, too.

For now, I only have nouns (and some proper nouns) in the dataset, and pick the most common interpretation among the homographs. Also, it's case sensitive.
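
To make the ranking idea concrete, here is a minimal sketch of word arithmetic followed by a cosine-similarity ranking. It is not the code behind the tool: the three-dimensional vectors in `TOY_EMBEDDINGS` are made up for illustration, `rank_matches` is a hypothetical helper name, and leaving the input words out of the results is an assumption rather than something the post states.

```python
import math

# Hypothetical toy vectors, invented for this sketch. Real embeddings have
# hundreds of dimensions and come from a trained model.
TOY_EMBEDDINGS = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.9, 0.1, 0.8],
    "man": [0.1, 0.9, 0.1],
    "woman": [0.1, 0.1, 0.9],
    "Man": [0.0, 0.2, 0.1],  # case-sensitive: a separate entry from "man"
}


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank_matches(plus, minus, embeddings, top_n=3):
    """Add the `plus` vectors, subtract the `minus` vectors, then rank the
    remaining vocabulary by cosine similarity to the result."""
    dim = len(next(iter(embeddings.values())))
    target = [0.0] * dim
    for word in plus:
        target = [t + v for t, v in zip(target, embeddings[word])]
    for word in minus:
        target = [t - v for t, v in zip(target, embeddings[word])]

    # Assumption: input words are excluded so they cannot match themselves.
    excluded = set(plus) | set(minus)
    scored = [
        (word, cosine_similarity(target, vec))
        for word, vec in embeddings.items()
        if word not in excluded
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_n]


if __name__ == "__main__":
    # king - man + woman, ranked instead of returning only the top hit
    for word, score in rank_matches(["king", "woman"], ["man"], TOY_EMBEDDINGS):
        print(f"{word}: {score:.3f}")
```

Because the lookup is a plain dictionary keyed by the exact string, "man" and "Man" are different entries with different vectors, which is one way a case-sensitive dataset can end up giving very different answers for the two spellings.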

nikolay ◴[] No.43988786[source]
Really?!

  man - brain = woman
  woman - brain = businesswoman
replies(6): >>43988818 #>>43988887 #>>43988910 #>>43988964 #>>43988972 #>>43989276 #
1. 2muchcoffeeman ◴[] No.43988818[source]
Man - brain = Irish sea
replies(1): >>43988825 #
2. nikolay ◴[] No.43988825[source]
Case matters, obviously! Try "man" with a lower-case "M"!
replies(1): >>43988974 #
3. Alifatisk ◴[] No.43988974[source]
Why does case matter? How does it affect the meaning?
replies(2): >>43989004 #>>43989028 #
4. bfLives ◴[] No.43989004{3}[source]
“Man” is probably being interpreted as the Isle of Man.

https://en.m.wikipedia.org/wiki/Isle_of_Man

5. G1N ◴[] No.43989028{3}[source]
Man (capital M) is probably being interpreted as some proper noun, maybe Isle of Man in this case?