
176 points nxa | 3 comments

I've been playing with embeddings and wanted to see what results the embedding layer produces from plain word-by-word input and addition/subtraction, beyond the examples many videos/papers mention (like the obvious king − man + woman = queen). So I built something that doesn't just give the first answer, but ranks the matches by distance / cosine similarity. I polished it a bit so that others can try it out, too.

For now, the dataset only has nouns (and some proper nouns), and I pick the most common interpretation among homographs. Also, it's case-sensitive.
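The core idea described above can be sketched as follows. This is a minimal illustration, not the author's actual code: the 4-d vectors and vocabulary here are made-up toy values, whereas a real tool would use learned embeddings (e.g. word2vec or GloVe) over a large noun vocabulary.

```python
import numpy as np

# Hypothetical toy embedding table for illustration only.
EMB = {
    "king":  np.array([0.9, 0.8, 0.1, 0.0]),
    "queen": np.array([0.9, 0.1, 0.8, 0.0]),
    "man":   np.array([0.1, 0.9, 0.1, 0.0]),
    "woman": np.array([0.1, 0.1, 0.9, 0.0]),
    "apple": np.array([0.0, 0.1, 0.0, 0.9]),
}

def cosine(a, b):
    """Cosine similarity between two vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def analogy(plus, minus, top_k=3):
    """Sum the 'plus' vectors, subtract the 'minus' vectors, then rank
    every remaining vocabulary word by cosine similarity to the result."""
    target = sum(EMB[w] for w in plus) - sum(EMB[w] for w in minus)
    exclude = set(plus) | set(minus)  # don't return the input words themselves
    scores = {w: cosine(target, v) for w, v in EMB.items() if w not in exclude}
    return sorted(scores.items(), key=lambda kv: -kv[1])[:top_k]

# king - man + woman: the top-ranked match (with this toy data) is "queen".
print(analogy(plus=["king", "woman"], minus=["man"]))
```

Ranking the whole vocabulary rather than returning the single nearest word is what lets the tool show runner-up interpretations with their similarity percentages.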

1. matallo ◴[] No.43989584[source]
uncle + aunt = great-uncle (91%)

great idea, but I find the results unamusing

replies(1): >>43989784 #
2. HWR_14 ◴[] No.43989784[source]
Your aunt's uncle is your great-uncle. It's more correct than your intuition.
replies(1): >>43989911 #
3. matallo ◴[] No.43989911[source]
I asked ChatGPT (after posting my comment) and this is the response. "Uncle + Aunt = Great-Uncle is incorrect. A great-uncle is the brother of your grandparent."