←back to thread

Embeddings are underrated (2024)

(technicalwriting.dev)
484 points jxmorris12 | 3 comments | | HN request time: 0.798s | source
1. nakedneuron ◴[] No.43968176[source]
Question for the experts: As embeddings in question reflect content is there another vector for style? I was wondering if stylometry research would profit from embeddings becoming more easily accessible than ever. I dug my head into this matter some time ago and believe this would be the right tool.
replies(2): >>43968366 #>>43968836 #
2. n3t ◴[] No.43968366[source]
If you have a dataset for which you know which elements have similar/same style, you can create your own embeddings for style.
3. navar ◴[] No.43968836[source]
Embeddings as a tool have been around for longer than LLMs. They were (and are) ubiquitous in, e.g., recommender systems. It sounds maybe this would be more in-line with what you are looking for. In this case, check out https://github.com/benfred/implicit - I have used it in the past with great success.