(blog.abdellatif.io)

548 points tifa2up | 1 comments | 20 Oct 25 15:55 UTC

383toast ◴[20 Oct 25 17:39 UTC] No.45646734

They should've tested other embedding models; there are better (and cheaper) ones than OpenAI's.

replies(1): >>45646823 #

Which do you suggest?

1. leftnode ◴[20 Oct 25 19:10 UTC] No.45647899

The Qwen3 600M and 4B embedding models are near state of the art and aren't too computationally intensive.

Production RAG: what I learned from processing 5M+ documents