Production RAG: what I learned from processing 5M+ documents
(blog.abdellatif.io)
548 points
tifa2up
| 1 comment |
20 Oct 25 15:55 UTC
383toast | 20 Oct 25 17:39 UTC | No. 45646734
>>45645349 (OP)
They should've tested other embedding models; there are better (and cheaper) ones than OpenAI's.
replies(1): >>45646823
prettyblocks | 20 Oct 25 17:44 UTC | No. 45646823
>>45646734
Which do you suggest?
replies(2): >>45646987, >>45647899
leftnode | 20 Oct 25 19:10 UTC | No. 45647899
>>45646823
The Qwen3 600M and 4B embedding models are near state of the art and aren't too computationally intensive.