/top/
/new/
/best/
/ask/
/show/
/job/
^
slacker news
login
about
←back to thread
Karpathy on DeepSeek-OCR paper: Are pixels better inputs to LLMs than text?
(twitter.com)
237 points
JnBrymn
| 1 comments |
21 Oct 25 17:43 UTC
|
HN request time: 0.973s
|
source
https://xcancel.com/karpathy/status/1980397031542989305
Show context
sabareesh
◴[
22 Oct 25 22:18 UTC
]
No.
45675879
[source]
▶
>>45658928 (OP)
#
It might be that our current tokenization is inefficient compared to how well image pipeline does. Language already does lot of compression but there might be even better way to represent it in latent space
replies(3):
>>45675953
#
>>45676049
#
>>45677115
#
1.
◴[
23 Oct 25 01:14 UTC
]
No.
45677115
[source]
▶
>>45675879
#
ID:
GO
↑