182 points BUFU | 3 comments
Patrick_Devine ◴[] No.42070613[source]
This was a pretty heavy lift for us to get out, which is why it took a while. In addition to writing new image processing routines and a vision encoder and implementing cross-attention, we also ended up re-architecting the way the scheduler runs models. We'll have a technical blog post out soon about everything that ended up changing.
replies(4): >>42070644 #>>42071917 #>>42072723 #>>42076774 #
1. csomar ◴[] No.42072723[source]
Any info on when we will get the 11B and 90B models?
replies(1): >>42076770 #
2. jjice ◴[] No.42076770[source]
Not sure if I'm misunderstanding, but they're live: https://ollama.com/library/llama3.2-vision

Ran the 11B yesterday and it worked great.

replies(1): >>42083795 #
3. csomar ◴[] No.42083795[source]
These are vision-optimized, though? Or does that not make them perform worse on coding tasks?