(arxiv.org)

177 points lnyan | 3 comments | 18 Nov 24 09:44 UTC

1. startupsfail ◴[18 Nov 24 17:36 UTC] No.42174762[source]▶

Generating data with an OpenAI model AND copying the approach from an OpenAI model. This is a bit unsatisfactory; it's like saying you wrote some working code when in fact you decompiled the binary and then compiled it again.

replies(1): >>42175872 #

2. exe34 ◴[18 Nov 24 19:19 UTC] No.42175872[source]▶

>>42174762 (TP) #

well, if you have working code at the end, you made progress. closedAI can pull any model at any time for a profit.

replies(1): >>42200516 #

3. startupsfail ◴[21 Nov 24 02:35 UTC] No.42200516[source]▶

>>42175872 #

Yes, agreed. And it’s not like OpenAI isn’t doing the same thing, in a sense. Data was originally sampled from human annotations.

LLaVA-O1: Let Vision Language Models Reason Step-by-Step