
321 points jhunter1016 | 6 comments
WithinReason ◴[] No.41878604[source]
Does OpenAI have any fundamental advantage beyond brand recognition?
replies(15): >>41878633 #>>41878979 #>>41880635 #>>41880834 #>>41881554 #>>41881647 #>>41881720 #>>41881764 #>>41881926 #>>41882221 #>>41882479 #>>41882695 #>>41883076 #>>41883128 #>>41883207 #
idunnoman1222 ◴[] No.41881647[source]
Yes, they already collected all the data. The same data has had walls put up around it
replies(4): >>41881678 #>>41882077 #>>41882200 #>>41882333 #
1. ugh123 ◴[] No.41882077[source]
Which data? Is it data that Google and/or Meta can't get or don't already have?
replies(2): >>41883016 #>>41883115 #
2. jazzyjackson ◴[] No.41883016[source]
Well, at this point most new data being created is conversations with ChatGPT, seeing as how Stack Overflow and Reddit are increasingly useless, so their conversation logs are their moat.
replies(2): >>41883911 #>>41884055 #
3. charlieyu1 ◴[] No.41883115[source]
AI companies have been paying people to create new data for a while
replies(1): >>41883947 #
4. staticautomatic ◴[] No.41883911[source]
There’s tons of human-created data the AI companies aren’t using yet.
5. ugh123 ◴[] No.41883947[source]
Do you mean by RLHF? If so, that's not 'data' used by the model in the traditional sense.
6. sangnoir ◴[] No.41884055[source]
> so their conversation logs are their moat

Google and Meta aren't exactly lacking in conversation data: Facebook, Messenger, Instagram, Google Talk, Google Groups, Google Plus, Blogspot comments, YouTube transcripts, etc. The breadth and depth of data those 2 companies are sitting on that goes back for years is mind-boggling.