←back to thread

747 points porridgeraisin | 1 comments | | HN request time: 0.374s | source
Show context
JCM9 ◴[] No.45063064[source]
Not a surprise. All the major players have reached the limits of training on existing data—they’re already training on essentially the whole internet plus a bunch of content they allegedly stole (hence various lawsuits). There haven’t been any major breakthroughs in model architecture from the major players recently and thus they’re now in a battle for more data to train on. They need data, and they want YOUR data, now, and are gonna do increasingly shady things to get it.
replies(5): >>45063645 #>>45063676 #>>45063696 #>>45064759 #>>45064804 #
1. imiric ◴[] No.45064804[source]
Yeah, this is hardly surprising.

To AI companies, data is even more of a gold mine than to adtech companies. It is existentially important.

The truly evil behavior will emerge at the intersection of these two industries. I'm sure Google and Facebook are already using data from one to power the other, even if it's currently behind closed doors. I can hardly wait for the use cases these geniuses will think of once this is publicly acceptable and in widespread use by all companies.