←back to thread

577 points simonw | 1 comments | | HN request time: 0.201s | source
Show context
croes ◴[] No.44723485[source]
I bet the training data included enough space invader cloned in JS
replies(3): >>44723515 #>>44723527 #>>44724105 #
jplrssn ◴[] No.44723515[source]
I also wouldn't be surprised if labs were starting to mix in a few pelican SVGs into their training data.
replies(3): >>44723675 #>>44723681 #>>44723878 #
diggan ◴[] No.44723681[source]
Even "accidentally" it makes sense that "SVGs of pelicans riding bikes" are now included into datasets used for training as it has spread as a wildfire on the internet, making it less useful as a simple benchmark.

This is why I keep all my benchmarks private and don't share anything about them publicly, as soon as you write about them anywhere publicly they'll stop being useful in some months.

replies(1): >>44723789 #
toyg ◴[] No.44723789[source]
> This is why I keep all my benchmarks private

This is also why, if I were an artist or anyone commercially relying on creative output of any kind, I wouldn't be posting anything on the internet anymore, ever. The minute you make anything public, the engines will clone it to death and turn it into a commodity.

replies(2): >>44724505 #>>44724610 #
1. debugnik ◴[] No.44724610[source]
That makes it so much harder to show art to people and market yourself though.

I considered experimenting with web DRM for art sites/portfolios, on the assumption that scrappers won't bother with the analog loophole (and dedicated art-style cloners would hopefully be disappointed by the quality), but gave up because of limited compatible devices for the strongest DRM levels, and HDCP being broken on those levels anyway. If the DRM technique caught on it would take attackers, at most, a few bucks and hours once to bypass it, and I don't think users would truly understand that upfront.