An image of an archeologist adventurer who wears a hat and uses a bullwhip

(theaiunderwriter.substack.com)

1503 points participant3 | 2 comments | 03 Apr 25 17:55 UTC

MgB2 ◴[03 Apr 25 20:24 UTC] No.43574927[source]▶

Idk, the models generating what are basically 1:1 copies of the training data from pretty generic descriptions feels like a severe case of overfitting to me. What use is a generative model that just regurgitates the input?

I feel like the less advanced generations, maybe even because of their limitations in terms of size, were better at coming up with something that at least feels new.

In the end, other than for copyright-washing, why wouldn't I just use the original movie still/photo in the first place?

replies(13): >>43575052 #>>43575080 #>>43575231 #>>43576085 #>>43576153 #>>43577026 #>>43577350 #>>43578381 #>>43578512 #>>43578581 #>>43579012 #>>43579408 #>>43582494 #

ramraj07 ◴[04 Apr 25 04:40 UTC] No.43578381[source]▶

>>43574927 #

So I train a model to say y=2, and then I ask the model to guess the value of y and it says 2, and you call that overfitting?

Overfitting is if you didn't exactly describe Indiana Jones and then it still gave Indiana Jones.

replies(2): >>43578447 #>>43579929 #

MgB2 ◴[04 Apr 25 04:51 UTC] No.43578447[source]▶

>>43578381 #

The prompt didn't exactly describe Indiana Jones though. It left a lot of freedom for the model to make the "archeologist" e.g. female, Asian, put them in a different time period, have them wear a different kind of hat etc.

It didn't though, it just spat out what is basically a 1:1 copy of some Indiana Jones promo shoot. Nowhere did the prompt ask for it to look like Harrison Ford.

replies(3): >>43578572 #>>43582523 #>>43585657 #

fluidcruft ◴[04 Apr 25 05:14 UTC] No.43578572[source]▶

>>43578447 #

But... the prompt neither forbade Indiana Jones nor did it describe something that excluded Indiana Jones.

If we were playing Charades, just about anyone would have guessed you were describing Indiana Jones.

If you gave a street artist the same prompt, you'd probably get something similar unless you specified something like "... but something different than Indiana Jones".

replies(2): >>43578848 #>>43579133 #

darkwater ◴[04 Apr 25 07:00 UTC] No.43579133[source]▶

>>43578572 #

The nice thing about humans is that not every single human being has read almost all the content on the Internet. So yeah, a certain group of people would draw or think of Indiana Jones with that prompt, but not everyone. Maybe we will have different models with different training/settings that permit this kind of freedom, although I doubt it will be the commercial ones.

replies(1): >>43579245 #

dash2 ◴[04 Apr 25 07:23 UTC] No.43579245[source]▶

>>43579133 #

I mean, did anyone here read the prompt and not think “Indiana Jones”?

replies(2): >>43579930 #>>43580193 #

1. darkwater ◴[04 Apr 25 09:23 UTC] No.43579930[source]▶

>>43579245 #

Is HN the whole world? Isn't an AI model supposed to be global, since it has ingested the whole Internet?

How can you express, in terms of AI training, ignoring the existence of something that's widely present in your training data set? If you ask the same question to an 18yo girl in rural Thailand, would she draw Harrison Ford as Indiana Jones? Maybe not. Or maybe she would.

But IMO an AI model must be able to provide a more generic (unbiased?) answer when the prompt wasn't specific enough.

replies(1): >>43580301 #

2. lupusreal ◴[04 Apr 25 10:28 UTC] No.43580301[source]▶

>>43579930 (TP) #

Why should the AI be made to emulate a person naive to extant human society, tropes and customs? That would only make it harder for most people to use.

Maybe it would have some point if you are targeting users in a substantially different social context. In that case, you would design the model to be familiar with their tropes instead. So when they describe a character iconic in their culture, by a few distinguishing characteristics, it would produce that character for them. That's no different at all.
