←back to thread

1503 points participant3 | 1 comments | | HN request time: 0s | source
Show context
dcow ◴[] No.43578203[source]
Either, (1) LLMs are just super lossy compress/decompress machines and we humans find fascination in the loss that happens at decompression time, at times ascribing creativity and agency to it. Status quo copyright is a concern as we reduce the amount of lossiness, because at some point someone can claim that an output is close enough to the original to constitute infringement. AI companies should probably license all their training data until we sort the mess out.

Or, (2) LLMs are creative and do have agency, and feeding them bland prompts doesn't get their juices flowing. Copyright isn't a concern, the model just regurgitated a cheap likeness of Indiana Jones as Harrison Ford the world has seen ad nauseam. You'd probably do the same thing if someone prompted you the same way, you lazy energy conserving organism you.

In any case, perhaps the idea "cheap prompts yield cheap outputs" holds true. You're asking the model respond to the entirely uninspired phrase: "an image of an archeologist adventurer who wears a hat and uses a bullwhip". It's not surprising to me that the model outputs a generic pop-culture-shaped image that looks uncannily like the most iconic and popular rendition of the idea: Harrison Ford.

If you look at the type of prompts our new generation of prompt artists are using over in communities like Midjourney, a cheap generic sentence doesn't cut it.

replies(3): >>43578239 #>>43578351 #>>43578493 #
sothatsit ◴[] No.43578239[source]
You don't even need to add much more to the prompts. Just a few words, and it changes the characters you get. It won't always produce something good, but at least we have a lot of control over what it produces. Examples:

"An image of an Indian female archeologist adventurer who wears a hat and uses a bullwhip" (https://sora.com/g/gen_01jqzet1p8fjaa808bmqnvf7rk)

"An image of a fat Russian archeologist adventurer who wears a hat and uses a bullwhip" (https://sora.com/g/gen_01jqzfk727erer98a6yexafe70)

"An image of a skeletal archeologist adventurer who wears a hat and uses a bullwhip" (https://sora.com/g/gen_01jqzfnaz6fgqvgwqw8w4ntf6p)

Or, give ChatGPT a starting image. (https://sora.com/g/gen_01jqzf7vdweg4v5198aqfynjym)

And by further remixing the images ChatGPT produces, you can get your images to be even more unique. (https://sora.com/g/gen_01jqzfzmbze0wa310m42f8j5yw)

replies(3): >>43578299 #>>43578315 #>>43578578 #
otabdeveloper4 ◴[] No.43578578[source]
Archeologists don't actually wear fedora hats.

And the stereotypical meme "archeologist hat" is the pith helmet.

replies(1): >>43578652 #
sothatsit ◴[] No.43578652[source]
Here, I asked ChatGPT to generate an image using a pith helmet for you: https://sora.com/g/gen_01jqzmab6hfxxtrt3atd0jgpg7

You can just ask for whatever changes you want.

replies(1): >>43580840 #
otabdeveloper4 ◴[] No.43580840[source]
> You can just ask for whatever changes you want.

Yes, as long as what you're asking for is Indiana Jones.

replies(2): >>43583616 #>>43585533 #
1. sothatsit ◴[] No.43585533[source]
You just have to write the prompt in a way that is not so obviously pointing to Indiana Jones, and you get something that is not Indiana Jones...

"A nerdy archaeologist adventurer in a pith helmet, with glasses and a backpack, stumbling his way through a green overgrown abandoned temple. Vines reach for his heels" (https://sora.com/g/gen_01jr0yd810e8xsenp85xy2g47f)

"A nerdy archaeologist adventurer in a pith helmet, with glasses and a backpack, nervously sneaking her way through a green overgrown abandoned temple. She is wearing pink khaki pants, and a singlet" (https://sora.com/g/gen_01jr0z837jecpa770v009bs1m3)

Is it as creative as good humans? Not at all. It definitely falls into tropes readily. But we can still inject novel ideas into our prompts for the AI, and get unique results. Especially if you draw sketches and provide those to the AI to work from.