
Google is winning on every AI front

(www.thealgorithmicbridge.com)
993 points vinhnx | 1 comments
ruuda ◴[] No.43661815[source]
I'm trying Imagen 3 to add pictures to a presentation in Google Slides, and it's making the kind of basic mistakes I thought image models no longer made. I spent half an hour trying to prompt it into generating an illustration of a ThinkPad with its back to the viewer, so the keyboard is not visible. It couldn't do it; it always made the keyboard face the viewer. Or you ask for an illustration of an animal pointing a finger, and it gives the animal an additional arm. Meanwhile you ask OpenAI to ghiblify a picture while changing the setting and adding 5 other things, and it absolutely nails it.
replies(3): >>43661826 #>>43661862 #>>43662012 #
boznz ◴[] No.43661862[source]
I thought it was just me. A few hours ago Gemini told me "As a language model, I'm not able to assist you with that." This was after it had generated an image a few minutes earlier. I think the copy/paste buffer pulled in some old source files I had attached a few days earlier (no idea how), because under "sources and related content" it now showed two files. Gemini is obviously calling its brother Imagen to offload the image generation, which is smart, I guess, if it works.
replies(1): >>43662261 #
Hikikomori ◴[] No.43662261[source]
Can Gemini 2.5 pro generate images? It only describes them for me.
replies(1): >>43662302 #
1. boznz ◴[] No.43662302{3}[source]
I'm using 2.0 Flash, and if I ask it, it says yes, it can, but it does seem hit-and-miss, as described above.