Google is winning on every AI front

(www.thealgorithmicbridge.com)

1005 points vinhnx | 1 comments | 12 Apr 25 03:58 UTC | HN request time: 0.197s | source

Show context

ruuda ◴[12 Apr 25 06:15 UTC] No.43661815[source]▶

I'm trying Imagen 3 to add pictures to a presentation in Google Slides, and it's making such basic mistakes that I thought image models weren't making any more by now. I tried for half an hour to prompt it into generating an illustration of a Thinkpad facing with the back to the viewer, so the keyboard is not visible. It couldn't do it, it would always make the keyboard face towards the viewer. Or you ask for an illustration of an animal pointing a finger, and it gives it an additional arm. Meanwhile you ask OpenAI to ghiblify a picture while changing the setting and adding 5 other things, and it absolutely nails it.

replies(3): >>43661826 #>>43661862 #>>43662012 #

boznz ◴[12 Apr 25 06:27 UTC] No.43661862[source]▶

>>43661815 #

I thought it was just me. A few hours ago Gemini told me "As a language model, I'm not able to assist you with that." This was after generating an image a few minutes earlier. I think the copy/paste buffer pulled in some old source files I had attached a few days earlier (no idea how) because under the "sources and related content" it now showed two files Gemini is obviously calling its brother imagen for offloading the image generation, which is smart I guess if it works

replies(1): >>43662261 #

Hikikomori ◴[12 Apr 25 07:43 UTC] No.43662261[source]▶

>>43661862 #

Can Gemini 2.5 pro generate images? It only describes them for me.

replies(1): >>43662302 #

1. boznz ◴[12 Apr 25 07:51 UTC] No.43662302[source]▶

>>43662261 #

I'm using 2.0 Flash and if I ask it, it says yes it can, but it does seem hit and miss as above.

↑