←back to thread

Gemini 2.5 Flash Image

(developers.googleblog.com)
1092 points meetpateltech | 1 comments | | HN request time: 0s | source
Show context
vunderba ◴[] No.45029113[source]
I've updated the GenAI Image comparison site (which focuses heavily on strict text-to-image prompt adherence) to reflect the new Google Gemini 2.5 Flash model (aka nano-banana).

https://genai-showdown.specr.net

This model gets 8 of the 12 prompts correct and easily comes within striking distance of the best-in-class models Imagen and gpt-image-1 and is a significant upgrade over the old Gemini Flash 2.0 model. The reigning champ, gpt-image-1, only manages to edge out Flash 2.5 on the maze and 9-pointed star.

What's honestly most astonishing to me is how long gpt-image-1 has remained at the top of the class - closing in on half a year which is basically a lifetime in this field. Though fair warning, gpt-image-1 is borderline useless as an "editor" since it almost always changes the whole image instead of doing localized inpainting-style edits like Kontext, Qwen, or Nano-Banana.

Comparison of gpt-image-1, flash, and imagen.

https://genai-showdown.specr.net?models=OPENAI_4O%2CIMAGEN_4...

replies(7): >>45030193 #>>45030194 #>>45030942 #>>45032937 #>>45033671 #>>45036899 #>>45041270 #
jay_kyburz ◴[] No.45032937[source]
I really like your site.

Do you know of any similar sites that that compares how well the various models can adhere to a style guide? Perhaps you could add this?

I.e. pride the model with a collection of drawings in a single style, then follow prompts and generate images in the same style?

For example if you wanted to illustrate a book, and have all the illustrations look like they were from the same artists.

replies(1): >>45043281 #
1. vunderba ◴[] No.45043281[source]
Hi Jay, unfortunately I haven't see a site like that but being able to rank models in terms of "style adherence" but it would be a nice feature.

It's basically a necessity if you're working on something like a game or comic where you need consistency around characters, sprites, etc.