pizzathyme 3 days ago

The image generation improvement with o4-mini is incredible. Testing it out today, this is a step change in editing specificity even from the ChatGPT 4o LLM image integration just a few weeks ago (which was already a step change). I'm able to ask for surgical edits, and they are done correctly.

There isn't a numerical benchmark for this that people seem to be tracking but this opens up production-ready image use cases. This was worth a new release.

3
mchusma 3 days ago

Thanks for sharing that. that was more interesting then their demo. I tried it and it was pretty good! I have felt that the ability to iterate from images blocked this from any real production use I had. This may be good enough now.

Example of edits (not quite surgical but good): https://chatgpt.com/share/68001b02-9b4c-8012-a339-73525b8246...

ec109685 3 days ago

I don’t know if they let you share the actual images when sharing a chat. For me, they are blank.

ilaksh 3 days ago

wait, o4-mini outputs images? What I thought I saw was the ability to do a tool call to zoom in on an image.

Are you sure that's not 4o?

AaronAPU 3 days ago

I’m generating logo designs for merch via o4-mini-high and they are pretty good. Good text and comprehending my instructions.

ilaksh 3 days ago

It's using the new gpt-4o, a version that's not in the API

ilaksh 3 days ago

in the api or on the website?

Agentus 3 days ago

also another addition: i previously tried to upload an image for chatgpt to edit and it was incapable under the previous model i tried. Now its able to change uploaded images using o4mini.