OpenAI launched a new image generation AI model on Tuesday, dubbed ChatGPT Images 2.0. This model can generate more than one image from a single prompt, like an entire study booklet, as well as output text, including in non-English languages like Chinese and Hindi. This launch is available globally for ChatGPT and Codex users, with a more powerful version available for paying subscribers.
When any major AI company releases a new image model, it can revive interest and boost usage, especially if social media users adopt a meme-able trend, transforming images of themselves. Last year, Google’s launch of the Nano Banana model was a major moment for the company, especially when users started posting hyperrealistic collectible figurines of themselves online. Earlier this year, ChatGPT Images made waves on social media as users shared AI-generated caricatures.
What’s Different?
Because the new model can tap into ChatGPT’s “reasoning” capabilities, Images 2.0 can search the web for recent information and generate more than one image at a time. In essence, the bot can use more steps to output more thorough generations from a single prompt. Images 2.0 also has a more recent knowledge cutoff date: December 2025.
This also means that outputs from the new model are more granular. For example, I generated an infographic with San Francisco’s weather forecast for the next day, as well as activities worth doing. The image ChatGPT generated included accurate weather details for the rainy day, along with accurate-looking drawings of the Ferry Building, Castro Theater, Painted Ladies houses, and Transamerica Pyramid.
Additionally, Images 2.0 is more customizable for users who want unique aspect ratios for image outputs. The new model can generate images ranging from 3:1 wide to 1:3 tall, and users can adjust the image’s size as part of their prompt to the AI tool.
First Impressions
After a few hours of generating images with the new model, I was generally impressed with the text rendering capabilities, in English at least. Not that long ago, image outputs featuring text, from any of the major models, often included numerous malformed characters or words with errant extra letters. ChatGPT struggled to label images accurately two years ago, so the cleaner, more complex outputs from Images 2.0 are a sign of continued improvement. Google has also focused on improving image outputs featuring text in its recent iterations of Nano Banana.
AI-GENERATED BY REECE ROGERS

