DALL-E 3 could take AI image generation to the next level


OpenAI might be preparing the next version of its DALL-E AI text-to-image generator, with details from a series of alpha tests now leaked to the public, according to The Decoder.

An anonymous leaker on Discord shared details about his experience with an upcoming OpenAI image model that is being referred to as DALL-E 3. He first appeared in May, telling the interest-based Discord channel that he was part of an OpenAI alpha test of a new AI image model. He shared the images he generated at the time.


The May alpha test version could generate images in multiple aspect ratios within the image model itself. YouTuber MattVidPro AI then showcased several images that were generated in a 16:9 aspect ratio. This version also showed the model’s prowess at rendering high-quality text, which continues to be a pain point for rival models, even for top generators such as Stable Diffusion and Midjourney.

Examples included text melded into a brick wall, a neon sign of words, a billboard in a city, a cake decoration, and a name etched into a mountain. The model also appears to retain DALL-E’s strength at generating people. One such image displayed a woman eating spaghetti at a party from a fisheye point of view.

The leaker returned to the Discord channel in mid-July with more details and new images. He claimed to be part of a “closed alpha” test that included approximately 400 participants. He added that he was invited to the trial via email and had also taken part in testing the original DALL-E and DALL-E 2. This is what led to the conclusion that the alpha test might be for DALL-E 3, though that has not been confirmed.

The model was updated considerably between May and July. The leaker showcased this by sharing images generated from the same prompt, showing how much more capable DALL-E 3 has become over time. The prompt reads: “A painting of a pink jester giving a high five to a panda while in a cycling competition. The bikes are made of cheese and the ground is very muddy. They are driving in a foggy forest. The panda is angry.”

The May alpha produces a general scene that hits most of the points of the prompt. There is a little distortion where the hands connect, and the wheels of the bikes are yellow rather than made of cheese. The July alpha, however, is far more detailed, with the pink jester and the panda clearly high-fiving and the bicycle wheels made of cheese in several generations.

Meanwhile, in Midjourney, the jester is missing from the scene and the pandas are on motorcycles instead of bicycles. There are roads instead of mud, and the pandas are happy instead of angry.

A host of DALL-E 3 July alpha image examples show the potential of the model. However, because the alpha test is uncensored, the leaker noted that it also has the potential to generate scenes of “violence and nudity or copyrighted material such as company logos.”

Some examples include a gory anime girl, a Game of Thrones character, a Grand Theft Auto V cover, a zombie Jesus eating a Subway sandwich, also suggesting mild gore, and Shrek being dug up from an archeological dig, among others.

MattVidPro AI noted that the image model generates images as if they’re supposed to be in a specific style.

DALL-E 2 launched in April 2022, but access was heavily restricted through a waitlist due to its popularity and concerns about ethics and safety. The AI image generator became accessible to the public in September 2022.

Fionna Agomuoh
Fionna Agomuoh is a Computing Writer at Digital Trends. She covers a range of topics in the computing space, including…