
Neural network can create high-res images based on a text description

As far as artificial intelligence goes, 2016 has been the year of deep learning. Brain-inspired neural networks have received massive amounts of investment in time, resources and funding — and, boy, has it ever paid off!

In a new piece of research, carried out by investigators at Rutgers University, the University of North Carolina at Charlotte, Lehigh University, and the Chinese University of Hong Kong, neural networks have been used to generate high-quality images based on nothing more detailed than basic text descriptions.

“Generating realistic images from text descriptions has many applications,” researcher Han Zhang told Digital Trends. “Previous approaches have difficulty in generating high resolution images, and their synthesized images in many cases lack details and vivid object parts. Our StackGAN for the first time generates 256 x 256 images with photo-realistic details.”

A video of the work was shared online by YouTuber Károly Zsolnai-Fehér as part of his excellent series of Two Minute Papers educational videos.

“For many years, we have trained neural networks to perform tasks like face, traffic sign, or handwriting recognition,” Zsolnai-Fehér told us. “Generally, with millions of training examples, we show the neural network how to do something, and expect them to learn these concepts, and do well on their own afterwards. This piece of work is completely different: here, after learning the neural networks are able to create something completely new — such as synthesizing new, photorealistic images from a piece of text we have written. This opens up a world of possibilities, and I am super-excited to see where researchers take this concept in the future.”

While there have certainly been examples of computational creativity before, ranging from MIT’s Nightmare Machine to projects that can generate predictive video simply by looking at a still image, this is nonetheless an intriguing piece of work. It’s also fascinating because StackGAN’s two-stage method, which first draws a rough low-resolution image and then refines it into the final 256 x 256 result, looks, to our way of thinking, a whole lot like the way artists will sketch out a piece of work, and then do a second pass to add detail.

We may still be some way from replacing human illustrators with robots, but this is an exciting leap forward.

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…