New AI can draw pictures, inching closer to humanlike smarts

DALL-E’s offbeat images might not be perfect, but they demonstrate that AI is slowly gaining grounds toward humanlike creativity.
Sign up for the Freethink Weekly newsletter!
A collection of our favorite stories straight to your inbox

OpenAI has just introduced two new machine learning algorithms that improve computer vision and can use text cues to draw unique and often offbeat images — like a dog-walking radish wearing a tutu.

Even though it is still a long way from replacing the discerning human eye, it shows that creative AI is gaining momentum. 

What They Do

Like GPT-3 (the OpenAI text generator), DALL-E, a neural network model, aims to “think” like humans.

DALL-E’s training uses images and associated text prompts. Then, based on what it’s learned, it responds to a text prompt like “an armchair in the shape of an avocado.”

Instead of responding with words, the AI responds by creating hundreds of pictures. Then CLIP (another new neural network) ranks them to find the best few dozen. And, surprisingly, the images often appear genuine, as if a human made them.

For example, a prompt that says “storefront with that has the word openai written on it,” will generate an image like this:

Or “an armchair in the shape of an avocado” will prompt the following image:

“Last year, we were able to make substantial progress on text with GPT-3, but the thing is that the world isn’t just built on text,” Ilya Sutskever, OpenAI co-founder and chief scientist, reports to Axios. “This is a step towards the grander goal of building a neural network that can work in both images and text.”

Why It Matters

These new models are the next step toward achieving machine learning algorithms that can carry out tasks that have real-world value, while promising to show general human intelligence — sort of.

But it’s more than just a whimsical way to make cute pictures — like  “an illustration of a baby daikon radish in a tutu walking a dog,” — the machine learning algorithm’s advantage is efficiency.

Training a new model can take a lot of computer power, but Sutskever, according to Axios, says that CLIP improves existing computer vision techniques with less computational cost.

So, Are AIs Going to Take Over?

GPT-3 is the third generation of autocomplete tools designed by OpenAI. It looks for patterns in large amounts of data and then predicts what words should come after a text prompt. A simple example is if you input “fire,” it might add “truck” or “alarm.”

But OpenAI claims that GPT-3 can do more than that — even write full essays or poems.

When it was first released in June 2020, the media was buzzing about its capabilities. GPT-3 could be an important step toward a future where AI can exhibit a human-like ability to reason. But it also attracted criticism because the text it generated sometimes appeared to be unhinged from reality.

While DALL-E is one small step for artificial intelligence — inching toward achieving human creativity’s likeness — it is still far from perfect. It still needs input from a grammar expert, as poorly worded phrases result in fumbled pictures.

We’d love to hear from you! If you have a comment about this article or if you have a tip for a future Freethink story, please email us at [email protected].

Sign up for the Freethink Weekly newsletter!
A collection of our favorite stories straight to your inbox
Related
AI chatbots may ease the world’s loneliness (if they don’t make it worse)
AI chatbots may have certain advantages when roleplaying as our friends. They may also come with downsides that make our loneliness worse.
Will AI supercharge hacking — if it hasn’t already?
The future of hacking is coming at us fast, and it isn’t clear yet whether AI will help attackers and defenders more.
No, LLMs still can’t reason like humans. This simple test reveals why.
Most AI models are incredible at taking tests but easily bamboozled by basic reasoning. “Simple Bench” shows us why.
The future of fertility, from artificial wombs to AI-assisted IVF
A look back at the history of infertility treatments and ahead to the tech that could change everything we thought we knew about reproduction.
“Model collapse” threatens to kill progress on generative AIs
Generative AIs start churning out nonsense when trained on synthetic data — a problem that could put a ceiling on their ability to improve.
Up Next
smart vaccine device
Subscribe to Freethink for more great stories