DeepMind releases massive protein structure database

“This will be one of the most important datasets since the mapping of the Human Genome.”
Subscribe to Freethink on Substack for free
Get our favorite new stories right to your inbox every week

DeepMind, a sister company of Google, is giving the world access to a massive protein structure database — a gift that has the potential to revolutionize scientific research.

“This will be one of the most important datasets since the mapping of the Human Genome,” Ewan Birney, deputy director general of the European Molecular Biology Laboratory, which partnered with DeepMind on the database, said in a press release.

Protein structure: Proteins are molecules that are hugely important to the functioning of living organisms, including humans — practically everything we’re made of and everything our cells do is determined by our proteins.

“It’s the most significant contribution AI has made to advancing scientific knowledge to date.”

Demis Hassabis

Every protein is made up of a long string of hundreds or even thousands of chemical compounds called amino acids, and the way that ribbon folds on itself determines the protein’s function.

Once scientists know a protein’s 3D structure, they know how it interacts with everything else, and they can start exploring ways to use the molecule to develop drugs, study diseases, design energy systems, and more.

The challenge: While it is possible for scientists to see protein structure, it isn’t easy — the standard method involves x-ray crystallography, which is about as expensive and complicated as it sounds.

The process isn’t fast, either. Determining a single protein structure can take anywhere from weeks to months, and after decades of work, scientists have only deciphered about 17% of the human body’s 20,000 proteins, known collectively as the proteome.

Christmas in July: In an attempt to solve science’s protein folding problem, DeepMind created ​​AlphaFold, an AI that can predict a protein structure just based on its amino acids with a high level of accuracy in just a day or two.

Now, the company has announced that it’s making a database of the AI’s predictions freely available online. 

Not only does this database contain structure predictions for all 20,000 proteins in the human body, it also includes 330,000 other proteins found in 20 organisms regularly used for scientific research, including mice, zebrafish, and fruit flies.

There’s more to come, too: DeepMind expects to release at least 100 million more protein structure predictions in the next few months. At that point, the database will include every protein known to science.

“We believe this represents the most significant contribution AI has made to advancing scientific knowledge to date, and is a great illustration of the sorts of benefits AI can bring to society,” DeepMind Founder and CEO Demis Hassabis said in the press release.

We’d love to hear from you! If you have a comment about this article or if you have a tip for a future Freethink story, please email us at [email protected].

Subscribe to Freethink on Substack for free
Get our favorite new stories right to your inbox every week
Related
No, AI won’t take all the jobs. Here’s why.
When you consider the mechanics of integrating AI into the job market, the idea that it will take all our jobs quickly falls apart.
AI doomerism isn’t new. Meet the original alarmist: Norbert Wiener
Decades before Geoffrey Hinton and Eliezer Yudkowsky raised alarms, the computer scientist warned AI could steal jobs and outsmart humans.
Ancient Olympians wouldn’t qualify for today’s Games
Across history, the human body has been reshaped by discipline, medicine, and now technology — each era redefining peak performance.
A tragedy, a lawsuit, and the birth of an AI moral panic
A lawsuit claiming an AI chatbot caused a teen’s suicide risks sparking a new moral panic, echoing past fears built on distorted evidence.
Why AI gets stuck in infinite loops — but conscious minds don’t
Anil Seth suggests the difference is that living beings are rooted in time and entropy, a grounding that may be essential for consciousness.
Up Next
cofounder matchmaking
Subscribe to Freethink for more great stories