Google DeepMind’s Groundbreaking AI for Protein Structure Can Now Model DNA

Google spent much of the past year hustling to build its Gemini chatbot to counter ChatGPT, pitching it as a multifunctional AI assistant that can help with work tasks or the digital chores of personal life. More quietly, the company has been working to enhance a more specialized artificial intelligence tool that is already a must-have for some scientists.

AlphaFold, software developed by Google’s DeepMind AI unit to predict the 3D structure of proteins, has received a significant upgrade. It can now model other molecules of biological importance, including DNA, and the interactions between antibodies produced by the immune system and the molecules of disease organisms. DeepMind added those new capabilities to AlphaFold 3 in part through borrowing techniques from AI image generators.

“This is a big advance for us,” Demis Hassabis, CEO of Google DeepMind, told WIRED ahead of Wednesday’s publication of a paper on AlphaFold 3 in the science journal Nature. “This is exactly what you need for drug discovery: You need to see how a small molecule is going to bind to a drug, how strongly, and also what else it might bind to.”

AlphaFold 3 can model large molecules such as DNA and RNA, which carry genetic code, but also much smaller entities, including metal ions. It can predict with high accuracy how these different molecules will interact with one another, Google’s research paper claims.

The software was developed by Google DeepMind and Isomorphic labs, a sibling company under parent Alphabet working on AI for biotech that is also led by Hassabis. In January, Isomorphic Labs announced that it would work with Eli Lilly and Novartis on drug development.

AlphaFold 3 will be made available via the cloud for outside researchers to access for free, but DeepMind is not releasing the software as open source the way it did for earlier versions of AlphaFold. John Jumper, who leads the Google DeepMind team working on the software, says it could help provide a deeper understanding of how proteins interact and work with DNA inside the body. “How do proteins respond to DNA damage; how do they find, repair it?” Jumper says. “We can start to answer these questions.”

Understanding protein structures used to require painstaking work using electron microscopes and a technique called x-ray crystallography. Several years ago, academic research groups began testing whether deep learning, the technique at the heart of many recent AI advances, could predict the shape of proteins simply from their constituent amino acids, by learning from structures that had been experimentally verified.

In 2018, Google DeepMind revealed it was working on AI software called AlphaFold to accurately predict the shape of proteins. In 2020, AlphaFold 2 produced results accurate enough to set off a storm of excitement in molecular biology. A year later, the company released an open source version of AlphaFold for anyone to use, along with 350,000 predicted protein structures, including for almost every protein known to exist in the human body. In 2022 the company released more than 2 million protein structures.

Most PopularGearThe Top New Features Coming to Apple’s iOS 18 and iPadOS 18By Julian ChokkattuCultureConfessions of a Hinge Power UserBy Jason ParhamSecurityWhat You Need to Know About Grok AI and Your PrivacyBy Kate O'FlahertyGearHow Do You Solve a Problem Like Polestar?By Carlton Reid

The latest AlphaFold’s ability to model different proteins was improved in part through an algorithm called a diffusion model that helps AI image generators like Dall-E and Midjourney create weird and sometimes photo-real imagery. The diffusion model inside AlphaFold 3 sharpens the molecular structures the software generates. The diffusion model is able to generate plausible protein structures based on patterns it picked up from analyzing a collection of verified protein structures, much as an image generator learns from real photographs how to render realistic-looking snapshots.

AlphaFold 3 is not perfect, though, and offers a color-coded confidence scale for its predictions. Areas of a protein structure colored blue indicate high confidence, while red areas show less certainty.

David Baker, a professor at the University of Washington who leads a group working on techniques for protein design, has competed with AlphaFold. In 2021, before DeepMind open sourced its creation, his team released an independent protein-structure prediction inspired by AlphaFold. His own lab recently released a diffusion model to help model a wider range of molecular structures, but he concedes that AlphaFold 3 is more capable. “The structure prediction performance of AlphaFold 3 is very impressive,” Baker says.

Baker adds that it is a shame that the source code for AlphaFold 3 has not been released to the scientific community.

Hassabis, who leads all of Alphabet’s AI initiatives, has long taken a special interest in the potential for AI to accelerate scientific research. But he says the latest techniques being developed for AlphaFold, a highly specialized AI system, could prove useful for building more general systems that aim to exceed human capabilities on many dimensions.

If AI programs like Google’s Gemini become a lot more capable over the next decade, he says, “you could imagine them using things like AlphaFold as tools, to achieve some other goal.”

About Will Knight

Check Also

The Hottest Startups in London in 2024

In the “Start-Up, Scale-Up” review report published last year, chancellor Rachel Reeves promised to make …

Leave a Reply