We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
cover of episode Can AI Crack The Biology Code?

Can AI Crack The Biology Code?

2025/6/3
logo of podcast Short Wave

Short Wave

AI Deep Dive AI Chapters Transcript
People
B
Burleigh McCoy
E
Emily Kwong
Topics
Burleigh McCoy: 我认为人工智能已经彻底改变了蛋白质科学领域。我们现在可以以前所未有的速度和准确性预测蛋白质的结构,这对于理解生物学和解决各种问题至关重要。AlphaFold等工具的出现,使我们能够以前无法想象的方式探索蛋白质的世界,并设计具有全新功能的蛋白质。 Emily Kwong: 我对人工智能在蛋白质科学中的应用感到非常兴奋。它不仅加速了科学研究的进程,而且为解决人类面临的重大挑战提供了新的可能性。从疾病治疗到气候变化,人工智能驱动的蛋白质科学有望为我们带来革命性的解决方案。 Julian Bergeron: 我亲身经历了AlphaFold带来的变革。以前,我的实验室花费数月甚至数年才能确定一个蛋白质的结构,但现在只需几个小时。这使我们能够更快地推进我们的研究,并取得更多的突破。AlphaFold对我的研究产生了巨大的影响。 Pushmeet Kohli: 我认为开放获取对于科学的进步至关重要。通过使AlphaFold等工具易于访问,我们能够加速科学发现的进程,并使更多的人能够参与到科学研究中来。我们希望通过人工智能来加速和推进科学的发展。 David Baker: 我相信蛋白质设计是解决人类面临的许多问题的关键。通过设计具有全新功能的蛋白质,我们可以创造出新的药物、新的材料和新的技术。人工智能为我们提供了前所未有的能力来设计蛋白质,并为世界带来积极的改变。

Deep Dive

Chapters
The accuracy of AI protein prediction is crucial for scientific research. While AlphaFold2 isn't perfect, its confidence scores help scientists assess the reliability of its structural predictions. The model's user-friendliness and open access have greatly accelerated research across various fields.
  • AlphaFold2's accuracy isn't 100%
  • It provides confidence scores for its predictions
  • User-friendly and open access, accelerating research

Shownotes Transcript

Translations:
中文

This message comes from ShipBob. Running a global e-commerce business, you have a lot on your plate. ShipBob can pick, pack, and ship your orders from their global fulfillment network, so you get time back to grow your business. Go to ShipBob.com for a free quote. You're listening to Shortwave from NPR.

Hey, hey, short ravers. Emily Kwong here with producer Burleigh McCoy. What's up, Burleigh? Hey, Emily. Hello. What do you have for us today? Okay, so Emily, today I want to dig into how AI has shaken up the field of protein science, as in the fundamental building blocks of life, proteins. I've heard of them. Yeah. I mean, this is like what you studied back in your scientist days. Yes, yes. I love proteins. Oh, yeah.

We love that you love them. How has AI moved the needle in this field, though? Well, scientists have used it to dig into a problem that protein scientists have struggled with for more than 60 years. And that is, what do these building blocks, of which there are millions, look like? Like their shape? Like their shape, yeah, exactly. And why is that so important? Well, the ability of a protein to do its specific job, so like carry oxygen through your body or turn light into sugar,

That relies wholly on its unique, complicated shape. So to understand how it works, you need to know its shape. But why can't scientists just run an experiment to determine the shape? They can for some proteins, but those experiments can take years and years. And Emily, that's because a scientist essentially needs to take the equivalent of a molecular photo of the protein to map its complicated shape.

But getting the protein to cooperate to get that photo, so like to hold still, for example, without falling apart, that can be super tricky. And it could take a grad student's entire PhD program to figure out a single protein. And other proteins were just abandoned because they would never cooperate. Proteins sound difficult, honestly. So the challenge is how do you figure out a protein's shape?

without running these super tedious experiments. Is this where AI comes in? Yeah, and to give you a sense of kind of how AI has changed the protein game, there's this protein competition that scientists run every other year. Get out, a protein competition, okay.

Yeah. And they've run it for the past 30 years where groups will basically compete on who can accurately guess the most protein shapes. It's like nerd central for sure. We love. And for most of that 30-year history, participants have really only made incremental progress. But in 2020, Google DeepMind used AlphaFold2, that's its AI protein prediction model, and Emily.

AlphaFold 2 blew the other competition out of the water completely. Wow. Okay. Game changer. And now the Google DeepMind team has taken this AI tool to the next level by expanding it beyond proteins.

So today on the show, how scientists have taken a huge step to understanding the building blocks of life using AI. Plus, how other researchers are using the tech to design brand new proteins, ones never before seen in nature. And how AI could help us solve the biggest problems we face today, from disease to climate. You are listening to Shortwave, the science podcast from NPR.

This comes from NPR sponsor, Sierra. No matter the industry, how businesses connect with customers defines their brand. And today, connection starts with a conversation. Sierra is the AI platform for businesses that want to provide better, more human customer experiences. With Sierra, your AI agent solves problems fast. No canned responses, no endless hold music, no frustration.

Just better customer experiences built on Sierra. Visit sierra.ai to learn more. This message comes from BetterHelp. June is Men's Mental Health Month, and every year, 6 million men in the U.S. suffer from depression. If you're feeling overwhelmed, the strongest thing you can do is ask for help, and BetterHelp can make it easy.

Take a short online quiz and connect from home with a qualified therapist. Visit BetterHelp.com slash NPR today to get 10% off your first month. That's BetterHelp, H-E-L-P dot com slash NPR.

This message comes from Square. Your favorite neighborhood spots are using Square to do everything from covering cash flow gaps to expanding to new locations. Wherever your business is growing, Square meets you there. Go to square.com slash go slash NPR to learn more.

Okay, Burleigh, so scientists, it seems, have been trying to figure out the complicated shapes of proteins for decades to better understand how they work. Why has this been such a complicated thing to figure out?

Well, the short answer, Emily, is that there are so many theoretical ways a single protein could fold that it's a big problem to solve. So if you unfolded a protein, it would kind of look like a bunch of beads on a long string. Those beads are little molecules called amino acids. Oh, I remember this from biology. There are like 20 types of amino acids. Yep. Each one is a little different. Right. So each one has a slightly different shape, and that kind of dictates...

dictates how that part of the string can be folded up. Because proteins often have a hundred or more amino acids, you can see how imagining all the ways it could fold would get complicated. Yeah, it just sounds like thousands of different shapes or what, hundreds of thousands of different shapes. Okay, try billions of trillions, Emily. Like there are theoretically more ways for one single protein to fold than there are stars in our night sky.

This sounds like a glorious nightmare. Right? I'm so curious. Okay, so you said that AI has helped us make some leaps and bounds towards a solution. How does this technology work? So this AlphaFold model is a type of AI called a deep learning program, which is this huge network of data processing points called nodes. And the purpose of this network is to learn and then make predictions based on what it's learned.

In AlphaFold's case and other models like it, it learns about proteins from a huge collection of protein structures that scientists have been building on for decades from their experimental data. Okay. So the idea is that after these models use all of that carefully gathered experimental data to learn, they can then predict the shapes of proteins they do not know yet. Exactly. Okay.

Okay, and going back to the protein competition in 2020, how did AlphaFold blow away the competition? So they essentially changed the whole architecture of their model. They had been using AI before, but remember the beads on a string analogy? If amino acids are the beads, even if one bead is far from another on the string, when it all folds up, they could be right next to each other. So with AlphaFold 2, the model looked at distances between all the different amino acids and

and previous knowledge from solved protein structures. Awesome. And the accuracy and speed of the predictions went way up. Okay. I'm assuming that made a huge difference for scientists everywhere studying proteins. Totally. Julian Bergeron, a structural biologist at King's College London, is one of them. He studies the tail-like appendage that propels bacteria. So it's called a flagellum, and it's pretty complicated. It's this huge assembly. ♪

So it's longer than the bacterial cell itself. It consists of 20 to 25 different proteins, but many of them have hundreds of thousands of copies of that protein. And these huge propeller machines are what give some bacteria the ability to make you sick or build plaque on your teeth. So Julien's lab is trying to figure out how these giant machines work.

what their pieces look like, and how it all fits together. And so when the AlphaFold 2 model came out, he just had to try it. And I input a sequence, and then a few hours later I had the model, and I was like, oh my god, this just did it. And we'd been struggling with that problem for months, if not years. And all of a sudden I messaged my lab and I said, we model everything. Yeah.

And we've had dozens of projects that immediately progressed thanks to this. Okay, so it sounds like overnight AlphaFold changed the trajectory of his lab. Yeah. But how did he know that using AlphaFold 2 would actually work? Yeah, so the accuracy is super important, right? Especially when you're basing all of your other experiments on the results.

And it's important to note that like other AI, AlphaFold2 isn't right 100% of the time. So you can't just take the results at face value. But unlike some other AI, included in the results is a score basically telling you how accurate each part of the structure is. Okay. And are others in the field using AlphaFold2?

Yeah, so this is something that actually sets AlphaFold apart from other protein prediction AI models. It's extremely user-friendly. So essentially, anyone who works on a protein or even just has a sequence of a protein can plug it in and get results. I talked to Pushmeet Kohli, vice president of research at Google DeepMind, and he told me why it was important for them to make this tool open access. The mission statement that we have for the science program at Google DeepMind

is to leverage AI to accelerate it

and advanced science. Okay, so I'm scrolling through the AlphaFold website, and I'm seeing scientists using this model for all kinds of things. They're working on malaria and cancer research, drug discovery, plastic eating enzymes. And last week, DeepMind released a new version, AlphaFold 3, which can predict the 3D structure of proteins and other kinds of biomolecules that they attach to. Why are those other biomolecules important?

Yes. So I know we talked about how much proteins are super important. I love them. But I have to admit, they rarely work alone. And...

If we actually want to know how biology works as a whole, we need to understand how proteins work with their partner molecules. So it really gives you a more detailed and more accurate picture of what is happening inside the body, where proteins are not just sort of existing in isolation. They are interacting in a very rich biological space or soup of proteins.

RNA and DNA and small molecules, and it really sheds light into those rich interactions. Now, previous versions of these protein prediction softwares would model where each amino acid was located. But in this new version, AlphaFold3,

It maps things on an even smaller level. So it models where individual atoms are. Wow. So they can predict the structure of multi-protein complexes like the bacterial flagellum or something like proteins in the blood, which attach to iron atoms. That is powerful. Okay.

What are the limits to alpha-fold predictions? Yeah, there are definitely limitations. Pushmeat says that the model works best when a protein has a single defined structure. But some proteins have more than one shape or they have sections that are kind of flimsy. Think cooked versus uncooked spaghetti.

Okay. So the model sounds like some trouble with prediction in some cases, and the results show that. Yeah. So the idea is that these results would say, hey, I'm not so confident in this area of the protein, just so users know. Oh. And another limitation is that the prediction ability depends on the amount of what's called training data available. Uh-huh. So I mentioned that there's a lot of training data for proteins, but...

Some categories have much less training data available. For example, there's much less structural data available for RNAs. Okay, so the prediction is only as good as the data. Exactly, exactly. But Emily. But Burleigh. There's another way scientists can use AI in the protein world. Okay, what's that? To generate brand new proteins.

Humans face new problems today, and we live longer. We're polluting and heating up the planet. And it's reasonable to think that if with more millions of years of evolution, that some of these problems would be solved. But we don't want to wait that long.

So the idea is that we can now create completely new proteins that solve these problems that weren't really relevant during evolution to make the world a better place. So this is David Baker. He's a biochemist and the director of the Institute for Protein Design at the University of Washington. And

And he's been working on proteins for years. He actually developed one of the earlier protein prediction models. His lab has a similar AI program to AlphaFold3. It's called RosettaFold All Atom. But his big focus is designing these brand new proteins. This sounds so futuristic.

Right. Like what kind of new proteins? So far, they've done things like design new protein antibodies, which are important for fighting infections, in this case, to fight influenza. They've made something called a switch protein that could be used as an environmental sensor. And they've also made proteins that could help store carbon, which is a huge hurdle for fighting climate change. I think really across, you know, medicine, sustainability, food,

technology, I think there's huge opportunities to transform the current ways we do things with protein design. So these predictive and generative AI models have fundamentally changed the protein science landscape.

And again, there's definitely room for improving the prediction power. But with what the field has shifted to, like in terms of prediction accuracy and design potential, I mean, it's really gotten this retired protein fanatic like missing my science days. Burleigh, thank you so much for bringing us this big, big story about the little things in life. Thanks, Emily.

This episode was produced by Rachel Carlson. It was edited by our showrunner, Rebecca Ramirez. Burleigh checked the facts. Ko Takasugi-Chernovan was the audio engineer. Special thanks to Jeff Brumfield. Beth Donovan is our senior director. And Colin Campbell is our senior vice president of podcasting strategy. I'm Emily Kwong. Thank you for listening to Shortwave from NPR.

This message comes from Capella University. At Capella, you'll learn relevant in-demand skills you can apply in the workplace right away. A different future is closer than you think with Capella University. Learn more at capella.edu. This message comes from Jackson. Let's face it, retirement planning can be confusing.

At Jackson, we're working to make retirement clear for everyone, starting with you. Our easy-to-understand resources and user-friendly digital tools help simplify your entire experience. You can have confidence in your retirement with clarity from Jackson. Seek the clarity you deserve at Jackson.com. Jackson is short for Jackson Financial Incorporated.

Jackson National Life Insurance Company, Lansing, Michigan, and Jackson National Life Insurance Company of New York, Purchase New York. This message comes from NPR sponsor, Rosetta Stone, an expert in language learning for 30 years. Right now, NPR listeners can get Rosetta Stone's lifetime membership to 25 different languages for 50% off. Learn more at rosettastone.com slash NPR.