We're sunsetting PodQuest on 2025-07-28. Thank you for your support!

Using AI to Understand the Thoughts of the Dead

2024/11/22

Science Quickly

AI Deep Dive AI Insights AI Chapters Transcript

People

Michael Varnum

Rachel Feltman

Topics

Rachel Feltman: 探讨了利用AI探索过去，甚至与已灭绝文明的成员‘对话’的可能性，并采访了该领域专家的观点。 Michael Varnum: 指出了研究人员在了解古代文明成员心理和行为方面面临的挑战，即无法直接获取数据。传统方法只能通过间接途径（如档案数据和文化产品）来推断过去人们的价值观和情感，这不够直接。文章提出了一种新方法：利用像ChatGPT这样的程序模拟现代参与者，并令人惊讶地复制了行为科学中许多经典效应，这启发了研究人员尝试用古代文本训练模型来模拟古代人的行为。目前，研究人员只是尝试用历史文本训练大型语言模型，并测试其对特定知识的理解，但尚未将这些模型用于现代实验或调查。 Michael Varnum进一步讨论了历史文本数据中存在的偏差问题，因为在大部分人类历史上，识字率很低，且只有少数人的文字记录流传至今，这些记录往往偏向精英阶层。他提出了几种解决方法，例如根据现代人口的社会阶层影响进行模型微调，或对模型的回应进行加权处理，以及将历史大型语言模型的结果与其他历史记录和分析进行比较，以验证其可靠性。他还提到，当前大型语言模型更贴近西方和英语国家人口的心理特征，这与训练数据中这些社会群体被过度代表有关，但也表明使用不同语料库可以捕捉到不同文化的特征。 Michael Varnum还展望了这项技术的未来应用，例如帮助研究人员更深入地了解人类心理的普遍性和进化基础，通过研究古代社会来验证某些心理现象的普遍性，例如男女在性策略上的差异。他认为，AI不仅可以模拟参与者或编码数据，还可以生成新的研究假设，未来AI可能在社会心理学研究中扮演更重要的角色，甚至可能生成研究想法，这可能会改变研究人员的工作方式。 Rachel Feltman: 提出了关于历史文本数据偏差的问题，以及如何利用AI技术来解决这些偏差，并探讨了这项技术在研究弱势群体历史方面的应用潜力。

Deep Dive

Key Insights

Why is there a need to use AI to simulate ancient participants in social psychology studies?

There is a need because we lack direct ways to study the mentality and behavior of people from ancient civilizations. Traditional methods, such as archival data and cultural products, are indirect and limited. AI can potentially provide more direct insights by simulating ancient participants based on historical texts.

What kind of experiments have already been conducted using AI to simulate modern participants?

Experiments have replicated 70 different large-scale survey experiments using simulated participants from ChatGPT, with results correlating at about 0.9 with real human data. This suggests that AI models can capture significant aspects of human psychology.

Why is using historical texts to train AI models challenging?

Using historical texts is challenging because they are biased towards the perspectives of literate, elite, and educated individuals from the past. This can lead to a skewed understanding of ancient populations. Researchers need to account for these biases by using additional historical records and weighting responses.

What does the acronym WEIRD stand for in the context of behavioral science?

WEIRD stands for Western, Educated, Industrialized, Rich, and Democratic. It refers to the fact that the majority of participants in behavioral science come from these types of societies, which are not representative of the global population.

What are some potential dream use cases for AI in studying ancient psychology?

Potential use cases include testing the universality of certain psychological traits by extending the temporal window back to ancient societies. For example, researchers could examine differences in preferred sexual strategies between men and women in societies that lived hundreds or thousands of years ago.

How might AI be used beyond simulating participants in social psychology research?

AI could be used to generate new hypotheses for social psychology research. A recent study found that GPT-4 generated hypotheses that were considered more compelling and probably true by social psychologists, suggesting that AI could become a valuable tool for generating research ideas.

Shownotes Transcript

Translations:

中文

This episode is brought to you by Amazon Prime. There's nothing sweeter than bacon cookies during the holidays. With Prime, I get all my ingredients delivered right to my door, fast and free. No last minute store trips needed. And of course, I blast my favorite holiday playlist on Amazon Music. It's the ultimate soundtrack for creating unforgettable memories. From streaming to shopping, it's on Prime. Visit Amazon.com slash Prime to get more out of whatever you're into.

There's been a lot of hype around artificial intelligence lately. Some companies want us to believe that machine learning is powerful enough to practically tell us the future.

But what about using AI to explore the past and even talk to members of long-dead civilizations? For Scientific American Science Quickly, I'm Rachel Feltman. My guest today is Michael Varnum, social psychology area head and associate professor at Arizona State University. He's one of the co-authors of a recent opinion paper that proposes a somewhat spooky new use for tools like ChatGPT.

Michael, thanks so much for joining us today. My pleasure. Thanks for having me on. So you have this new paper kind of

a ghost in the machine sort of vibe. Tell us a little bit about the problem you're setting out to solve. Yeah. So I've been interested in thinking about cultural change for some time, and I've done a lot of work in that area. But we run into some limitations when we're trying to get insight into the mentality or behavior of folks who are no longer with us. We obviously don't have time machines, right? We can't

bring the dead back and ask them to participate in our experiments or run them through economic games. And so typically what folks like me have to do is use rather indirect proxies, right? Maybe we get archival data on things like marriage and divorce or crimes, or we look at cultural products like the language folks used in books and we try to

infer what kinds of values people might have had or what kinds of feelings they might have had towards different kinds of groups. But that's all kind of indirect. What would be amazing is if we could actually get the kind of data we get from folks today, just from say, you know, ancient Romans or Vikings or medieval Persians.

And one thing that really excited me in the past year or two is folks started to realize that you could simulate at least modern participants with programs like ChatGPT and surprisingly, and I think excitingly, replicate a whole host of classic effects in the behavioral sciences. And so we thought, huh.

If we're able to do this based on these models created from the writings of modern people, maybe we could do this based on the writings of ancient people. This might open up a whole new world of possibilities.

Yeah. Could you tell me a little bit more about some of those experiments that have replicated psychological phenomenon using language learning models? One of the more powerful ones set out to replicate 70 different large scale survey experiments with simulated participants from ChatGPT.

And they found that the results correlated at about 0.9 from what folks had observed with real human beings. And of course, this isn't what anyone designed Lama or ChatGPT to do, but in the process of making these models that are able to converse with us in very natural kinds of ways,

they seem to have captured quite a bit of human psychology. And you mentioned in the paper that some folks are already using historical texts to train large language models. So what kind of stuff are they doing so far? So far, these are just baby steps. Folks are just trying to see, okay, if we train a model based on medieval European texts, what's its understanding of the solar system or medicine or biology?

And they have an incorrect number of planets. They believe in the four humors of the body. So, so far, to my knowledge, no one has run these kind of fine-tuned models through modern experiments or surveys, really. But I'm guessing that's going to start happening soon. And I'm really excited to see what folks find. Yeah. Yeah.

So one thing that came to mind when I was reading your paper was the inherent bias we see in the fossil record. You know, our sense of what life looked like in the past is influenced by what gets preserved and that's influenced by all sorts of factors like climate and the bodies of the organisms we're talking about.

You know, I imagine that in most times and places in history, certain people were way overrepresented in written text. So how are you proposing that researchers navigate that to make sure we don't get this really biased sense of what people were like? That's a really vexing challenge for this kind of proposal.

Because, well, for most of human history, no one was literate. Writing is relatively recent. Right. And for the period in which some societies had writing, very few people actually knew how to read and write. Even fewer of them wrote things down that survive into the modern era. And so what you're getting is data that's going to skew towards...

Folks who are more elite, who are more educated. And we think there may be a couple ways to address this and they're imperfect, right? But maybe if we use them in combination, we can still deal a bit with the bias that's going to be baked in to these models.

One way is we know quite a bit about how things like social class affect the psychology of modern populations. So potentially we could fine tune those models a little bit, or we could run them through experiments and surveys and then kind of weight their responses to try to account for that bias. Yeah.

You know, in some cases we have other sources of historical record and analysis. To the extent that those capture more broadly perhaps a mentality or behavioral patterns of past populations, we could see if the results of these historical large language models align with those kinds of conclusions.

But it is tricky for sure. That's going to be a real challenge to overcome. Yeah. And of course, that wouldn't be a challenge that's unique to using historical data. It's a challenge we also see in training LLMs with modern data. Oh, absolutely. Right. And

You know, one thing that inspired this idea is some work by folks like Muhammad Atari and Yan Tao showing that current large language models really look kind of weird in the sense of they're more closely aligned with the psychology of folks in Western and Anglophone populations than in many other parts of the world. And I mean, hey, that makes sense, right? Given the training data is over-representing these societies. But it's also kind of exciting because it suggests that if you had a different kind of corpus

then you would capture some of that cultural zeitgeist and some of the culturally specific mentality of the folks who produced it. Yeah. Could you just tell people what WEIRD stands for in this context? Because I think it's a really good acronym. Yeah. So this is an acronym that Joe Henrich developed about a decade and a half ago, and it stands for Western Educated Industrialized Rich Democracies.

And so it turns out the minority of current human beings live in such kinds of societies. But depending on how you slice it, the vast majority of participants in behavioral science come from these samples. And this matters because it turns out culture affects how we think and act in a wide variety of ways, from the values that we hold to how much interpersonal distance we like when we're in public.

basic patterns of visual attention and cognition, rates of cooperation. It's a very long list. No, and I mean, I can definitely imagine, you know, obviously it's very evocative to talk about ancient history, but I can definitely imagine researchers trying to use, you know, some like

19th, 20th, even 21st century text from like underrepresented groups to sort of re-examine, you know, these psychological studies that maybe left large swaths of the population out. Yeah, I think that's an incredibly good idea. And in some ways, sort of the more recently we go back in the past, the easier it will be to do this kind of research. Yeah. So while it's exciting to think about pushing the envelope very, very far back,

probably the start, you know, the initial starting point will be let's look back 100 years or 150. Yeah. Well, and speaking of which, you know, imagining for a moment that this idea totally takes off and we're getting a bunch of sort of, you know, undead psychology projects going. What are some of your like dream use cases for this? So I do a lot of research that's informed by evolutionary psychology. And

Sometimes we will run an experiment or a survey and we'll try to get data from every continent in the world to see if some part of human psychology might be universal. And when we find it, it's really exciting. But we're making an inferential leap from saying it's universal and it makes adaptive sense.

to this is how people thought in the past and especially in the deep past. And so, being able to push that temporal window back, you know, robustly folks like Kenreck and Schmidt and others have found differences between men and women in preferred sexual strategies. You know, do you want a large number of partners and uncommitted relationships or do you prefer to have more exclusive relationships and fewer partners?

And that seems to hold all across the globe. But we could have a lot more confidence in these things really being a core part of human nature if we started to see them from societies that lived hundreds or thousands of years ago, I think. Totally. The idea is forward-looking and kind of speculative, right? I don't have one of these things on my computer ready to run. But Sachin, Baker, and Kahneman

colleagues recently published a paper where they had GPT-4 generate dozens of new hypotheses for social psychology research.

And then had actual social psychologists generate new hypotheses. And it turns out other social psychologists thought that the AI was coming up with way more compelling and probably true ideas. Interesting. So we may in the future see AI used not just to simulate participants or code data, but even to generate ideas. And you can imagine weird kind of closed loops like that where folks like me might be out of a job. Yeah.

Well, hopefully not. I think there will always be room for that unique human factor. But I think it's great to think about the ways that AI could really be an interesting tool for us. So thank you so much for taking the time to come and chat with us today. Oh, thanks, Rachel. This was my pleasure. I enjoyed the conversation.

That's all for this week's Friday Fascination. We'll be back on Monday with our weekly news roundup. And on Wednesday, we're chatting about something almost as spooky as AI ghosts. The psychology of Black Friday shopping. Science Quickly is produced by me, Rachel Feltman, along with Fonda Mwangi, Kelso Harper, Madison Goldberg, and Jeff Dalvisio. Shane Opposis and Aaron Shattuck fact-check our show. Our theme music was composed by Dominic Smith.

Subscribe to Scientific American for more up-to-date and in-depth science news. For Scientific American, this is Rachel Feltman. Have a great weekend.

Using AI to Understand the Thoughts of the Dead 12:04 Share