We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
People
A
Andrej Karpathy
N
N.L.W.
Topics
Andrej Karpathy: 大型语言模型(LLM)的出现,颠覆了以往技术传播模式。以往新技术通常稀缺、资本密集且需要专业技术知识,但LLM却反其道而行之,普通大众从中受益匪浅,而企业和政府机构的影响相对较小且滞后。ChatGPT作为史上增长最快的消费者应用,拥有数亿周活跃用户,其应用范围涵盖写作、编码、翻译、辅导、摘要、深度研究、头脑风暴等多个领域。这并非简单的升级,而是对个人能力的重大提升,使用门槛极低,模型廉价甚至免费,速度快,任何人都可以通过网址或本地机器访问,并支持多种语言和表达方式。这种技术突破前所未有。然而,在企业和政府领域,LLM的益处却相对有限。原因有三:首先,LLM的能力在于提供广泛但浅显的准专家知识,而组织的优势在于集中各种专业知识。虽然LLM可以提高专家的效率,但对组织的整体提升有限。其次,组织面临的问题更复杂,需要更多的协调工作,例如系统集成、品牌规范、安全协议、隐私考虑、国际化、合规和法律风险等,这些因素难以纳入LLM的上下文窗口。第三,大型组织存在固有的惯性,包括文化、历史先例、政治因素、沟通成本、员工再培训挑战和官僚主义等,这些都阻碍了LLM的快速采用。总而言之,目前LLM对个人的影响远大于对组织的影响。 展望未来,LLM的持续普及取决于性能提升和资本支出的动态范围。目前,尖端LLM性能易于获取且廉价,但未来情况可能发生变化。一旦资金能够购买到性能显著提升的LLM,大型组织将集中资源购买更强大的智能,社会精英与普通大众之间的差距可能再次扩大。但至少目前,我们正处于技术史上一个独特且前所未有的局面。许多科幻作品并未预测到AI革命将以这种方式发展,它并非由政府秘密项目掌控,而是像ChatGPT一样,一夜之间免费出现在每个人口袋里的设备上。 N.L.W.: Andrej Karpathy的观点很有见地,它帮助我们理解AI目前如何改善现状。企业是由专家组成的,而个人通常只擅长少数领域。因此,对于个人而言,AI能够将他们在大多数领域的水平从新手提升到中等水平,这比对专家的提升更有意义。AI对个人的应用场景远多于对专家的应用场景。 大型语言模型可能会导致企业内部核心职能和次要职能的分化,并促进数字员工的兴起。企业可能会选择不招聘某些技能,因为AI工具已经足够胜任。这尤其体现在代理(agent)的兴起上,企业可能会选择低成本的数字员工,即使其能力不如人类员工。很多企业对代理的应用会从边缘职能开始,例如销售和营销部门。 虽然大型语言模型对个人的影响更大,但其在企业领域的应用速度也远超以往任何技术。企业对AI的采用速度非常快,这与以往任何技术都不同。企业上下都意识到AI是具有颠覆性的力量,它不仅带来了新的工具,还解锁了全新的效率和机遇。虽然企业在实验方面受到限制,但许多企业正在积极拥抱外部实验,并将经验应用到内部流程中。 大型语言模型的普及将极大地促进自下而上的创业活动。许多以前难以实现或资本密集型的活动现在变得更容易,这将激励更多人创业。

Deep Dive

Shownotes Transcript

Translations:
中文

Thank you.

Hello, friends. Quick notes before we dive into today's show. First up, as I've been mentioning a couple times, for those of you who are looking for an ad-free version of the AI Daily Brief, you can now head on over to patreon.com slash ai daily brief to find that. Lastly, something I want to gauge people's perspective on.

The AI Daily Brief community has been hugely supportive of and important in the superintelligence story. We're considering reserving part of our current round for investors from this community. However, I'm trying to gauge interest. If this is something you think we should explore, send me a note at nlw at besuper.ai with super in the title. Thanks in advance for your perspective. And with that, let's get into today's show.

Hello, friends. Welcome back to another Long Reads episode of the AI Daily Brief. One more quick reminder before we get into it. As you are listening to this, I am in Florida, probably just about to surprise our kids with their first Disney World trip, which should be great. As I mentioned, though, this means that next week's episodes are a little bit different. We have a slate of really interesting interviews.

Lots of talk about agents and vibe coding and big technological changes, stuff that I think will be a really interesting and enlightening change of pace. If some crazy thing happens, you can be assured that I will find a way to get in there and share some content with you. But in the meantime, assuming that that doesn't happen, we will have a week of pre-recorded interviews, and then we will be back at the end of next week with an Easter Long Read Sunday and then normal episodes to follow that.

For now, though, let's jump over into this recent post from Andrej Karpathy. It's basically a blog post but shared to X. And it's always a real treat when we get to read a piece from one of the big thinkers in the space, especially because these types of pieces tend to be more conversational, let's say, than the essays that get published in an op-ed section in Bloomberg or something. By choosing to put this on X,

Andre's inviting discourse and conversation. And we've seen in the past what happens when Andre invites discourse and conversation. When the man introduced the term vibe coding, which although it's kind of been warped from how he was originally using it, is obviously one of the most influential concepts of the year. In any case, this piece is called Power to the People, How LLMs Flip the Script on Technology Diffusion. And we're going to read it first, and then I'll come back and discuss it. Once again, you guys are on a roll. This is actually me reading it rather than AI.

Andre writes,

This progression feels intuitive. New and powerful technologies are usually scarce, capital-intensive, and their use requires specialized technical expertise in the early stages. So it strikes me as quite unique and remarkable that LLMs display a dramatic reversal of this pattern. They generate disproportionate benefit for regular people, while their impact is a lot more muted and lagging in corporations and governments.

ChatGPT is the fastest-growing consumer application in history, with 400 million weekly active users who use it for writing, coding, translation, tutoring, summarization, deep research, brainstorming, etc.

This is not a minor upgrade to what existed before. It's a major multiplier to an individual's power level across a broad range of capabilities, and the barrier to use is incredibly low. The models are cheap, free even, fast, available to anyone on demand behind a URL or even local machine, and they speak anyone's native language, including tone, slang, or emoji. This is insane. As far as I can tell, the average person has never experienced a technological unlock this dramatic this fast.

Why then are the benefits a lot more muted in the corporate and government realms? I think the first reason is that LLMs offer a very specific profile of capability, that of merely quasi-expert knowledge and performance, but simultaneously across a very wide variety of domains. In other words, they are simultaneously versatile but also shallow and fallible. Meanwhile, an organization's unique superpower is the ability to concentrate diverse expertise into a single entity by employing engineers, researchers, analysts, lawyers, marketers, etc.,

While LLMs can certainly make these experts more efficient individually, e.g. drafting initial legal clauses, generating boilerplate code, etc., the improvement to the organization takes the form of becoming a bit better at the things it could already do. In contrast, an individual will usually only be an expert in at most one thing.

So the broad quasi-expertise offered by the LLM fundamentally allows them to do things they couldn't do before. People can now vibe code apps. They can approach legal documents. They can grok esoteric research papers. They can do data analytics. They can generate multimodal content for branding and marketing. They can do all of this at an adequate capability without involving an additional expert.

Second, organizations deal with problems of a lot greater complexity and necessary coordination. Think various integrations, legacy systems, corporate brand or style guides, stringent security protocols, privacy considerations, internationalization, regulatory compliance, and legal risk. There are a lot more variables, a lot more constraints, a lot more considerations, and a lot lower margin for error. It's not so easy to pull all of it into a context window. You can't just vibe code something. You might be one disastrous hallucination away from losing your job.

And third, there is the well-documented inertia of a larger organization, featuring culture, historical precedents, political turf that escalate in periods of rapid change, communication overhead, retraining challenges of a distributed workforce, and good old-fashioned bureaucracy.

These are major headwinds when it comes to rapid adoption of a sparkling new, versatile but shallow and fallible tool. I don't wish to downplay the impacts of LLMs in corporations or governments. But at least for the moment and in aggregate across society, they have been significantly more life-altering for individuals than they have been for organizations. Mary, Jim, and Joes are experiencing the majority of the benefit, not Google or the government of the United States.

Looking forward, the continued diffusion of LLMs, of course, depends on continued performance improvement and its capability profile. The benefit distribution overall is particularly interesting to chart and depends heavily on the dynamic range of the performance as a function of capital expenditure. Today, frontier-grade LLM performance is very accessible and cheap. Beyond this point, you cannot spend a marginal dollar to get better performance, reliability, or autonomy. Money can't buy better chat GPT. Bill Gates talks to GPT-4.0 just like you do. But can this be expected to last?

trained time scaling, increasing parameters and data, test time scaling, increased time, and model ensembles, increased batch, are forces increasing the dynamic range. On the other hand, model distillation, the ability to train disproportionately powerful small models by training to mimic the big model, has been a force decreasing dynamic range.

Certainly, the moment money can buy dramatically better chat GPT, things change. Large organizations get to concentrate their vast resources to buy more intelligence, and within the category of individual two, the elite may once again split away from the rest of society. Their child will be tutored by GPT-8 Pro Max High, yours by GPT-6 Mini.

But at least at this moment in time, we find ourselves in a unique and unprecedented situation in the history of technology. If you go back through various sci-fi, you'll see that very few would have predicted the AI revolution would feature this progression. It was supposed to be a top-secret government megabrain project wielded by the generals, not Chachipi T, appearing basically overnight and for free on a device already in everyone's pocket.

Remember that William Gibson quote? The future is already here, it's just not evenly distributed. Surprise, the future is already here, and it is shockingly distributed. Power to the people. Personally, I love it. Today's episode is brought to you by Vantub.

Vanta is a trust management platform that helps businesses automate security and compliance, enabling them to demonstrate strong security practices and scale. In today's business landscape, businesses can't just claim security, they have to prove it. Achieving compliance with a framework like SOC 2, ISO 27001, HIPAA, GDPR, and more is how businesses can demonstrate strong security practices.

And we see how much this matters every time we connect enterprises with agent services providers at Superintelligent. Many of these compliance frameworks are simply not negotiable for enterprises.

The problem is that navigating security and compliance is time-consuming and complicated. It can take months of work and use up valuable time and resources. Vanta makes it easy and faster by automating compliance across 35+ frameworks. It gets you audit-ready in weeks instead of months and saves you up to 85% of associated costs. In fact, a recent IDC whitepaper found that Vanta customers achieved $535,000 per year in benefits, and the platform pays for itself in just three months.

The proof is in the numbers. More than 10,000 global companies trust Vanta, including Atlassian, Quora, and more. For a limited time, listeners get $1,000 off at vanta.com slash nlw. That's v-a-n-t-a dot com slash nlw for $1,000 off.

Today's episode is brought to you by Super Intelligent and more specifically, Super's Agent Readiness Audits. If you've been listening for a while, you have probably heard me talk about this, but basically the idea of the Agent Readiness Audit is that this is a system that we've created to help you benchmark and map opportunities for your business.

in your organizations where agents could specifically help you solve your problems, create new opportunities in a way that, again, is completely customized to you. When you do one of these audits, what you're going to do is a voice-based agent interview where we work with some number of your leadership and employees.

to map what's going on inside the organization and to figure out where you are in your agent journey. That's going to produce an agent readiness score that comes with a deep set of explanations, strength, weaknesses, key findings, and of course, a set of very specific recommendations that then we have the ability to help you go find the right partners to actually fulfill. So if you are looking for a way to jumpstart your agent strategy, send us an email at agent at besuper.ai and let's get you plugged into the agentic era.

So I think there is a ton that's interesting in here, and I'm going to take it in honestly no particular order, just kind of my set of thoughts as I reflect on this piece. The first is that I think it's actually a very salient point that helps us understand how AI is improving things right now to recognize that corporations are groups of specialists, whereas an individual inherently can only be a specialist in so many things.

And so given that an individual is a specialist in very few things and a novice in the vast majority of things, AI's capability to take them from novice to intermediate or novice to adequate in terms of design, coding, whatever it is, is a greater change in general than a person who's just operating in their specialist capacity inside the context of the corporation they're contributing to.

Basically, the band of use cases that are valuable to an individual operating as an individual are inherently higher than the band of use cases useful to a specialist operating as a specialist.

I think it's a really salient point, and I also think that it will influence how these things take root even inside enterprises and corporations. One of the interesting and non-obvious outcomes, although it feels like it's getting perhaps more obvious now, is a stronger differentiation within side companies between core functions and secondary functions.

And when it comes to those secondary functions, more possibility that the capabilities of a novice who's a specialist at the main thing using AI tools to do those things might be good enough. In other words, companies might decide not to hire certain skill sets and expertise that they would have before because it's not core to the business even if it's something that they do, yes, have to do. And the AI-ified version is good enough.

Now, obviously, if you're sitting there screaming agents, but agents, I think you're correct to recognize that this seems especially where we might see the rise of people opting to hire digital employees instead of human employees. One of the ways to embody and embrace that good enoughness is to pick the low cost agentic option that, yes, maybe still can't compete with best in class human versions, but for a company for whom that thing is important, but not mission critical, that's totally sufficient.

One of the themes that you'll hear a lot next week in our interview conversations is the idea that a lot of where agents are going to start is going to be on the margins. In sales organizations, in marketing groups, not that those things aren't important, but they are separate from the core function of whatever the business is.

A second thing, though, that I think is important to note is while I agree in general that if you just take an individual operating in their own capacity and an individual operating in their corporate capacity as a specialist, AI and LLM specifically have had or have the potential to have more broader impact on that individual operating in their own capacity. I think that Andre's argument perhaps might mislead people in terms of understanding how big an impact and how fast these technologies are already having in the enterprise sector.

If you look at the previous rate of change and rate of adoption of technologies in the corporate sector and compare it to AI, there is absolutely nothing comparable. The speed with which enterprises and businesses have radically reoriented themselves to at least attempt to adopt these technologies is totally unlike anything we've ever seen.

There is an understanding up and down the organization that these are hugely disruptive forces. They go beyond just new tooling. They are unlocking totally new types of efficiencies as well as totally new types of opportunities, which we have barely scratched the surface of. And relative to the pressure of general corporate inertia, the adoption is actually happening incredibly quickly. Now, like I said, that doesn't undermine his point that there's something really powerful about the fact that people are figuring this stuff out on their own faster.

In fact, a lot of the natural evolution and progression we're seeing is people using their personal Gmail accounts to figure things out and then slowly bringing in what they've learned into the office. One of the things that really smart organizations are doing that some that are perhaps lagging behind aren't is actively embracing that external experimentation to internal new process adoption funnel.

Organizations and enterprises are constrained on how freewheeling and experimental they can be. However, I would say that in general, they overestimate risk and they undervalue finding ways for people to experiment, even if they can't do it with corporate data. The last thing that I will say is that I think that one other outcome of this progression of technology diffusion that Andre is noticing is that we are likely to see a radical increase in bottoms-up entrepreneurship.

The amount of activities that seemed inaccessible to people before, or capital constraining because they would have had to hire someone to do it, and which stood in the way of them just doing the damn thing that they've wanted to do, has decreased radically. And that means the number of people who are going to just throw off the shackles of their normal enterprise and try to do the thing that they've always dreamed of is going to increase.

I think we are going to see an absolute Cambrian explosion of small business entrepreneurs, solopreneurs, people building stuff because that becomes easier. And that's one of the impacts that I'm most excited to see play out. Anyways, big thanks to Andre for another thought-provoking piece. And thanks to you guys, as always, for listening or watching. Until next time, peace.