We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
cover of episode #198 - DeepSeek R1 & Janus, Qwen1M & 2.5VL, OpenAI Agents

#198 - DeepSeek R1 & Janus, Qwen1M & 2.5VL, OpenAI Agents

2025/2/2
logo of podcast Last Week in AI

Last Week in AI

AI Deep Dive AI Chapters Transcript
People
A
Andrey Kurenkov
J
Jeremie Harris
Topics
Andrey Kurenkov: 我认为我们之前的播客对DeepSeek v3的预测是准确的,DeepSeek R1的结果并不令人意外。DeepSeek R1是一个与OpenAI的O1具有竞争力的语言模型,其优势在于推理能力。该模型的训练使用了强化学习方法,并取得了令人印象深刻的成果。DeepSeek R1的发布引发了美国科技股的剧烈波动,这反映了市场对AI技术发展前景的担忧和期待。然而,我认为市场对DeepSeek R1对英伟达的影响存在误读,它实际上利好英伟达的硬件生态系统。DeepSeek R1采用宽松的MIT许可证,这有利于其在商业和研究领域的应用。 此外,DeepSeek还发布了Janus Pro,一个性能优异的开源文本到图像模型。这些模型的发布表明,DeepSeek作为一个实验室,正在对开源AI领域产生重大影响。 Jeremie Harris: DeepSeek V3是一个强大的基础模型,通过强化学习优化就能达到与GPT-4相当的水平。人们对DeepSeek R1对硬件的影响存在误读,它实际上利好英伟达的硬件生态系统。仅仅通过奖励模型正确答案就能有效提升大型语言模型的推理能力,这证明了强化学习的强大潜力。深度学习模型通过强化学习,能够自主发现并利用推理时间缩放定律,这表明该定律是AI系统的一个内在属性。模型会自然地采用比人类更有效率的推理方式,人类可解释性只是对模型的一种额外限制。DeepSeek R1是实际应用的模型,而R1.0则展示了强化学习的未来潜力。DeepSeek证明了可以以更低的成本获得与OpenAI O1相当的性能,这对于英伟达来说是利好消息。DeepSeek的成功凸显了算力在AI发展中的重要性,也进一步强调了出口管制的必要性。DeepSeek的手机应用在Google Play商店排名第一,这表明其模型获得了广泛的关注。DeepSeek的成功并不能改变算力在AI发展中的核心地位,未来算力仍然是决定AI竞争力的关键因素。

Deep Dive

Shownotes Transcript

Our 197th episode with a summary and discussion of last week's big AI news! Recorded on 01/17/2024

Join our brand new Discord here!) https://discord.gg/nTyezGSKwP

Hosted by Andrey Kurenkov) and Jeremie Harris). Feel free to email us your questions and feedback at [email protected] )and/or [email protected])

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/).

In this episode:

  • DeepSeek releases R1, a competitive AI model comparable to OpenAI’s O1, leading to market unrest and significant drops in tech stocks, including a 17% plunge in NVIDIA's stock.   - OpenAI launches Operator to facilitate agentic computer use, while facing competition from new releases by DeepSeek and Quen, with applications seeing rapid adoption.  - President Trump revokes the Biden administration's executive order on AI, signaling a shift in AI policy and deregulation efforts.  - Taiwanese government clears TSMC to produce advanced 2-nanometer chip technology abroad, aiming to strengthen global semiconductor supply amidst geopolitical tensions.

If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form).

Timestamps + Links:

(00:00:00) Intro / Banter

(00:03:01) Response to listener comments

Projects & Open Source

(00:06:26) DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning)

(00:30:25) Viral AI company DeepSeek releases new image model family)

(00:34:07) Qwen2.5-1M Technical Report)

(00:38:32) Alibaba’s Qwen team releases AI models that can control PCs and phones)

Tools & Apps

(00:42:09) OpenAI launches Operator, an AI agent that performs tasks autonomously)

(00:47:37) DeepSeek reaches No. 1 on US Play Store)

(00:52:17) Alibaba rolled out Qwen Chat v0.2 and Qwen2.5-1M model)

(00:53:50) Perplexity launches US-hosted DeepSeek R1, hints at EU hosting soon)

(00:55:31) Apple is pulling its AI-generated notifications for news after generating fake headlines)

(00:59:00) French AI ‘Lucie’ looks très chic, but keeps getting answers wrong)

Applications & Business

(01:02:09) DeepSeek’s New AI Model Sparks Shock, Awe, and Questions From US Competitors)

(01:08:16) Microsoft loses OpenAI exclusive cloud provider status to $500 billion Stargate project)

(01:13:34) OpenAI adds BlackRock exec Adebayo Ogunlesi to board of directors)

(01:15:33) ElevenLabs has raised a new round at $3B+ valuation led by ICONIQ Growth, sources say)

Policy & Safety

(01:16:29) Donald Trump unveils $500 billion Stargate Project to build AI infrastructure in the US, promising over 100K jobs)

(01:21:16) Trump Revokes Biden AI Policy, Signs Executive Order to Strengthen AI Leadership)

(01:23:59) Anthropic CEO doesn’t see DeepSeek as ‘adversaries,’ but says export controls are critical)

(01:31:12) Taiwanese govt clears TSMC to make 2nm chips abroad — country lowers its 'Silicon Shield')

(01:33:47) Outro