E367. New Findings on DeepSeek | Google Veo 3, Baidu's Moves, and Other Updates

2025/5/21

创新灯塔 (Innovation Lighthouse)

People

西娅

Topics

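西娅: With DeepSeek R1 taking off recently, I have noticed that although AI reasoning has become stronger, models have become worse at following prompts. I felt this keenly while writing my DeepSeek guide. One paper reports that after using chain-of-thought (CoT) reasoning, most models become less accurate at carrying out instructions, because the model pays less attention to the key constraints of the task. The paper also proposes four ways to improve instruction following; of these, classifier-selective reasoning is the most effective, but it is also relatively costly. In my view, truly powerful intelligence should know how to focus and should have a sense of proportion in its thinking.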

Deep Dive

With the rise of DeepSeek R1, AI reasoning has improved, but instruction-following accuracy has suffered. A Harvard, Amazon, and NYU study explored this trade-off and found that models pay less attention to crucial task constraints after chain-of-thought (CoT) reasoning. The study proposed solutions for improving instruction following, and the host stresses the importance of focused, measured intelligence.

Improved AI reasoning capabilities of DeepSeek R1 came at the cost of reduced instruction-following accuracy.
The study found that after chain-of-thought (CoT) reasoning, the model's attention to crucial task constraints decreased.
Four solutions were proposed to improve instruction following; classifier-selective reasoning was the most effective, but also costly.

Shownotes

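Today's episode asks: now that DeepSeek R1 has taken off, how should we view the fact that AI reasoning is getting stronger while prompt adherence is getting worse? How can the problems identified in the paper "When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs", such as the drop in model accuracy, be addressed? Google's newly released Veo 3 marks a major breakthrough in AI video; can it push past its remaining limitations toward broader use? How effective will Baidu's moves be, including strengthening its multimodal large models and launching digital-human livestream e-commerce? With executives frequently leaving 01.AI (零一万物), can the company stabilize its team and keep moving forward? Let's unpack these business and technology developments.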

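00:01:46 The state of AI after DeepSeek R1, and the release of Google Veo 3

00:05:27 Baidu and NVIDIA developments in AI

00:08:23 Frequent executive departures at 01.AI (零一万物); outlook uncertain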

Host: 西娅

Post-production: 西娅

Listen on: 小宇宙 (Xiaoyuzhou), 喜马拉雅 (Ximalaya), Apple Podcasts, and more.

If you enjoy the show, please like, comment, and share.
