We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
cover of episode E367. DeepSeek有新发现 | 谷歌Veo 3、百度布局等新动态

E367. DeepSeek有新发现 | 谷歌Veo 3、百度布局等新动态

2025/5/21
logo of podcast 创新灯塔

创新灯塔

AI Deep Dive AI Chapters Transcript
People
西
西娅
Topics
西娅:最近DeepSeek R1大火,我发现AI的推理能力虽然增强了,但是对于提示词的遵循能力却变差了。我在写DeepSeek攻略的时候就深有体会。一篇论文指出,使用KeyAlt推理后,大多数模型的执行准确率反而下降,原因是模型对于任务关键限制的注意力降低了。论文还提出了四种提升指令遵循效果的方案,其中Classifier Selected Reasoning最有效,但是成本也比较高。我认为,真正强大的智能应该懂得聚焦,有思考的分寸感。

Deep Dive

Chapters
The DeepSeek R1 model showed improved AI reasoning but suffered from decreased instruction-following accuracy. A Harvard, Amazon, and NYU study explored this trade-off, identifying a decline in attention to crucial task constraints after using key-word reasoning. Solutions for improving instruction-following were proposed, emphasizing the importance of focused, mindful intelligence.
  • Improved AI reasoning capabilities of DeepSeek R1 came at the cost of reduced instruction-following accuracy.
  • Study found that post key-word reasoning, the model's attention to crucial task constraints decreased.
  • Four solutions proposed to improve instruction-following, with Classifier Selected Reasoning being the most effective but costly.

Shownotes Transcript

今天的节目将探讨DeepSeek R1大火后AI推理能力变强但提示词遵循能力变差的现象该如何看待?《When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs》论文指出的模型准确率下降等问题又该如何解决?谷歌发布的Veo 3在AI视频领域带来重大突破,其能否进一步突破限制走向更广泛应用?百度强化多模态大模型、开展数字人直播带货等举措成效会如何?零一万物高管频繁离职,它能否稳定团队继续前行?接下来让我们来解锁这些商业科技动态吧

00:01:46 DeepSeek R1后AI情况及谷歌Veo 3发布 

00:05:27 百度、英伟达在AI领域相关动态 

00:08:23 零一万物高管频繁离职,前景待察 

    本期主播:西娅

    后期:西娅

    收听平台:小宇宙、喜马拉雅、Apple Podcast 等。

    如果喜欢我们的节目,欢迎点赞评论转发。