本期的 5 篇论文如下:
[00:43] TOP1(🔥199) | 🤖 Reinforcement Pre-Training(强化预训练)
[03:06] TOP2(🔥124) | 🕰 Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA(明日依旧为真吗?多语种常青问题分类以提升可信赖的问答系统)
[05:07] TOP3(🔥105) | 🧠 Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models(自信即全部:基于语言模型的小样本强化学习微调)
[07:23] TOP4(🔥99) | 🩺 Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning(灵枢:用于统一多模态医学理解和推理的通用基础模型)
[10:01] TOP5(🔥76) | 🩺 ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning(ReasonMed:一个用于推进医学推理的37万多智能体生成数据集)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递