The 13 papers in this episode are:
[00:23] 🧠 Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
[01:07] 🎬 Seedance 1.0: Exploring the Boundaries of Video Generation Models
[01:50] 🥽 PlayerOne: Egocentric World Simulator
[02:30] 🎬 Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation
[03:15] 🤖 ComfyUI-R1: Exploring Reasoning Models for Workflow Generation
[03:48] 🧠 SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
[04:25] 🧪 SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner
[05:10] 🎶 Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation
[05:52] 🎭 InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions
[06:34] 🤖 SAFE: Multitask Failure Detection for Vision-Language-Action Models
[07:14] 🧠 Reparameterized LLM Training via Orthogonal Equivalence Transformation
[07:56] 👁 MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis
[08:39] 🌱 Branched Schrödinger Bridge Matching
[Follow Us]
You can also find us on the following platforms for more information beyond the podcast:
Xiaohongshu: AI速递