2024.07.12 Daily AI Papers

2024/7/12
HuggingFace Daily AI Paper Digest

Ten minutes a day to get up to speed on the day's trending AI papers on HuggingFace. Today's 15 papers are:

📊 Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

📊 MAVIS: Mathematical Visual Instruction Tuning

📹 Video Diffusion Alignment via Reward Gradients

🔍 MambaVision: A Hybrid Mamba-Transformer Vision Backbone

📊 GTA: A Benchmark for General Tool Agents

📊 The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

🌐 DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

🎥 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models

🌲 Gradient Boosting Reinforcement Learning

📉 Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

📖 SEED-Story: Multimodal Long Story Generation with Large Language Model

📹 Generalizable Implicit Motion Modeling for Video Frame Interpolation

📊 OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects

🎤 Autoregressive Speech Synthesis without Vector Quantization

🌍 WildGaussians: 3D Gaussian Splatting in the Wild

[Follow us for more information]

Xiaohongshu: AI速递