Robots, Small Models, and RL with DeepSeek Alumnus Zihan Wang — #86

2025/5/22

Manifold

People
Steve Hsu
Zihan Wang
Topics
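Zihan Wang: I think robotics will make great progress over the next few years, because large language models now have vision capabilities and can do semantic understanding. I studied at the Gaoling School of Artificial Intelligence at Renmin University, an underrated school that nonetheless publishes many influential papers. Chinese universities are changing very fast; cutting-edge research can now be done anywhere, and open-source infrastructure has lowered the barrier to doing research. Chinese researchers are always quick to spot promising algorithms and scale them up. I think of open source as a kind of game theory: when everyone cooperates, the benefit to society is maximized. I hope AI can help accelerate humanity's journey to the Moon, and I look forward to AI applications in code debugging and robotics.

Steve Hsu: I noticed that you have posted some interesting articles on Twitter about AI research and your DeepSeek internship. The pace of change at Chinese universities has shocked Americans; today's young professors and students are excellent. Do you think China's unique or creative innovation will surpass that of the US? Even without a huge compute budget, you can do influential research at a university. Things are much better now than they were a few years ago. Even investors and venture capitalists who follow AI closely usually don't know much about what is happening in China, and they are surprised by the quality of Chinese models.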

Deep Dive


Zihan Wang is an AI researcher at Northwestern University, where he works on vision-language models, robotics, and reinforcement learning. Previously, he interned at DeepSeek, contributing to projects like DeepSeek-V2.

Zihan's homepage: https://zihanwang314.github.io/

  • (00:00) - Introduction

  • (01:13) - Zihan's Background, CS and AI Research in China

  • (11:09) - DeepSeek; Human capital flow from PRC to US

  • (16:07) - DeepSeek, Open Source and AI Research

  • (31:52) - Model Size and Performance Constraints

  • (33:01) - Data Bottleneck in Pre-trained Models

  • (34:12) - Transformer Architecture and Scaling Laws

  • (36:30) - Efficiency in Model Training

  • (47:44) - Chain of Experts Architecture

  • (01:01:06) - Future of AI and Robotics

Music used with permission from Blade Runner Blues Livestream improvisation by State Azure.

Steve Hsu is Professor of Theoretical Physics and of Computational Mathematics, Science, and Engineering at Michigan State University. Previously, he was Senior Vice President for Research and Innovation at MSU and Director of the Institute of Theoretical Science at the University of Oregon. Hsu is a startup founder (SuperFocus.ai, SafeWeb, Genomic Prediction, Othram) and an advisor to venture capital and other investment firms. He was educated at Caltech and Berkeley, was a Harvard Junior Fellow, and has held faculty positions at Yale, the University of Oregon, and MSU. Please send any questions or suggestions to [email protected] or to Steve on X at @hsu_steve.