“DeepSeek beats o1-preview on math and ties on coding; will release weights” by Zach Stein-Perlman

2024/11/21
LessWrong (30+ Karma)

DeepSeek-R1-Lite-Preview was announced today. Post. Chatbot. Chinese blogpost translation. DeepSeek says it will release the weights. The model appears to be stronger than o1-preview on math, similar on coding, and weaker on everything else. DeepSeek is Chinese. I'm not really familiar with the company. I thought Chinese companies were at least a year behind the frontier. Chinese companies tend to game benchmarks more than the frontier Western companies, but I think DeepSeek does this less than other Chinese companies. The blogpost also shows inference-time scaling, like o1.

The original text contained 2 images which were described by AI.


First published: November 20th, 2024

Source: https://www.lesswrong.com/posts/TcgpsgvLBBvvzGtiN/deepseek-beats-o1-preview-on-math-and-ties-on-coding-will

---

Narrated by TYPE III AUDIO.

