
“DeepSeek beats o1-preview on math and ties on coding; will release weights” by Zach Stein-Perlman

2024/11/21

DeepSeek-R1-Lite-Preview was announced today. Links in the original: post, chatbot, Chinese blogpost translation. DeepSeek says it will release the weights. The model appears to be stronger than o1-preview on math, similar on coding, and weaker on everything else. DeepSeek is a Chinese company; I'm not really familiar with it. I thought Chinese companies were at least a year behind the frontier. Chinese companies tend to game benchmarks more than the frontier Western companies, but I think DeepSeek does this less than other Chinese companies. The blogpost also shows inference-time scaling, like o1.

The original text contained 2 images which were described by AI.

First published: November 20th, 2024

Source: https://www.lesswrong.com/posts/TcgpsgvLBBvvzGtiN/deepseek-beats-o1-preview-on-math-and-ties-on-coding-will

---

Narrated by TYPE III AUDIO.

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

LessWrong (30+ Karma)
