DeepSeek-R1-Lite-Preview was announced today (post, chatbot, Chinese blogpost translation). DeepSeek says it will release the weights. The model appears to be stronger than o1-preview on math, similar on coding, and weaker on everything else. DeepSeek is a Chinese company, and I'm not really familiar with it. I had thought Chinese companies were at least a year behind the frontier. Chinese companies tend to game benchmarks more than the frontier Western companies do, but I think DeepSeek does this less than other Chinese companies. The blogpost also shows inference-time scaling, as OpenAI's o1 announcement did.
The original text contained 2 images which were described by AI.
First published: November 20th, 2024
---
Narrated by TYPE III AUDIO.