We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
cover of episode Who Is the Mysterious Founder of China’s DeepSeek?

Who Is the Mysterious Founder of China’s DeepSeek?

2025/5/20
logo of podcast Big Take Asia

Big Take Asia

AI Deep Dive AI Chapters Transcript
People
K
K. Oanh Ha
S
Saritha Rai
Topics
Saritha Rai: 我认为梁文峰是一位非常神秘且低调的科技企业家,他致力于将中国打造成人工智能领域的强国。他早年创立量化对冲基金并获得了巨大的成功,但他对人工智能的热情促使他创立了DeepSeek。DeepSeek的早期宣言就展现了他避免平庸、挑战AI领域难题的决心,以及将中国推向AI领导地位的雄心。DeepSeek通过技术创新和开源策略,在全球AI领域迅速崛起,并对美国构成了挑战。尽管DeepSeek取得了显著的成就,但它也面临着来自中国国内其他科技巨头的竞争,以及如何将技术商业化的挑战。此外,DeepSeek的开源策略也引发了一些争议,一些人认为这可能存在安全风险。 K. Oanh Ha: 我对梁文峰和DeepSeek的崛起感到非常好奇。DeepSeek的稀疏性技术是否可以理解为,AI模型只使用大脑的特定部分进行计算,而不是动用所有资源?DeepSeek选择开源模型的原因是什么?这在战略上有什么意义?

Deep Dive

Chapters
This chapter introduces Liang Wenfeng, the mysterious founder of DeepSeek, a Chinese AI startup that has taken the tech world by storm. It explores his background, education, and previous ventures, highlighting his transition from quantitative finance to AI.
  • Liang Wenfeng, founder of DeepSeek, is a relatively unknown figure despite his company's global impact.
  • He hails from a small village in Guangdong province and studied at Zhejiang University.
  • Prior to DeepSeek, he founded a successful quantitative hedge fund, High Flyer Management.
  • DeepSeek's early culture was described as geeky and unorthodox, attracting top talent from companies like Google and Facebook.

Shownotes Transcript

Translations:
中文

This is an iHeart Podcast.

Every business starts with an idea. How can you go from daydreamer to industry leader? Amazon Business accelerates your journey. With smart business buying, get everything you need to grow in one familiar place, from office supplies to IT essentials and maintenance tools. Amazon Business takes the buying experience you know and love from Amazon, plus tools that help you save costs and make insights-based decisions. Ready to bring your visions to life? Learn how at amazonbusiness.com.

AI is redefining what's possible for your business with more unique challenges to solve and higher stakes than ever. Microsoft helps you stay ahead.

Our trustworthy AI tools and guidance can empower leaders like you to drive greater impact. And with Azure's simplified platform management, we're helping businesses go further, faster, unlocking up to 150% improved output. Whatever challenge comes next, let Microsoft help you keep pushing forward. For more details, visit microsoft.com slash challengers. Bloomberg Audio Studios. Podcasts. Radio. News.

Earlier this year, a new product from the Chinese AI startup DeepSeek shocked the world and rattled Wall Street.

China's DeepSeek is freaking out the AI world right now. Tech stocks tumbled as its app surged to the top of the download charts. But despite the global attention, very little is known about the man behind DeepSeek, Chinese entrepreneur Liang Wenfang. Liang Wenfang is certainly a mystery figure. Bloomberg's Sarita Rai covers artificial intelligence in Asia.

He's certainly one of the most inaccessible and low-key tech entrepreneurs that I've come across. Just to illustrate how private he is, we weren't able to find any pictures of him on the internet when we scoured through his website and all of that, but finally appeared in a really high-profile meeting with President Xi Jinping, and that picture got out into the world, and he was everywhere.

What does this man of mystery look like? He is slim, wears glasses, but doesn't talk much. Baby-faced? Yes, I think we could describe him like that.

DeepSeek rarely answers questions about Liang, citing his privacy. But Sarita and her colleagues were curious about this man, whose AI systems turned the tech world on its head. So they spoke with dozens of people familiar with his work, from former employees and fellow researchers to investors and insiders in the industry. And what we found is that, yes, he is extraordinarily low-key, very shy,

but extraordinarily driven and talented and passionate. And I think he has somewhat taken on DeepSeek as a sort of a mission to establish China in AI, trying to make sure that China is a force to reckon with in AI.

Welcome to the Big Take Asia from Bloomberg News. I'm Wan Ha. Every week, we take you inside some of the world's biggest and most powerful economies and the markets, tycoons, and businesses that drive this ever-shifting region.

Today on the show, who is Liang Wenfeng? We learn about the mysterious tech founder who led DeepSeek to the front line of AI advances. Plus, what does the company's rapid rise tell us about the U.S.-China artificial intelligence race? Sarita, thanks for joining us. I'm fascinated by AI. You guys did such an interesting job with the story. I wonder if we can start with who is Liang Wenfeng? What do we know about his roots?

So Liang is about 40 years old, born in a small village called Miriling in the Guangdong province. His parents were schoolteachers, mainly taught primary school kids.

He was extremely bright and went on to study at Zhejiang University and then also did his master's there. At Zhejiang University, Liang and his friends immerse themselves in all things tech: machine learning, signal processing, electronic engineering. They even developed programs to trade stocks during the financial crisis.

After graduating, Liang joined forces with two of his classmates and set up a quantitative hedge fund called High Flyer Management. So quant funds basically work with mathematical models and statistical analysis to do stock trading. Humans are not involved in taking the decisions. At its peak,

High Flyer Management was managing something like $14 billion in assets. So it was quite a sizable fund. And in its most successful runs, it was providing annualized returns, averaging 35% to its investors. So I would say that it was doing very well indeed.

According to former employees, High Flyer had a geeky startup culture. Its early job postings boasted of attracting top talent from Google and Facebook and said they were looking for math and coding geeks with quirky brilliance.

The early job postings also referred to Sheldon, who is this very awkward main character in the prominent American sitcom called The Big Bang Theory. For example, I cry because others are stupid and it makes me sad. Sheldon has a legend of fans and is extraordinarily funny without meaning to be.

So, you know, the whole culture of deep seek in the early days revolved around recreating some of that geeky, nerdy culture. There were free snacks, poker game nights. Everybody was dressed in T-shirts and slippers. Sounds like a great place to work. Yeah, it was really a very unorthodox startup culture.

Unlike what you'll probably see in the big tech companies in China, such as Alibaba and Tencent. And how did Liang transition from doing quant financials into AI and building DeepSeek?

Liang was always extraordinarily interested in machine learning and artificial intelligence. And then a few months after OpenAI, you know, launched ChatGPT, the chatbot that became an overnight global success. It was then the spring of 2023, a few months had passed after the launch of ChatGPT. And Liang then...

announced that DeepSeek would be set up. In its early manifesto, DeepSeek talked about shunning mediocrity and tackling the big challenges in AI and, of course, ultimately cracking artificial general intelligence. The manifesto also laid out DeepSeek's ambition to position China as a leader in cutting-edge technologies.

You know, Liang has given two interviews, rare as they may be. In both those interviews, he's talked about bringing China's AI ecosystem to the forefront of where the world is. You know, China has been accused constantly of being a copycat. He wanted in AI China to chart a different path.

DeepSeq worked fast. Since 2023, it's released over half a dozen AI models and helped pioneer a technique called sparsity, which enabled those models to train and run more efficiently. Developers started to take note

Then, earlier this year... Back to that top story now, DeepSeek shaking up global tech. When they released their reasoning model R1, that caused such an upheaval in the industry and caused a trillion dollar stock market meltdown. That's when the world really started paying attention to this secretive AI entrepreneur in China.

And Sarita, what is so groundbreaking about DeepSeek's R1 model?

The AI industry until recently was always about billions of dollars spent in building the infrastructure, the data centers, the graphics processing units in the data centers that would train these models. But what DeepSeq did was show that its models could match or even outdo on some benchmark measures what the latest open AI or anthropic models were doing.

and with far less computational power, with far less resources, and as Deep Seek claimed, with far less capital as well. So how did Liang and his team manage to achieve true innovation at what it says is a fraction of the cost? And what does Deep Seek's success say about the AI race between China and the U.S.? That's after the break.

How can you grow your business from idea to industry leader? Bring your vision to life with smart business buying tools and technology from Amazon Business. From fast, free shipping to in-depth buying insights and automated purchase approvals, they deliver everything you need to achieve your goals. It's not easy to stand out from the crowd. Simplify how you stock up to get ahead. Go to AmazonBusiness.com for support.

In business, timelines shift, opportunities pop up fast, and your brand has to show up strong. That's why smart teams trust 4imprint. Whether you're planning ahead or responding to a last-minute need, 4imprint makes it simple to get the right promotional products fast, done right, and without stress. With thousands of options, including apparel, drinkware, tech, and trade show essentials, you'll find the right fit for your brand and timeline. Many available with a 24-hour turnaround.

Not sure where to start? 4imprint offers free product samples, expert guidance, and free logo assistance so your order turns out exactly as planned. And every order is backed by 4imprint's 360-degree guarantee, so it will arrive on time, on budget, and printed perfectly. That's what it means to be 4imprint certain.

Explore solutions at 4imprint.com and get your brand where it needs to be quickly, confidently and without compromise. For much of the last decade, the U.S. has tried to restrict China's access to semiconductors. Tensions reached a fever pitch in 2022 and the following year when Washington targeted Beijing with two rounds of chip export controls.

NVIDIA and shares of semiconductor companies have slumped today after the Biden administration said it would tighten restrictions on exports of AI chips to China. That limited sales from American firms like NVIDIA, whose cutting-edge chips are used by tech companies to help train their AI models. The move presented a significant challenge for developers in China. But as Bloomberg's Sarita Rai says, it also forced them to develop workarounds.

Necessity is always the mother of innovation. This has been proven by AI developers in China. Never mind the export curbs, they've still gone on to build good models that have benchmarked with the best around the world. And one of the most innovative approaches from DeepSeek is a sparsity technique we mentioned earlier.

Now, sparsity is something to do with building a model without having the high-end computational power. It's when a large language model doesn't have to be entirely harnessed to give an answer to a query. Instead, Liang and his fellow developers

tried to apportion the expertise of the model into smaller expert groups and then only harness those groups that required to be used. So in doing that, they made it much more computation efficient and also much more cost efficient.

Is it basically instead of using your whole brain, are you using just certain parts of your brain to do that computation? That's exactly right, Oan. You know, instead of entirely using every little gray cell in your brain, it only fires up those neurons or little portions in your brain that contain that put

particular field of expertise and then bring that to, you know, respond to a query or give an answer to a particular question, whether it's a command or a coding question.

The sparsity breakthrough impressed DeepSeek's competitors, but its price point is what ultimately made headlines. DeepSeek said it cost them just $5.6 million to train its V3 model. That's far less than the estimated $100 million OpenAI spent on its most advanced version of ChatGPT.

Now, there is definitely a whole lot of skepticism around that number because just the infrastructure, the training of the model, the talent and the time it takes, all of it adds to quite a sizable sum of money. So the skepticism is warranted. People have estimated that there was no way DeepSeek could have pulled that off without at least a billion dollars or more.

Also in DeepSeek's favor is that AI startups like it have a staunch ally in China's government and President Xi Jinping. Sarita says Xi sees generative AI, robotics, and other high-tech ambitions as beneficial to the state's agenda, part of a larger push for self-reliance in key technologies. And DeepSeek's success has spurred much bigger rivals such as Alibaba, Tencent, and ByteDance to release their own AI models.

Sarita, DeepSeek's model is entirely open sourced at this point. That means any individual or company could incorporate DeepSeek's algorithms into their own programs. Why did the company choose this approach and why is that important?

Open source, on one level, you could say that it is democratizing AI and taking it out into the world. But let's not forget that China's AI models would have otherwise found fewer takers around the world if they were proprietary models and were on par in terms of what they cost with Western companies such as OpenAI.

By making it cheap and by making it open source, China allowed people around the world to quickly take a look at the models and begin using them, allowing them to be adopted much faster in the business and AI ecosystem, thereby outdoing the likes of OpenAI.

Now, that's huge. It's not only about democratizing models. It's strategically about making sure that you cut out your competitor by making things so cheap that the world adopts it quickly, and then it becomes mainstream. As a result, Microsoft and Amazon both offer DeepSeek on their cloud services. And DeepSeek's models have been incorporated into Perplexity, an AI-powered search engine that also offers models from OpenAI and Anthropic.

There is definitely a question about, you know, how fast AI is advancing. And there's a fear around the world about having all of the controls rest only with one or two companies in the world. I think that was what CheapSeek and others were trying to put out a message in the world saying that,

All of the controls cannot be left to one or two companies and the proprietary models that they are building. It should be much more democratic. Therefore, I think the open source philosophy is about de-risking concentration and allowing more people to build with technologies that are much more available.

Is there potentially a kind of clash of cultures or a clash of values as well when it comes to building AI between the West's approach and the Chinese approach? Very clearly, because if you look at early models of deep seek or even the, you know, not deep.

tweaked or fine-tuned models of deep-seek, they are very much working within the boundaries of China's censorship rules. For instance, you cannot ask questions about Taiwan or Xi Jinping without it giving a very bland, official answer. Whereas if you take that same model and you can train it with other data and make it culturally suitable to different countries,

That's one of the things that DeepSeek learned early on, that by open sourcing the model and by giving developers and users a chance to customize to their own cultural context, DeepSeek could find much more opportunity.

quicker and faster adoption around the world than by controlling a lot of it and controlling it in a way that it could only give China friendly answers around the world.

And while some applaud China's innovations in AI, many in the U.S. suspect darker reasons for the success. An April report from a U.S. House of Representatives committee alleged significant ties between DeepSeek and the Chinese government. It concluded that the company unlawfully stole data from OpenAI. The Chinese embassy rejects those claims as groundless. Meanwhile, DeepSeek and Liang haven't commented on the House report.

Sarita, it seems like there is very much of an arms race of sorts when it comes to AI, certainly between the U.S. and China at the moment. It's very much a race, and I think it would be too early to call a winner. All I can say is that a year ago, I would not have called it as a close race.

It's a marathon, but you have to go at a sprinting pace. We are really at the very beginning of it. And for whichever country cracks the race, there is a lot of economic gains to be had. So every country, particularly the U.S. and China, do not want to let up in AI.

And what are the challenges ahead right now for DeepSeek that you see? I think one of the main challenges is what next? What can they do that outdoes what they've already done? But there's also, I think for DeepSeek, competition within its own home ground. A bunch of China companies such as Alibaba and ByteDance and Tencent. Building models that are outdoing DeepSeek's last flagship model.

So there's this pressure on DeepSeek to do better. But also, I think there is also the question around commercializing these models. How are companies like DeepSeek going to make money? There is no clear answer yet whether DeepSeek wants to make money. And if it does, how will it make money? This is The Big Take Asia from Bloomberg News. I'm Wan Ha. This episode was produced by Yang Yang and Naomi Ng.

It was edited by Patty Hirsch and Joshua Brustein. It was fact-checked by Bloomberg's editorial team and mixed and sound designed by Taka Yasuzawa. Our senior producer is Naomi Chavon. Our senior editor is Elizabeth Ponzo. Our deputy executive producer is Julia Weaver. Our executive producer is Nicole Beamsterbauer. Sage Baumann is Bloomberg's head of podcasts.

If you like this episode, make sure to subscribe and review The Big Tech Asia wherever you listen to podcasts. It really helps people find the show. Thanks for listening. See you next time. I know a lot of cops. They get asked all the time, have you ever had to shoot your gun? Sometimes the answer is yes. But there's a company dedicated to a future where the answer will always be no. This is Absolute Season 1. Taser Incorporated. I get right back there and it's bad.

Listen to Absolute Season 1, Taser Incorporated, on the iHeartRadio app, Apple Podcasts, or wherever you get your podcasts. Sometimes life calls for a better mattress, and sometimes life hands you a better place to find one.

Announcing the grand opening of Mattress Warehouse with the largest selection of top brands at the lowest prices. That's the warehouse advantage. Plus, you get a one-year low price guarantee and 0% interest financing. So where would you look to find the perfect mattress? In a little mattress store or at Mattress Warehouse? Shop our grand opening sale going on now. Visit mattresswarehouse.com. This is an iHeart Podcast.