Transcript of China Challenges Silicon Valley for A.I. Dominance

The Daily

24:41 1,273 views Published over 1 year ago

00:00:00

Hey, I'm Tracy Mumford. You can join me every weekday morning for the headlines from the New York Times.

00:00:05

Now, we're about to see a spectacle that we've never seen before.

00:00:08

It's a show that catches you up on the biggest news stories of the day. I'm here in West Square. We'll put you on the ground where news is unfolding.

00:00:15

I just got back from a trip out to the front line, and every soldier- And bring you the analysis and expertise you can only get from the Times newsroom. I just can't emphasize enough how extraordinary this moment is.

00:00:27

Look for the headlines wherever you get your podcast. Lasts.

00:00:31

From the New York Times, I'm Natalie Kittrowaf. This is The Daily.

00:00:35

Last week, financial markets went into a panic over an obscure Chinese tech startup called Deep Seek.

00:00:50

That company now threatens to upend the world of artificial intelligence and the race for who will dominate it. Today, my colleague Kevin Ruce a tech columnist and the co-host of the podcast Hard Fork, on how Deep Seek caught us all off guard.

00:01:11

It's Monday, February third.

00:01:26

Hi, Kevin.

00:01:27

Hello. So great to be here.

00:01:30

Let's jump in. Kevin, how did this giant AI tech freak out begin? Tell us that story.

00:01:38

The freak out really started in earnest with a Chinese AI company called Deep Seek. Deep Seq had released a new AI model. Models are released all the time. Generally, they don't make international news, but this model was different in a few ways. Deep Seek released its new chatbot app, which is said to perform as well as ChatGPT. One of them was that it just appeared to be a really good model, better than the leading Chinese models at the time and on par or close to on par with the leading American models. A new AI king was crowned today, well, at least for now. And so the Deep Seq app. Unseeding Open AI's ChatGPT goes to number one on the App Store charts. It vaults ahead of ChatGPT in all these better known apps. Now, the most stunning thing here isn't necessarily that China has developed a pretty good AI app. It's how cheap it is. The more notable thing, the thing that really caused the American AI industry to start to panic, was how cheaply this model appeared to have been built.

00:02:51

How cheap are we talking?

00:02:53

We think that this model cost Deep Seek about $5. 5 million to train. Now, That might sound like a lot of money, but it's really not compared to what many of the American AI companies are spending. Metta said that it was spending 65 billion dollars. Microsoft says it plans to spend about $80 billion. Microsoft said it was spending $80 billion. And some of the world's most prominent technology leaders, pledging to invest an initial $100 billion. Openai had just announced this giant partnership where they were planning to spend It'll be up to half a trillion over the next four years. As much as half a trillion dollars to build the infrastructure for AI.

00:03:37

Whoa.

00:03:38

Yeah, it's wild. On top of that, Deep Seq says that they built their model without access to the latest and greatest American AI chips, which up until now were thought to be necessary to build the most powerful models. A lot of people put a lot of money into AI, and now they're wondering if that money is needed the the way that some of these American companies have said it is. Exactly. Investors start saying, Wait a minute, if it only costs $5. 5 million to train a leading edge AI model, Then what the heck are all these American companies spending hundreds of millions of dollars or even billions of dollars to train roughly equivalent models? Tech stocks plummeted Monday as investors raised concerns about advancements in Chinese artificial intelligence. And so the stocks of many of the American tech companies start to fall. Right. And so after all of this, people in the American tech industry start asking questions. Who is Deep Seq? And how are they getting these incredible models with so little money spent on them?

00:04:57

Okay, we're going to get to those questions of who this company is and how they did this. But I just first want to dig into the anatomy of the market panic. What are the real fears driving this?

00:05:10

It depends who you ask because there are a couple overlapping panics that are starting to happen around this time. Of course, again, there's the investor panic. I mean, imagine if you had your whole portfolio invested in American AI companies, it would be like if you just bought a very high-end sports car, like a Lamborghini, and you had been driving it around and were so proud of how fast it could accelerate and how well it handled. Then some random guy shows up with a soapbox car made of balsa wood, and it can go just as fast as your car. You'd be like, What the heck? Why did I just spend all this money on this Lamborghini?

00:05:53

Yeah, and should I maybe be investing in balsa wood cars?

00:05:57

Yes. Then, of course, there's the geo political freak out because Deep Seek is a Chinese AI company. There has been this race happening between primarily the US and China for years about AI and AI supremacy. Who was going to be able to build the most powerful AI models before the other one. That is a very important question for things like assessing the future of military conflict. If one country's AI is way better than another country's AI, they might have an advantage. In fact, the US has banned the export of the most powerful AI chips to China for exactly this reason, to try to hobble the Chinese AI companies to keep them from catching up when it comes to building the leading edge models that could become very important. Instead, Deep Seq had to make do with these Kirkland Signature chips that are pretty good, but they're not the best. That combined with the amount of money spent really made people say, How do they pull this thing off?

00:07:02

Kevin, it certainly seems that at least based on what Deep Seq is saying, it has managed to pull off a pretty impressive feat here. But I'm wondering, can we trust what the company says? Can we trust their claims about how they pulled this off?

00:07:20

Yeah. There are a lot of people who are skeptical of what Deep Seq has claimed, in particular, the cost of the model. $5. 5 million might not be the It doesn't include all of the research and the engineer salaries and things that went into that, so that the real cost is probably significantly higher than that. But there are questions about, did they smuggle in very powerful chips that would have actually allowed them to build a model this good? Is there something going on? Is the Chinese government funneling money to them and not telling us about it? There are lots of theories. But then as time wears on and people who are experts in this stuff start digging through the details, they're coming to the conclusion that, well, yeah, maybe the cost is a little higher than deep sea claims. Maybe they have a few more chips than they're telling us about. But in general, this seems like they actually just did build a really good model using some very clever engineering techniques.

00:08:19

Okay, so let's talk about those engineering techniques. I mean, how actually did Deep Seq do this? Make a chatbot on a shoestring budget, potentially with second-rate chips.

00:08:30

Because Deep Seq did not have access, we don't think, to the most powerful chips that American companies are using, they had to get clever about becoming more efficient with their model. I won't bore you with the technical details. It includes terms like mixture of experts, architecture. But basically, they were able to use some clever tricks to squeeze the most power out of the chips that they did have.

00:08:58

It occurs to me, Kevin, that this company was operating under a lot of constraints. It sounds like that may have forced the engineers to think about how to tackle this problem differently. As in, it seems possible that not having these critical ingredients actually spread innovation.

00:09:16

Yeah. I mean, there's this saying in the tech industry that constraints inspire creativity. That is definitely true here. Deep Seek did not have access to the best American AI They did not have the largest budget or the most sophisticated team, but they were really scrappy and smart. They had a lot of really good young engineers, and they were able to pull this off.

00:09:45

Kevin, how do the big American tech companies contend with that? What do they say to investors who are wondering about whether maybe these companies have been throwing money away when some of this work on these AI models could have been done much more cheaply?

00:10:02

What the AI companies in America are saying in response to this market panic is, look, we've still got to build these big, expensive supercomputers to stay at the forefront of AI to have the best models. If we take the techniques that Deep Seek has now shown are possible, these efficiency gains, we could have them, too. Think about how powerful our models would be if we put a billion dollars into the the same model that Deep Seek was able to make for much less. That is what the American AI companies are saying. But I think there are real questions among investors about whether the scale of investment that they have been planning is really necessary.

00:10:44

For you, Kevin, obviously, you've been covering this world for a very long time. Does that show you that more money doesn't necessarily mean more innovation in the world of AI, where more money has been a given as an assumption of what's needed? I mean, does it actually suggest that maybe smaller, as you said, scrappier startups could make huge gains in this world?

00:11:09

Yeah, I think it threw into question this fundamental assumption that only the big dogs could play in AI. You had to be Microsoft or Amazon or Google if you wanted a chance to build the state-of-the-art AI models. I think what the Deep Seek story suggested is that there may be a whole other world of competitors out there trying to stay close to the frontier and that they might not have to have the resources of one of the world's largest corporations to do it. But there was one other piece of this that I think really suggests that the AI race has entered a new phase, which is that Deep Seek did something that a lot of American companies have been hesitant to do, which is that they released their AI models as open source software, meaning that anyone on the internet can download download and use, can make their own versions of, can adapt, can tweak. It is software that can be reused and remixed and improved upon by anyone. When Deep Seq released its models this way, they really sent a message to the world that says, We are serious here about competing, and we're so serious that we're going to give away our models for free so that anyone who wants to can make them better.

00:12:29

All All of a sudden, it just flipped the entire AI race onto its head and really sent it into a new gear.

00:12:39

We'll be right back.

00:12:51

I'm Jonathan Swann. I'm a Whitehouse reporter for the New York Times. I have a pretty unsentimental view of what we do. Our job as reporters is to dig out information that powerful people don't want published, to take you into rooms that you would not otherwise have access to, to understand how some of the big decisions shaping our country are being made. Then painstakingly, to go back and check with sources, check with public documents, make sure the information is correct. This is not something you can outsource to AI. There's no robot that can go and talk to someone who is in the situation room and find out what was really said. In order to get actually original information that's not public, that requires human sources, and we actually need journalists to do that. As you may have gathered from this long rift, I'm asking you to consider subscribing to the New York Times. Independent journalism is important, and without you, we simply can't do it.

00:13:48

Kevin, it sounds like Deep Seek has already or is about to really change the landscape of AI. My question is, is that good? For people like you and me, I mean, maybe you more than me who use chat bots for consumers.

00:14:09

It's a complicated question to answer because there are ways in which it is probably it in ways in which it is probably bad. The case that this is a good thing is that in general, when you make things cheaper, they can be accessible to more people. I mean, remember, Deep Seek is not just free to use in the app or on the website. It was also released as open-source software, meaning that anyone with an internet connection can download it and install it on their own computers or maybe tweak it to serve their own purposes. If you are a person who wants to use AI, maybe you have a small business, or maybe you just want to use this to help you write letters or emails, maybe you're a student who wants to use this, you can now access a very powerful model for free. Maybe you are a developer or a startup who wants to build your own AI tools. Well, now you have this deep seek model that you can take off the shelf as open source software and build your own version of it or run it on your own hardware. The people that I talk to in the tech industry who are at startups or smaller companies are very excited about this.

00:15:18

This is a great development for them. It also means that if you are a person who worries that all the AI power is going to go to a few huge companies, then Then the democratization of AI through open-source models like Deep Seek probably makes you feel optimistic. That is the positive case for this. But there are also a lot of people who are really worried about what Deep Seek has done. I think the Deep Seek moment has really sparked a lot of new fears about how quickly this whole field of AI is progressing. Just in the last few years, the leading AI models have gone from maybe being as smart as the average high school student to as smart as a college student to now being able to complete a lot of tasks that would have taken a PhD to complete. These models are just getting much better very quickly. A lot of folks in the AI community are just nervous about that. They say things like, Well, maybe we're going to get an AI that is as smart as the smartest humans in just a few years. We don't really have a playbook for dealing with technology that is more intelligent than us.

00:16:35

There are people who worry about these runaway AI scenarios where you get super intelligent AIs that can take control or maybe even harm humans. But even if you're not a believer in that superhuman intelligence risk, there are just a lot of questions about whether we as a society are ready for advanced AI? Are we ready for the possibility that it could eliminate jobs? Are we ready for the possibility that it could really lead to a proliferation of misinformation or propaganda or even automated cyber attacks and things like that? All of that is swirling around the conversation about Deep Seq because we have just accelerated the AI race again. Now, it is not just American companies competing with each other over who is in the lead of that race. China has also stepped in, and there's a lot of fear and anxiety about what happens if we fall behind.

00:17:34

Kevin, if this really is that important of a moment in the global AI arms race, how should we expect the United States to react? I mean, does the US government just ban this thing? We saw TikTok banned because it was owned by a Chinese company. Is that the move here?

00:17:53

Well, it may well be because if you're a person who believes that TikTok is a national security threat, there's nothing about Deep Seek that is less of a threat. It is a Chinese company. It is subject to all the same laws and censorship guidelines as other Chinese software companies are. For example, if you ask Deep Seek to tell you what happened at Tiena in Square or to say something mean about Xi Jinping, the leader of China, it won't do it. I would not be surprised if in the coming weeks and months, we do see lawmakers in the US saying, Wait a minute, we passed a law to ban TikTok. Why are we not also passing a law to ban Deep Seek? I think that's one potential outcome here. But there's a key difference, which is that TikTok is not open-source software. You cannot download TikTok and create your own version of it. And so already the Deep Seek models have been downloaded and recreated all over the world by lots and lots of different people and companies. I think what the Deep Seek story suggests is that it is going to be quite challenging to contain the spread of powerful AI without some big moves.

00:19:07

Kevin, if we really are past that point of containment here, if we're off to the races, does it matter that this innovation happened in China by a Chinese company? I mean, isn't this bigger than that at this point?

00:19:22

There are people in the American tech scene who are calling this deep seek moment the Sputnik moment for the the AI race, because just as the launch of Sputnik by the Soviet Union kicked off the 20th century space race and created profound fear and anxiety among Western nations about whether they were behind their biggest political adversary when it came to technology, a lot of people are looking at this moment with Deep Seek as kicking off a new era in the AI race where we really want to stay ahead of China. There are people who say that having a lead in AI, if you are the United States, even if it's just a lead of a couple months or a couple years over your political adversaries is very important. That may be true, but people who study AI, people who look at this industry closely, who are paying attention to the trends in AI, believed that these models would become cheaper and cheaper over time, as well as becoming more powerful over time. This really fits neatly with a lot of what people had been predicting for years. Now, they might not have predicted that this moment would happen from a Chinese AI company.

00:20:40

They might not have predicted exactly what the breakthroughs would be that allowed for the models to get cheaper. But this is in keeping with the overall trend in AI that we have seen over the past few years, which is that the models keep getting better and they keep getting more efficient.

00:20:54

In a way, it follows the normal progression of any new product. At first, it's expensive, and then more and more competition leads to innovation. The thing gets cheaper. Everything becomes more democratized.

00:21:06

Yeah. That is what happened here, but just maybe a little faster than people had expected. But I think the larger point is that these systems are now improving so rapidly and in so many places all at once that I think it is only a matter of time before nearly everyone in the world has access to very powerful AI models. And I just think that world looks a lot different than the one we live in today.

00:21:37

Kevin, thank you so much.

00:21:41

Thanks for having me.

00:21:47

We'll be right back.

00:21:57

Here's what else you need to know today. On Saturday, President Trump declared tariffs of 25% on all goods from Canada and Mexico, with a partial carve-out for Canadian energy and oil exports. He also announced an additional 10% tariff on products coming in from China. The tariffs are set to take effect at 12: 01 AM Eastern Time on Tuesday and have raised concerns of an escalating trade war with America's largest trading partners. Those countries responded swiftly to Trump's announcement. The Canadian government detailed its own retaliatory 25 % tariffs on more than $100 billion worth of US goods, including everything from American made honey, tomatoes, and whiskey to refrigerators and toilets. President Claudia Sheimbam of Mexico said she would unveil her country's response on Monday, while China's Commerce Ministry promised to bring a legal case at the World Trade Organization and, quote, corresponding countermeasures. President Trump acknowledged on social media that his tariffs could cause, quote, some pain, but said it would be, quote, worth the price that must be paid. And in a new effort to tighten the immigration system, the Trump administration is ending temporary protected status for 300,000 Venezuelan migrants in the United States, according to government documents obtained by the Times.

00:23:25

The move could make those migrants vulnerable to potential deportation in the coming months. The designation of temporary protected status was meant to protect migrants fleeing dangerous situations in their home countries. But Republican critics have said it allowed migrants to stay in the United States for much longer than intended.

00:23:47

Today's episode was produced by Shannon Lynn, Claire Tennis-Sketter, Alex Stern, and Kaitlyn O'Keefe.

00:23:59

With with help from Will Reid. It was edited by Mark George and Chris Haxel, with help from Paige Cawet. Contains original music by Pat McCusker, Marion Lozano, and Dan Powell, and was engineered by Alyssa Moxley. Our theme music is by Jim Brunberg and Ben Lansberg of WNDYRLE.

00:24:27

For The Daily.

00:24:34

I'm Natalie Kittrowff. See you tomorrow.

Episode description

Financial markets went into a panic last week over an obscure Chinese tech start-up called DeepSeek. The company now threatens to upend the world of artificial intelligence and the race for who will dominate it.Kevin Roose, a tech columnist at The Times, discusses how DeepSeek caught us all off guard.Guests: Kevin Roose, a technology columnist for The New York Times and co-host of the Times tech podcast, “Hard Fork.”Background reading: DeepSeek’s model has rocked Silicon Valley and upended several fundamental assumptions about A.I. progress.Listen to “Hard Fork”: Your guide to the DeepSeek freakout.For more information on today’s episode, visit nytimes.com/thedaily. Transcripts of each episode will be made available by the next workday. Photo: Greg Baker/Agence France-Presse
Unlock full access to New York Times podcasts and explore everything from politics to pop culture. Subscribe today at nytimes.com/podcasts or on Apple Podcasts and Spotify.