back to indexSatya Nadella | BG2 w/ Bill Gurley & Brad Gerstner
Chapters
0:0 Intro
1:31 Becoming Microsoft CEO
6:42 Satya’s Memo to CEO Committee
10:42 Satya’s Advantage as a CEO
11:34 Advice for CEOs
15:1 Microsoft’s Investment in OpenAI
19:42 AI Arms Race
23:55 Legacy Search and Consumer AI
28:7 The Future of AI Agents
38:32 Near-Infinite Memory
39:47 Copilot Approach to AI Adoption
50:26 Leveraging AI within Microsoft
56:3 CapX
60:20 The Cost of Model Scaling and Inference
75:15 Open AI Conversion to Profit
78:5 Next Steps for OpenAI
79:43 Open vs. Closed and Safe AI
00:00:02.180 |
has already been created, which is open AI, in some sense. 00:00:06.600 |
It's kind of like the Google or the Microsoft 00:00:46.320 |
You took over servers and launched Azure in 2011. 00:00:53.000 |
And it was just before that that a pretty now well-known essay 00:01:01.480 |
Now, since then, you've taken Azure from $1 billion 00:01:08.440 |
The total revenues of the business are up $2.5x. 00:01:23.480 |
And as you reflect back on that over the course of last decade, 00:01:30.920 |
that you thought you could do then to unlock the value, 00:01:37.720 |
which has been just an extraordinary success? 00:01:55.800 |
it's just one continuous sort of period for me. 00:02:09.680 |
But what I felt was essentially pattern match 00:02:14.400 |
when we were successful and when we were not, 00:02:18.760 |
and do more of the former and less of the latter. 00:02:21.480 |
I mean, in some sense, it's as simple as that, 00:02:27.680 |
that was just after Windows 3.1 was launched. 00:02:40.360 |
and I was thinking of going to business school, 00:02:44.640 |
And I said, "Ah, maybe I'll go to business school." 00:02:59.360 |
when I went and saw the basically Windows NT, 00:03:05.000 |
it was not called Windows NT at that time, and x86. 00:03:09.920 |
And I said, "God, this, what's happening in the client 00:03:16.240 |
And this is a platform company and a partner company, 00:03:30.520 |
Like, for example, I mean, we recognized the browser. 00:03:34.280 |
We competed and got that browser thing eventually right. 00:03:42.640 |
We sort of felt like, wow, the big thing there was the browser 00:03:47.320 |
because it felt more like an operating system, 00:03:51.600 |
which is the organizing layer of the internet 00:04:02.680 |
Obviously, the iPhone happened, and we got the cloud right. 00:04:16.120 |
which are not coming out of because somebody else got it 00:04:21.520 |
Sometimes it's okay to fast follow, and it worked out, 00:04:26.560 |
That was one of the hardest lessons I think we've learned. 00:04:29.880 |
Do it because you have permission to do this, 00:04:35.200 |
Like, both of those matter to me, the brand permission. 00:04:37.720 |
Like, you know, Jeffrey Moore once said this to me, 00:04:40.080 |
which I say, "Hey, why don't you go do things 00:04:44.200 |
I love that, right, which is cloud was one such thing, 00:04:48.840 |
in fact, when I first remember showing up in Azure, 00:04:52.760 |
people would tell me, "Oh, it's a winner-take-all, 00:05:01.200 |
I'd compete against Oracle and IBM in the servers, 00:05:09.880 |
And all you need to do is just get into the game 00:05:15.760 |
So, in some sense, a lot of these transitions for me 00:05:21.680 |
you kind of recognize your structural position, 00:05:27.480 |
of where you have permission from those partners 00:05:49.080 |
And, you know, there are things that cultivated, 00:06:01.480 |
I would say those are the necessary conditions 00:06:05.080 |
to even have a real chance for shots on goal. 00:06:09.920 |
But I would just say getting that strategy right 00:06:13.280 |
by recognizing your structural position and permission 00:06:22.760 |
I have a couple of questions about the transition 00:06:27.600 |
I mean, there's a, I think that it's definitive 00:06:31.160 |
that you may be the best CEO hire of all time. 00:06:43.080 |
that you wrote a 10-page memo to the committee 00:07:10.720 |
I never thought that first, Bill would leave, 00:07:18.160 |
oh yeah, you know, founders are going to retire 00:07:26.280 |
So when Steve decided to retire, I forget now, 00:07:29.040 |
I think in August of 2013, there was a pretty big shock. 00:07:33.840 |
our server and tools business, as it was called, 00:07:48.360 |
And then eventually the board came around and asked, 00:07:51.880 |
and there were a lot of other candidates at that time, 00:08:06.440 |
You know, one of the terms I used in that memo, 00:08:09.560 |
which I subsequently used even in the first piece of email 00:08:15.080 |
had ambient intelligence and ubiquitous computing. 00:08:19.920 |
And I dumbed it down to mobile first, cloud first later, 00:08:22.800 |
because, you know, my PR folks came and said, 00:08:25.760 |
Nobody will understand what ambient computing is 00:08:28.360 |
and, you know, ubiquitous or other ambient intelligence 00:08:34.160 |
How do you really go where the secular shift is? 00:08:44.880 |
In fact, one of the things I've always resisted 00:08:57.200 |
to me, I've never, I don't allocate my capital 00:09:01.800 |
here is the M365 capital, here is, you know, gaming. 00:09:06.600 |
I kind of think of, hey, there's a cloud infra. 00:09:16.720 |
The other one is M365, dynamics, gaming, what have you. 00:09:20.600 |
And so in some sense, that was all in that memo 00:09:28.960 |
And one of the assumptions at that time was that this, 00:09:33.240 |
you know, we had a 98%, 99% gross margin business 00:09:53.080 |
You know, we'll sell more to small businesses. 00:09:57.560 |
We will sell more in aggregate in terms of even upsell, 00:10:05.120 |
Because, you know, we had sold a bit of exchange, 00:10:08.240 |
but if you think about it, exchange, SharePoint, Teams, 00:10:12.560 |
So that was the basic arc that I had in that memo. 00:10:22.360 |
there's CEO hires made in the world all the time 00:10:27.320 |
I mean, Intel's going through a second reboot here 00:10:38.880 |
So what did you do and what would you advise new CEOs 00:11:10.400 |
it never felt like somebody from the outside coming 00:11:19.880 |
because I was pretty much part of the culture, right? 00:11:22.800 |
I couldn't say anything that I was not part of. 00:11:40.400 |
we were all strutting around as if we were like, 00:11:56.360 |
that is the culture that you want to avoid, right? 00:12:04.600 |
there's only one thing that brings civilizations, 00:12:07.720 |
countries, and companies down, which is hubris. 00:12:14.520 |
my wife had introduced me to a book by Carol Dweck, 00:12:21.840 |
more in the context of my children's education 00:12:27.000 |
And I said, God, this thing is like the best. 00:12:29.480 |
You know, all of us are always talking about learning 00:12:38.040 |
So I attribute a lot of our success culturally to that meme, 00:12:55.240 |
You can be a better parent, a better partner, 00:12:57.600 |
a better friend, a neighbor, and a manager and a leader. 00:13:02.120 |
and the pithy way I've always characterized it is, 00:13:05.640 |
hey, go from being the know-it-alls to learn-it-alls, 00:13:11.680 |
because the day you say, I have a growth mindset, 00:13:14.080 |
means you don't have a growth mindset by definition. 00:13:16.440 |
And so it has been very, very helpful for us. 00:13:24.320 |
you got to give it time, oxygen, breathing space, 00:13:37.480 |
or even my executive staff, or what have you, 00:13:42.920 |
And I've been very, like, the other thing is, 00:13:50.440 |
pretty much for the last, now close to 11 years, 00:14:01.960 |
That ambient intelligence, ubiquitous computing, 00:14:04.640 |
and then the specific set of products/strategies. 00:14:15.120 |
I repeat it until I'm bored stiff, but I just stay on it. 00:14:22.280 |
you mentioned the phase shifts that we've been through, 00:14:25.800 |
and I've heard you say that as a large platform company, 00:14:33.480 |
is determined in that first three or four years 00:14:37.160 |
when the market position is established, Satya. 00:14:40.200 |
You know, I've heard you say, you basically, you know, 00:14:42.600 |
Microsoft was coming off of having missed search, 00:14:49.000 |
caught the last train out of town on cloud, right? 00:14:53.000 |
So as you started thinking about the next big phase shift, 00:15:02.320 |
that Google was likely ahead in AI with DeepMind. 00:15:21.840 |
because there are a couple of things there, right? 00:15:25.120 |
One is we were at it on AI for a long, long time. 00:15:30.120 |
Obviously, you know, when Bill started MSR in 1995, 00:15:37.680 |
I mean, he was always into this natural user interface. 00:15:44.640 |
There was, you know, in fact, Kaifu worked here, 00:15:52.320 |
focus on trying to crack natural user interface. 00:15:56.040 |
Language was always something that we cared about, right? 00:16:10.960 |
So we missed, I would say, even in the early 2010s, 00:16:20.240 |
at around the same time that Google doubled down 00:16:26.280 |
And so that actually bothered me quite a bit, 00:16:44.040 |
which is you could train it on one language pair, 00:16:46.320 |
and it got better on another language, right? 00:16:56.240 |
And so ever since I've been obsessed with language, 00:17:06.320 |
they were looking for, obviously, Azure credits 00:17:08.720 |
and what have you, and we gave them some credits. 00:17:12.160 |
And that time, they were more into RL and Dota 2 00:17:18.560 |
And then we stopped for, I forget even exactly what happened, 00:17:27.880 |
sort of what they wanted to do with language. 00:17:32.560 |
which they talked about transformers and natural language. 00:17:39.200 |
And it goes back a little bit to how I think, 00:18:00.840 |
The way he thought about it was you schematize the world, 00:18:03.680 |
right, take people, places, things, you know, 00:18:17.240 |
And then, you know, you'll make sense of all information. 00:18:20.600 |
And this was, it was just, it's just impossible to do. 00:18:24.360 |
And so, therefore, you needed some breakthrough. 00:18:26.680 |
And we said, maybe the way to do that is how we schematize. 00:18:30.360 |
After all, the human brain does it through language 00:18:35.000 |
And so, therefore, anyway, so that's what led me to OpenAI 00:18:46.600 |
In fact, I think the first memo, weirdly enough, 00:19:09.840 |
And then, of course, once we started seeing it work 00:19:18.920 |
- One of the things that has happened, I think, 00:19:22.520 |
in previous phase shifts is some of the incumbents 00:19:28.080 |
You even talked about Microsoft perhaps missing mobile 00:19:51.440 |
and how you think about the key players in the race. 00:19:55.760 |
Google, Amazon, Meta with Lama, Elon has entered the game. 00:20:16.240 |
Interestingly enough, now people talk about the Mag 7. 00:20:22.400 |
even to your point about everybody's awake to it. 00:20:35.600 |
because I think the company of this generation 00:20:42.640 |
It's kind of like the Google or the Microsoft 00:20:52.640 |
So therefore, I think it's going to be very competitive. 00:21:03.840 |
For example, on the hyperscale side, absolutely not, right? 00:21:10.760 |
even ex-China, multiple providers of frontier models 00:21:18.960 |
In fact, one of the best structural positions 00:21:27.720 |
the Azure structure is slightly different, right? 00:21:37.720 |
We have 60-plus regions, more regions than others. 00:21:40.120 |
So it was not like we built our cloud for one big app. 00:21:55.240 |
with nexus to data and the app server and what have you. 00:21:59.080 |
So I think there is going to be multiple winners 00:22:04.400 |
There is going to be in the models even there, 00:22:10.680 |
that each hyperscaler will have a bunch of models 00:22:16.320 |
Like every app today, even including Copilot, 00:22:20.520 |
And so there's, in fact, a complete new app server. 00:22:23.280 |
Like everyone, there was a mobile app server, 00:22:28.400 |
And for us, that's Foundry and we're building one 00:22:34.080 |
Then in apps, I think there will be more folk... 00:22:38.160 |
I would say network effects is always going to be 00:22:43.640 |
There'll be different network effects in consumer, 00:22:50.680 |
I think you have to analyze it at structurally by layer. 00:23:05.040 |
which is watch for the one who comes and adds to it, right? 00:23:13.600 |
who is the new entrepreneur will come out of the blue. 00:23:16.280 |
And at least I would say OpenAI is one such company, 00:23:22.240 |
Yeah, if we think about the app layer for a second, 00:23:26.480 |
start with consumer AI a little bit here, Satya. 00:23:38.440 |
but it's massively threatened by a new modality 00:23:53.240 |
continue to grow the legacy search businesses 00:24:02.560 |
or your consumer efforts under Mustafa need to do 00:24:17.000 |
is what you said last, which is chat meets answers. 00:24:21.680 |
And that's ChatGPT, both the brand, the product, 00:24:40.200 |
So, in fact, so that's why I was so thrilled. 00:24:44.520 |
Like I've been trying to get an Apple search deal 00:25:06.560 |
So to that point, the way I look at it and say, 00:25:09.680 |
is at the same time, distribution matters, right? 00:25:35.160 |
I mean, even now, even though I wanna go to Copa, 00:25:40.920 |
And like, if I have to think about Bing versus Copilot, 00:25:51.680 |
That shift, I think, is what's happening universally. 00:25:55.200 |
And we are away, maybe one or two of these agents 00:26:15.040 |
because the commercial intent has not migrated. 00:26:22.920 |
And so I think, yes, this is a secular shift. 00:26:28.400 |
we have three properties in Mustafa's world, right? 00:26:35.200 |
So we think, in fact, he's got a crisp vision 00:26:42.200 |
One is a feed, one is search in the traditional way, 00:26:46.240 |
and then the other is this new agent interface. 00:26:51.240 |
And they all have a social contract with content providers. 00:26:58.440 |
We need to have ad-supported models, all of those. 00:27:04.600 |
The one advantage we do still have is Windows. 00:27:14.200 |
because we had won against Netscape only to lose to Google. 00:27:17.960 |
And we are getting it back now in an interesting way, 00:27:26.080 |
Like the good news about Windows for at least is, 00:27:34.440 |
You can go do your best work and go over the top. 00:27:48.400 |
I mean, I always say Google makes more money on Windows 00:27:55.520 |
this is the best news for Microsoft shareholders 00:28:00.280 |
that we can now go contest it and win back some share. 00:28:04.320 |
- Hey Satya, one thing that everybody's talking 00:28:07.320 |
and if you just kind of think forward in your mind a bit, 00:28:11.560 |
you can imagine all kinds of players wanting to enact action 00:28:16.560 |
on other apps and other data that may be on a system. 00:28:28.120 |
but you have apps on like the iPhone ecosystem 00:28:35.600 |
and this is partially in terms of service question, 00:28:40.560 |
will Apple allow Microsoft to control other apps on iOS? 00:28:45.520 |
Will Microsoft let a chat GBT instantiate apps 00:28:55.880 |
It goes all the way to when you start thinking 00:29:00.040 |
like will booking.com let Jim and I run transactions on it 00:29:09.240 |
- Yeah, I think that this is the most interesting question. 00:29:27.680 |
how did business applications of various kinds 00:29:34.120 |
They did manage that interop using connectors 00:29:39.880 |
So there was a business model that emerged, right? 00:29:46.640 |
hey, you can access SAP data as long as you had connectors. 00:29:59.320 |
it's unclear exactly what happens in consumer 00:30:04.280 |
was a lot of advertising and traffic and what have you. 00:30:09.280 |
Some of those things go away in an agentic world. 00:30:12.480 |
So I think the business model is slightly unclear to me 00:30:20.120 |
I think what will happen is everybody will say, 00:30:22.280 |
hey, in order for you to either action into my action space 00:30:26.760 |
or to get data out of my sort of schema, so to speak, 00:30:31.640 |
there is some kind of an interface to my agent 00:30:38.880 |
like today, for example, when I go to co-pilot at Microsoft, 00:30:43.240 |
I have connectors into Adobe, into my instance of SAP, 00:30:48.240 |
obviously our instance of CRM, which is Dynamics. 00:30:57.720 |
really went to a business application, right? 00:31:03.280 |
And somebody in the org is sort of inputting data into it. 00:31:12.960 |
I can literally say, hey, I'm meeting with Bill. 00:31:15.320 |
Tell me about all the companies that Benchmark's invested in. 00:31:23.400 |
collating it all together, giving me a note, what have you. 00:31:30.040 |
can be monetized by us and by even these connectors. 00:31:35.040 |
like the thing that could happen really quickly, 00:31:38.720 |
like would you allow chat GPT on the Windows OS 00:31:55.520 |
So which is, is it the user or is it the operating system? 00:32:09.520 |
like one of my big fears is the security risk, right? 00:32:15.840 |
and that malware started sort of actioning stuff, right? 00:32:23.240 |
that we will build into the OS itself, right? 00:32:32.600 |
the user will be in control on an open platform like Windows 00:32:36.720 |
and I'm sure Apple and Google will have a lot more control 00:32:46.000 |
or, you know, depending on how AT rules on all of those, 00:32:51.000 |
you know, ultimately it'll be an interesting thing to watch. 00:33:00.400 |
or let's just call it the Android AI or the iOS AI 00:33:08.880 |
through a Microsoft client on that smartphone? 00:33:22.440 |
Which is we licensed the sync for Outlook to Apple 00:33:31.640 |
It was kind of a, it was an interesting case. 00:33:34.200 |
And I think that there was a lot of value leaked perhaps, 00:33:40.760 |
why we were able to hold on to Exchange, right? 00:33:48.840 |
And so one of the things I think is going to your point, 00:33:57.320 |
is we have to have a trust system around Microsoft 365. 00:34:12.080 |
The IT folks in the customer will have to permit it. 00:34:26.480 |
It's kind of like what Apple Intelligence is doing. 00:34:43.880 |
Mustafa said that 2025 will be the year of infinite memory. 00:35:01.640 |
combined with being able to take some actions on our behalf. 00:35:19.200 |
book me the four seasons in Seattle next Tuesday 00:35:24.080 |
And Bill and I have gone back and forth on this one 00:35:34.800 |
is it, you know, does that seem like a hard one 00:35:38.400 |
- Yeah, I mean, the most open-ended action space 00:35:49.120 |
or maybe three things that are really exciting. 00:35:51.800 |
Beyond I'll just say, I'm sure we'll talk about it, 00:36:05.360 |
And the other one I would say is even entitlements, right? 00:36:09.880 |
Which is, you know, what can you like, you know, 00:36:11.720 |
one of the most interesting products we have even 00:36:15.960 |
because increasingly, what do you have permissions to? 00:36:20.560 |
You know, you have to be able to access things 00:36:23.400 |
Somebody needs to have governance on it and what have you. 00:36:25.800 |
So if you put all those three things together 00:36:28.480 |
and this agent is going to then be more governable 00:36:39.000 |
then I think you're off to a very different place 00:36:42.800 |
where for doing more autonomous work, so to speak. 00:36:46.360 |
I still think, one of the things I always think 00:36:48.560 |
is you're built, I like this co-pilot as the UI for AI 00:37:05.000 |
In fact, that's kind of why we think of co-pilot 00:37:15.120 |
I don't think the models, I take even 4.0, right? 00:37:22.560 |
So you can do in the enterprise setting significant more, 00:37:26.280 |
more so than consumer because consumer web function calling 00:37:30.480 |
is just hard, where at least in an open-ended web, 00:37:37.720 |
But once you say, hey, let's go do a book me a ticket 00:37:40.520 |
on anything and it just, and if there's schema changes 00:37:45.600 |
You can teach it that, that's where I think '01 00:37:48.200 |
can get better if it's a verifiable, auto-gradable 00:37:54.360 |
But I think we are maybe a year, year to two years away 00:37:57.920 |
from doing more and more, but I think at least 00:38:01.160 |
from an enterprise perspective, going and doing, 00:38:04.760 |
here's my sales agent, here's my marketing agent, 00:38:11.520 |
We built 10 or 15 of them into dynamics, right? 00:38:14.440 |
Even looking into sort of my supplier communications 00:38:18.640 |
and automatically handling my supplier communications, 00:38:22.400 |
updating my databases, changing my inventories, my apply. 00:38:26.000 |
Those are the kinds of things that you can do today, 00:38:29.400 |
- Mustafa made this comment about near-infinite memory 00:38:32.480 |
and I'm sure you heard it or hear it internally. 00:38:35.800 |
Is there any clarification you can offer about that 00:38:42.640 |
the idea that you have essentially a type system 00:38:58.960 |
He made it sound like you guys had an internal 00:39:04.080 |
- Yeah, I mean, there's an open source project even. 00:39:08.040 |
I think it's, I forget, it's the same set of folks 00:39:11.320 |
who did all the TypeScript stuff who are working on this. 00:39:15.960 |
So what we're trying to do is essentially take memory 00:39:19.440 |
and schematize it and sort of make it available 00:39:24.760 |
Like each time I start, let's just imagine I'm prompting 00:39:35.480 |
I think is a good way for us to build up a memory system. 00:39:43.200 |
you know, the Microsoft AI business has already reported 00:39:50.680 |
and that you're not actually renting raw GPUs to others 00:39:54.560 |
to train on because your inference demand is so high. 00:39:58.400 |
So as we think about this, there's a lot of, I think, 00:40:01.040 |
skepticism out there in the world as to whether or not 00:40:06.520 |
And so if you think about the key revenue products 00:40:10.320 |
that people are using today and how it's driving 00:40:18.040 |
from Amazon or Google, I'd be interested in that. 00:40:21.840 |
- Yeah, I think that's a good, so the way for us 00:40:25.040 |
this thing has played out is, you've got to remember 00:40:33.480 |
So it's sort of not in our quarterly results, 00:40:36.840 |
it's more in the other income based on our investment. 00:40:41.320 |
So that means the only thing that shows up in revenue- 00:40:52.920 |
And so most of the revenue or all the revenue 00:40:58.800 |
is pretty much our API business, or in fact, to your point, 00:41:08.680 |
And so the fact is the big hit apps of this era are what? 00:41:18.080 |
and the APIs of OpenAI and Azure OpenAI, right? 00:41:28.920 |
these would probably be in the four or five of them. 00:41:35.880 |
The advantage we have had and OpenAI has had, 00:41:39.840 |
which is we've had two years of runway, right? 00:41:47.480 |
hey, everybody's awake, but, and it might be, 00:41:55.480 |
You say that and somebody else drops some sample 00:42:10.720 |
That was the great advantage we've had with OpenAI. 00:42:13.560 |
OpenAI was able to really build out this escape velocity 00:42:34.280 |
So suddenly we got access to many, many more logos 00:42:42.320 |
who are using Azure in some shape or fashion and so on. 00:42:47.440 |
And when it comes to the traditional enterprise, 00:42:55.880 |
and then are building agents on the other end using Foundry. 00:42:59.800 |
But like these things are design wins and project wins 00:43:02.560 |
and they're slow, but they're starting to scale. 00:43:05.240 |
And again, the fact that we've had two years of runway on it, 00:43:12.840 |
the adverse selection problems here would have been 00:43:15.400 |
lots of tech startups all looking for their H100 allocation 00:43:37.760 |
In fact, even in the, I think the investor side, 00:43:41.520 |
which is now people are wanting to be more capital light 00:43:51.440 |
will not want to, you know, want to look for it more. 00:43:54.160 |
So that's kind of what we've been selective on. 00:43:58.920 |
that training of those models and those model clusters 00:44:02.600 |
was a much bigger part of their AI revenue versus yours. 00:44:33.040 |
of any of these AI products, there is ChatGPT. 00:44:38.280 |
- And then there is, you know, like even Gemini, 00:44:44.640 |
I mean, obviously I think it will grow, you know, 00:45:04.320 |
I think those are the four, I would say, in a Dow. 00:45:07.280 |
Like, is there anything else that comes to your mind? 00:45:13.440 |
that I think are starting to get some traction, 00:45:16.160 |
A lot of them build on top of Lama, but, you know. 00:45:23.960 |
what are the apps that have more than 5 million Dow, right? 00:45:36.600 |
in terms of the non-affiliated apps, you named them. 00:45:40.240 |
- And Zack's stuff all runs on his own cloud. 00:45:48.520 |
obviously the coding space is off into the races, 00:45:54.360 |
and there's a lot of venture-backed players there. 00:45:59.400 |
I have a question about the Copilot approach. 00:46:01.960 |
And I guess Mark Benioff's been kind of obnoxiously critical 00:46:06.600 |
on this front and called it Clippy 2 or whatever. 00:46:23.440 |
isn't necessary to know if you did an AI first product? 00:46:35.200 |
that may be able to be obfuscated for the user. 00:46:38.800 |
- Yeah, I mean, it's a very, very, very important question. 00:46:46.440 |
so let me just speak of our own dynamics thing. 00:46:51.760 |
I think the notion that business applications exist, 00:46:58.920 |
that's probably where they'll all collapse, right? 00:47:10.600 |
The business logic is all going to these agents. 00:47:16.560 |
And these agents are going to be multi-repo CRUD, right? 00:47:27.760 |
and all the logic will be in the AI tier, so to speak. 00:47:40.360 |
then people will start replacing the backends, right? 00:47:42.520 |
We have people, that's what, in fact, it's interesting. 00:47:45.840 |
As we speak, I think we are seeing pretty high rates 00:47:49.520 |
of wins on dynamics, backends, and the agent use. 00:48:03.560 |
the other fascinating thing that's increasing 00:48:07.120 |
but even what we call finance and operations. 00:48:10.480 |
Because people want more AI native biz apps, right? 00:48:14.760 |
That means the logic tier can be orchestrated 00:48:24.520 |
to my business application should be very seamless. 00:48:29.520 |
Now, in the same way, and you could even say, 00:48:35.160 |
one of the most exciting things for me is Excel with Python 00:48:45.360 |
like this, by the way, it would be fun for you guys, right? 00:48:54.000 |
oh, you know, it is like having a data analyst. 00:49:06.720 |
It will literally, like how GitHub co-pilot workspace 00:49:09.320 |
creates the plan, and then it executes the plan. 00:49:12.600 |
This is like a data analyst who is using Excel 00:49:26.960 |
with all of its action space, because it can generate, 00:49:31.960 |
That is, in fact, a great way to reconceptualize Excel. 00:49:40.880 |
After all, there's a code interpreter, right? 00:49:45.120 |
And so, yes, I think there will be disruption, 00:49:49.080 |
at least our M365 stuff is one is build co-pilot 00:49:59.200 |
You can say the Excel is an agent to my co-pilot. 00:50:15.280 |
Go into Excel and have the co-pilot go with it. 00:50:25.320 |
wringing their hands about a lot today, Satya, 00:50:27.400 |
is the ROI people are making on these investments. 00:50:34.760 |
Are you leveraging AI to increase productivity, 00:50:38.240 |
reduce costs, drive revenues in your own business? 00:50:41.440 |
If so, kind of what are the biggest examples there? 00:50:44.560 |
You know, and maybe to a finer point on that, 00:50:47.520 |
you know, when we had Jensen on, I asked him, 00:50:52.840 |
what did he expect his headcount to increase by? 00:50:59.880 |
"Well, I'll have 100,000 agents helping us do the work." 00:51:07.240 |
you know, do you expect to see that similar type 00:51:26.120 |
I love this thing of, I've been going to school 00:51:40.840 |
in how the good industrials can literally say, 00:51:43.840 |
hey, I'll add 2 to 3, you know, 100 basis points 00:51:50.760 |
which is increased value, reduced waste, right? 00:51:55.280 |
So I think of AI as the lean for knowledge work. 00:52:00.280 |
You know, we are really going to school on it, 00:52:08.920 |
we had all this business process re-engineering. 00:52:13.560 |
where people who can think end-to-end process flows 00:52:33.760 |
This is everything from Xbox support to Azure support. 00:52:38.280 |
This is really, I mean, this is serious one year 00:52:41.840 |
because of the deflection rate on the front end. 00:52:44.480 |
Then the biggest benefit is the agent efficiency, right? 00:52:48.960 |
Where the agent is happier, the customer is happier, 00:52:54.520 |
And so that's, I think, the most obvious place 00:52:57.760 |
and that we have in our contact center application 00:53:05.480 |
That's the other, and with GitHub Copilot Workspace, 00:53:11.400 |
what is agentic sort of side comes in, right? 00:53:14.080 |
You go from an issue to a plan or to a spec to a plan 00:53:27.840 |
As I said, and then the O365 is the catch-all, right? 00:53:36.360 |
I mean, just to give you a feel, like even my own, right? 00:53:40.400 |
I would say the workflow of the prep of the CEO office 00:53:48.280 |
Basically, I mean, in fact, one of the ways I look at it is 00:54:04.720 |
"Hey, I'm just gonna put an Excel spreadsheet in email 00:54:07.120 |
"and send it around and people will enter numbers 00:54:11.760 |
The same thing is happening in the AI era right now 00:54:22.280 |
"Tell me everything I need to know about the customer." 00:54:24.440 |
It tells me everything from my CRM, my emails, 00:54:35.600 |
So just imagine the hierarchy, this entire thing of, 00:54:38.320 |
"Oh, let me prepare a brief for the CEO," goes away. 00:54:44.400 |
If they wanna annotate it, so I'm reasoning with AI 00:54:54.080 |
Somebody gave me this example from supply chain. 00:54:56.760 |
Like somebody said, "Supply chain is like a trading desk, 00:54:59.960 |
"except it doesn't have real-time information," right? 00:55:06.680 |
and then the CFO comes and bangs you on the head 00:55:17.520 |
and giving you like, "Oh, you're doing this contract 00:55:35.120 |
our goal is to kind of create operating leverage through AI. 00:56:01.760 |
just around what we're seeing out of model scaling 00:56:07.160 |
You know, I've heard you talk about, you know, 00:56:21.000 |
these companies look more like industrial company CapEx 00:56:27.200 |
Your CapEx come from about 20 billion in 2020 00:56:35.320 |
You know, you've earned a pretty consistent return 00:56:45.640 |
Some people are worried that that correlation will break 00:56:50.280 |
maybe at some point there's going to be, you know, 00:56:53.360 |
CapEx is going to have to be spent ahead of the revenue. 00:57:08.120 |
And when does it begin to taper off, you know, 00:57:14.320 |
Yeah, I mean, a couple of different things, right? 00:57:26.120 |
we've been practicing this for a long time, right? 00:57:40.760 |
you know how to sort of drive utilization up. 00:57:44.760 |
And the good news here is it's kind of like capital intensive 00:57:56.320 |
That's kind of like when people even in the early days said, 00:57:59.560 |
hey, how can like a hyperscaler ever make money? 00:58:02.600 |
Because what's the difference between old holsters 00:58:08.720 |
And that I think is what's going to apply even 00:58:16.200 |
Which is, hey, you buy leading, you build it out. 00:58:18.640 |
In fact, one of the things that's happening right now 00:58:42.160 |
and a model that runs on an AI accelerator, right? 00:58:49.240 |
you suddenly add to build up these AI accelerators 00:58:57.800 |
So the first thing is the build out will happen, 00:59:00.560 |
the workloads will normalize and then it will be, 00:59:03.320 |
you will just keep growing like the cloud has grown. 00:59:15.560 |
everybody's sort of building only hoping demand will come, 00:59:18.960 |
just making sure that there is real diverse demand 00:59:27.960 |
So I think that that's, I think the way to manage the ROIC. 00:59:31.360 |
And by the way, the margins will be different, right? 00:59:33.280 |
This goes back to the very early dialogue we had on, 00:59:44.560 |
or foundry plus GPU or GitHub copilot add on to M365. 00:59:56.160 |
And so if you're having a portfolio matters here, right? 01:00:01.360 |
why does Microsoft have a premium today in the cloud? 01:00:04.520 |
We are bigger than Amazon, growing faster than Amazon 01:00:13.840 |
And that's kind of what we wanna do even in the AI era. 01:00:16.440 |
- Satya, there's been a lot of talk about model scaling. 01:00:40.800 |
where they kind of flipped everything on their head 01:00:43.080 |
and they said, well, if we're not doing that anymore, 01:00:45.320 |
it's way better because we can just move on to inference, 01:00:52.040 |
I'm curious, those are two kind of views of the same coin, 01:00:55.680 |
but what's your view on large LLM model scaling 01:01:00.560 |
and training costs and where we're headed in the future? 01:01:04.040 |
- Yeah, I mean, I'm a big believer in scaling laws, 01:01:12.640 |
In fact, if anything, the bet we placed in 2019 01:01:18.720 |
which is in other words, don't bet against scaling laws. 01:01:23.800 |
let's also be grounded on a couple of different things. 01:01:38.440 |
of doing large scale training becomes harder. 01:01:48.400 |
and I'll let the OpenAI folks speak for what they're doing, 01:01:54.000 |
pre-training I think is not over, it sort of continues. 01:02:00.520 |
OpenAI has talked about and Sam has talked about 01:02:14.920 |
inference time compute as another scaling law, right? 01:02:20.040 |
and then you have effectively this test time sampling 01:02:28.960 |
that then are running on your inference, right? 01:02:36.320 |
So the good news of test time or inference time compute 01:02:49.400 |
when you're using it to generate tokens for training, 01:03:01.040 |
And so therefore there is more of an economic model, right? 01:03:07.880 |
with 60 plus data centers all over the world. 01:03:10.160 |
- Right, it's a different hardware architecture 01:03:15.800 |
- Exactly, and I think the best way to think about it 01:03:19.880 |
So going back to sort of Brad's thing about ROIC, 01:03:32.040 |
which is look, you kind of wanna buy some every year, 01:03:37.720 |
When you depreciate something over six years, 01:03:44.400 |
and you age it, you age it, you age it, right? 01:03:48.360 |
and then the next year it makes it, goes into inference. 01:04:02.600 |
And like basically to your point about everybody saying, 01:04:08.080 |
One of the other things is the economic realities 01:04:12.600 |
I mean, at some point everybody will look and say, 01:04:14.960 |
what's the economically rational thing to do? 01:04:19.280 |
- Which is, hey, even if I double every year's capability, 01:04:24.560 |
And the other problem is the winner's curse, right? 01:04:26.720 |
Which is, you don't even have to publish a paper. 01:04:30.800 |
The other folks have to just look at your capability 01:04:39.600 |
I mean, you can sort of all kinds of terms of use, 01:04:45.560 |
Second thing is, you don't even have to do anything. 01:04:48.400 |
You just have to reverse engineer that capability 01:04:51.160 |
and you do it in a more compute efficient way. 01:05:00.120 |
Right now, a little bit of everybody wants to be first. 01:05:05.040 |
all the economic reality will set in on everyone. 01:05:07.720 |
And the network effects are in the app layer. 01:05:10.720 |
So why would I want to spend a lot on some model capability 01:05:14.360 |
if the network effects are all on the app layer? 01:05:30.400 |
and then he kind of joked about a million, but I-- 01:05:32.800 |
- I think he joked about a billion, but you know, 01:05:41.280 |
based on what you've seen around pre-training and scaling, 01:05:45.200 |
have you changed your infrastructure plans around that? 01:05:50.200 |
And then I have a separate question with regard to '01. 01:06:02.160 |
Which is, hey, how do you, we can argue the duration, 01:06:12.400 |
And this is where I think a little bit of disciplined way 01:06:16.080 |
of thinking about how do you clear your inventory 01:06:22.640 |
Which is, or the other way is the depreciation cycle 01:06:29.200 |
you can, unless you find the physics of the GPU works out, 01:06:39.440 |
it's in the same or better margin than a hyperscaler. 01:06:44.120 |
I'm gonna keep going and building basically to, 01:06:57.640 |
I absolutely, and Sam may have a different objective 01:07:02.720 |
He's sort of like, he may say, hey, I wanna build 01:07:05.880 |
because I know I'm deeply, have deep conviction 01:07:11.400 |
So therefore that's where I think a little bit 01:07:15.120 |
- And to clarify something, I heard Mustafa say 01:07:18.080 |
on a podcast that Microsoft is not going to engage 01:07:22.880 |
in the biggest model training competition that's going on. 01:07:28.120 |
- Well, what we won't do is do it twice, right? 01:07:38.840 |
given the partnership with OpenAI to do two unnecessary, 01:07:47.160 |
So we are very, and that's why we have concerted. 01:07:49.560 |
And by the way, that's the strategic discipline 01:08:01.240 |
And we did it because we had all the rights to the IP. 01:08:09.520 |
And so then what Mustafa is basically saying is, 01:08:13.360 |
hey, we will also do, in fact, a lot of focus 01:08:17.760 |
and even on the verification or what have you. 01:08:22.160 |
So we'll focus a lot of our compute resources 01:08:30.200 |
while also having a principled pre-training stuff 01:08:33.400 |
that sort of gives us capability internally to do things. 01:08:43.200 |
that we will continue to go ahead and develop as well. 01:08:45.880 |
- Does your question to Brad's question about, 01:08:55.440 |
as to why you've outsourced some of the infrastructure 01:08:58.960 |
to CoreWeave in that partnership that you have? 01:09:06.880 |
with the hit called ChatGPT and OpenAI API, right? 01:09:11.360 |
Yeah, we were completely, I mean like it was impossible. 01:09:15.360 |
There's no supply chain planning I could have done in, 01:09:20.360 |
what is it, like none of us knew what was gonna happen, 01:09:28.480 |
like that was just a bolt from the blue, right? 01:09:32.920 |
So we said, hey, we're not going to in fact worry 01:09:37.440 |
So that's why, whether it's CoreWeave or many others, 01:09:51.760 |
So that was just more about trying to get caught up 01:10:08.360 |
What we have told the street is that's why we are optimistic 01:10:16.240 |
And then after that, I think we'll be in better shape 01:10:24.120 |
- So I'm hearing with respect to this level two thinking, 01:10:44.000 |
you're recycling those tokens back into the context window 01:10:55.480 |
the inference was going to a million or a billion X, 01:11:11.360 |
- Yeah, I mean, I think there are two things there, Brad, 01:11:19.440 |
to think about the full workload there, the full workload. 01:11:26.080 |
One of the fastest growing things of, in fact, 01:11:31.200 |
because after all these agents need a scratch pad 01:11:39.960 |
And so that is like where they run a code interpreter. 01:11:42.760 |
And that, by the way, is a regular Azure Kubernetes cluster. 01:11:52.400 |
and its nexus to the GPU and then some data service. 01:12:02.960 |
people think about AI as separate from the cloud. 01:12:09.520 |
And I think in a world where every AI application 01:12:13.680 |
is a stateful application, it's an agentic application, 01:12:20.560 |
then classic app server plus the AI app server 01:12:30.000 |
which is, hey, we built this 60 plus AI regions. 01:12:32.600 |
I mean, Azure regions, they all will be ready 01:12:47.920 |
we've talked around OpenAI a lot during this conversation, 01:12:54.920 |
between a huge investment there and your own efforts. 01:13:02.880 |
highlighting the differences between Azure OpenAI 01:13:09.680 |
And a lot of those were about the enterprise grade, 01:13:14.400 |
you know, things that you bring to the table. 01:13:42.120 |
given OpenAI is a very at-scale company, right? 01:13:46.600 |
So it's no longer, it's a really very successful company 01:14:00.320 |
like I would with any other big partner, right? 01:14:23.760 |
where we are very deeply interested in each other's success. 01:14:28.760 |
The third is, I think of them as a big customer. 01:14:39.520 |
And so, and then the last one is the cooperation, right? 01:14:43.440 |
Which is, whether it's co-pilot in the consumer space, 01:14:48.440 |
whether it's co-pilot with M365 or whatever else, 01:14:53.440 |
we sort of say, hey, where is the competition? 01:14:57.280 |
Where is, and that's where I kind of look at it and say, 01:15:00.960 |
ultimately these things will have some overlap, 01:15:06.640 |
is in some sense for the MSFT shareholder accretive, right? 01:15:14.520 |
like to your point about the API differences, 01:15:19.560 |
or like some of the, there's differences, right? 01:15:28.800 |
then it's easiest to have an Azure and Azure Mac. 01:15:31.240 |
But if you're on AWS and you want to just use 01:15:39.600 |
there's sometimes having these two types of distributions 01:15:46.720 |
- Satya, I would say the kind of curious part 01:15:52.200 |
of the Silicon Valley community and even writ larger, 01:16:04.720 |
and Andrew Sorkin pushed Sam really hard on this. 01:16:17.120 |
I guess Elon's launched a missive in there as well. 01:16:37.200 |
We, let me, I'd say the one thing that we care deeply about 01:16:48.880 |
that is an iconic company of this platform shift 01:16:53.080 |
and the world is better with open AI doing well. 01:16:56.400 |
And so therefore that's sort of the fundamental position. 01:17:14.440 |
with great amount of sort of vision and ambition 01:17:18.200 |
and the pace with which he wants to move, you know, 01:17:22.280 |
and so therefore we have to balance that all out, right? 01:17:31.080 |
what he does to do and he needs to accommodate 01:17:36.440 |
given, you know, the overall constraints we may have. 01:17:48.240 |
I mean, this five years has been great for them. 01:18:01.720 |
- When you think about, you know, the separate funding, 01:18:13.800 |
I've talked about thinking that the next step for them, 01:18:17.000 |
you know, it'd be great to have them as a public company. 01:18:20.200 |
You know, it's such an iconic, you know, business, 01:18:33.560 |
Or do you think that it stays kind of in the relationship 01:18:51.160 |
it's their board and their management decision. 01:19:12.920 |
We want to make sure we protect our interests 01:19:20.200 |
But I think, you know, at this point, you know, 01:19:23.320 |
people like Sarah and Brad and Sam are, you know, 01:19:38.840 |
But I want to wrap on this topic of open versus closed, 01:19:48.120 |
And so maybe I'll just leave it open-ended to you. 01:19:59.080 |
And one anecdote I would just throw out there 01:20:03.400 |
that Chinese researchers developed an AI model 01:20:06.440 |
for potential military use on the back of MetaLlama, right? 01:20:21.720 |
So we are going to see some of these put to uses 01:20:32.440 |
and as a collection of companies to usher in safe AI? 01:20:39.160 |
I think that I always have thought of open source 01:20:49.800 |
I've never thought of them as just religious battles. 01:20:57.800 |
that's why I think what Meta and Mark are doing 01:21:04.440 |
he's trying to commoditize even his compliment, right? 01:21:09.000 |
If I were in his shoes, I would do that, right? 01:21:13.480 |
I mean, I think he talks openly and very eloquently 01:21:24.360 |
sometimes going back to some of your economics question, 01:21:31.320 |
theoretically, a consortium could be a superior model, 01:21:36.680 |
quite frankly, than any one player trying to do it. 01:21:48.760 |
Which is if, I always say Linux wouldn't have happened, 01:21:53.560 |
in fact, Microsoft's one of the largest committers to Linux. 01:21:57.320 |
And so was IBM, so was Oracle and what have you. 01:22:01.080 |
And I think that there may be a real place for, 01:22:03.880 |
and open source is a beautiful mechanism for that, right? 01:22:13.000 |
Then closed source may make sense in closed source. 01:22:15.320 |
After all, we have had lots of closed source products. 01:22:17.640 |
Then safety is an important but orthogonal issue 01:22:26.760 |
And, you know, one could make arguments that, 01:22:36.200 |
So I think of these as perhaps best dealt with 01:22:45.400 |
and different companies will choose different paths. 01:22:52.680 |
I think at tech, you know, now there's no chance of saying, 01:23:12.760 |
if there is sort of national security leakage or challenges, 01:23:17.880 |
So therefore, I think states and state policy 01:23:21.160 |
will have a lot to say about which of these models 01:23:25.400 |
and what the regulatory regime will look like. 01:23:27.960 |
- Well, it's hard to believe that we're only 22 months 01:23:36.920 |
but, you know, it's interesting when I reflect back 01:23:40.680 |
on your, you know, framework around phase shifts, 01:23:44.040 |
you have to put Microsoft in a really good position 01:23:50.840 |
And so congrats on the run over the last 10 years. 01:23:53.800 |
It's been really, you know, a sight to behold, 01:24:00.120 |
when we see the leadership, you, Elon, Mark, Sundar, et cetera, 01:24:06.440 |
you know, really forging ahead for Team America around AI. 01:24:20.440 |
- Yeah, I can't thank you enough for the time, Satya.