back to indexMark Zuckerberg: First Interview in the Metaverse | Lex Fridman Podcast #398
Chapters
0:0 Introduction
0:52 Metaverse
15:27 Quest 3
30:16 Nature of reality
34:54 AI in the Metaverse
51:51 Large language models
57:49 Future of humanity
00:00:00.000 |
The following is a conversation with Mark Zuckerberg 00:00:05.720 |
Mark and I are hundreds of miles apart from each other 00:00:08.600 |
in physical space, but it feels like we're in the same room 00:00:13.960 |
as photorealistic Kodak avatars in 3D with spatial audio. 00:00:21.760 |
And I think it's the future of how human beings connect 00:00:24.440 |
to each other in a deeply meaningful way on the internet. 00:00:28.460 |
These avatars can capture many of the nuances 00:00:34.320 |
we humans use to communicate emotion to each other. 00:00:38.200 |
Now I just need to work on upgrading my emotion 00:00:42.200 |
expressing capabilities of the underlying human. 00:00:48.840 |
And now dear friends, here's Mark Zuckerberg. 00:01:17.860 |
- And it doesn't feel awkward to be really close to you. 00:01:43.260 |
We're shrouded by darkness with ultra realistic face 00:01:47.780 |
and it just feels like we're in the same room. 00:01:51.580 |
This is really the most incredible thing I've ever seen. 00:01:58.380 |
- Yeah, no, I was commenting to the team before 00:02:04.720 |
from further distances than it feels like we are right now. 00:02:30.460 |
You're there wearing a headset and I'm wearing a headset. 00:02:36.420 |
So what, can you describe what it takes currently 00:02:40.740 |
for us to appear so photorealistic to each other? 00:02:46.460 |
we both did these scans for this research project 00:03:00.740 |
and instead of actually transmitting a video, 00:03:03.140 |
what it does is we've sort of scanned ourselves 00:03:09.460 |
and we've built a computer model of sort of each 00:03:13.900 |
of our faces and bodies and the different expressions 00:03:21.460 |
that then when you have the headset on your head, 00:03:24.460 |
it can, it sees your face, it sees your expression 00:03:32.580 |
of what you're supposed to look like over the wire. 00:03:37.260 |
it's also actually much more bandwidth efficient 00:03:47.700 |
- And it captures everything, like the flaws. 00:03:51.020 |
Like to me, the subtleties of the human face, 00:03:54.260 |
like even the flaws, that's like, that's all amazing. 00:03:57.500 |
It makes you, it makes it so much more immersive. 00:04:00.500 |
It makes you realize that like perfection isn't the thing 00:04:15.380 |
And just the different, like the corners of the eyes, 00:04:23.900 |
that most of communication, even when people are speaking, 00:04:28.140 |
is not actually the words that they're saying, right? 00:04:32.420 |
So, and we try to capture that with the kind of classical, 00:04:39.380 |
That's the kind of more cartoon designed one. 00:04:41.340 |
You can kind of put those kinds of expressions 00:04:51.100 |
I don't know, I just think it's really magical. 00:04:54.580 |
of what the vision around virtual and augmented reality is, 00:05:01.700 |
no matter where you actually are in the world. 00:05:09.100 |
completely different states, halfway across the country, 00:05:13.260 |
it looks like you're just sitting right in front of me. 00:05:16.980 |
- Yeah, yeah, I can't, I'm almost getting emotional. 00:05:24.540 |
Like for me to have this kind of conversation 00:05:26.380 |
with loved ones, it would just change everything. 00:05:30.300 |
Maybe just to elaborate, so I went to Pittsburgh 00:05:33.820 |
and went through the whole scanning procedure, 00:05:47.540 |
in terms of making this more accessible to people? 00:05:50.180 |
- You know, it starts off with a small number of people 00:05:57.220 |
Which is, that's the version that you did and that I did. 00:06:00.220 |
And, you know, before there were a lot of people 00:06:05.260 |
we probably need to kind of over collect expressions 00:06:14.180 |
we can reduce that down to a really streamlined process 00:06:17.980 |
and extrapolate from the scans that have already been done. 00:06:21.780 |
But, you know, the goal, and we have a project 00:06:25.580 |
is just to do a very quick scan with your cell phone, 00:06:34.140 |
say a few sentences, make a bunch of expressions, 00:06:41.820 |
and then produce something that's of the quality 00:06:44.700 |
So I think that that's one of the big challenges 00:06:47.140 |
that remains, and right now we have the ability 00:06:49.180 |
to do the scans if you have hours to sit for one. 00:07:03.580 |
But the production of these scans in a very efficient way 00:07:08.580 |
is one of the last pieces that we still need to really nail. 00:07:13.420 |
And then obviously there's all the experiences around it. 00:07:15.220 |
I mean, right now we're kind of sitting in a dark room, 00:07:21.140 |
But I think part of the vision for this over time 00:07:24.220 |
is not just having this be like a video call. 00:07:37.980 |
that is different from what you can do on a phone 00:07:39.500 |
is like doing stuff where you're physically there together 00:07:46.580 |
we could have meetings like this in the future. 00:07:50.220 |
Once you get mixed reality and augmented reality, 00:07:56.620 |
and go into a meeting and have some people physically there 00:07:58.780 |
and have some people show up in this photorealistic form, 00:08:05.300 |
I think that stuff like that is gonna be super powerful. 00:08:07.660 |
So we gotta still build out all those kind of applications 00:08:11.700 |
But I don't know, I think it's gonna be a pretty wild 00:08:16.260 |
- I mean, I'm actually almost at a loss of words. 00:08:28.220 |
Like there is that, I guess there's an uncanny valley 00:08:36.860 |
- Yeah, I mean, I think there's still a bunch of tuning 00:08:41.740 |
where different people emote to different extents, right? 00:08:56.220 |
And I think getting that to be tuned on a per person basis 00:09:09.820 |
Some people might prefer a version of themselves 00:09:21.420 |
for having like a relatively stiff expression. 00:09:26.420 |
But, you know, I mean, I might feel pretty happy 00:09:34.740 |
I'd wanna have my avatar really be able to better express 00:09:38.460 |
like how I'm feeling than how I can do physically. 00:09:50.860 |
And I mean, I think the initial version of this 00:09:54.580 |
And like you said, I do think we're kind of beyond 00:10:10.580 |
But I think that actually makes it more difficult. 00:10:14.580 |
Like the amazing thing here is that the subtleties 00:10:20.260 |
people say I'm monotone and emotionless, but I'm not. 00:10:22.700 |
It's just this, maybe my expression of emotion 00:10:40.620 |
just 'cause I've never done one of these before. 00:10:41.940 |
I've never done a podcast as one of these Kodak avatars. 00:10:44.940 |
And I'm curious to see what people think of it 00:10:54.220 |
is it tends to feel a lot more profound when you're in it 00:11:00.260 |
So I think that this one, because it's photorealistic, 00:11:03.180 |
may look kind of as amazing in 2D for people watching it 00:11:13.900 |
just it's like you feel the sense of immersion 00:11:15.820 |
when you're in it that doesn't quite translate 00:11:18.660 |
But I don't know, I'm curious to see what people think. 00:11:21.660 |
- Yeah, I'm curious to see if people could see that, 00:11:34.620 |
There's been, I don't do remote podcasts for this reason. 00:11:41.780 |
This feels like just an incredible transition 00:11:45.020 |
to something else, to the different kind of communication. 00:11:47.760 |
Breaks all barriers, like geographic, physical barriers. 00:11:51.000 |
You mentioned, do you have a sense of timeline 00:11:54.840 |
in terms of how many difficult things have to be solved 00:12:02.580 |
- Yeah, I mean, I think we'll probably roll this out 00:12:10.100 |
We want to get more people scanned and into the system. 00:12:19.620 |
I think that for a lot of the work style things, 00:12:22.420 |
productivity, I think that this is going to make 00:12:25.140 |
In a lot of game environments, I mean, this could be fine, 00:12:27.460 |
but games tend to have their own style, right? 00:12:36.340 |
and one of the things that we get a lot of feedback 00:12:38.340 |
on work rooms, where people are pretty blown away 00:12:41.540 |
by the experience and this feeling that you can like 00:12:44.180 |
be remote, but feel like you're physically there 00:12:50.660 |
that people have a hard time with the fact that 00:12:52.660 |
the avatars are so expressive and don't feel, 00:13:00.180 |
could make a very big difference for those remote meetings. 00:13:12.580 |
or objects and overlaying them on the physical world. 00:13:15.900 |
I think the ability to do kind of remote meetings 00:13:23.780 |
I mean, I think that that's going to be very exciting. 00:13:25.380 |
So yeah, rolling it out over the next few years. 00:13:28.060 |
It's not ready to be like a kind of mainstream product yet, 00:13:34.060 |
keep getting more scans in there and keep, you know, 00:13:36.180 |
and kind of rolling it out into more of the features. 00:13:37.900 |
But yeah, I mean, definitely in the next few years, 00:13:41.940 |
you'll be seeing a bunch more experiences like this. 00:13:44.740 |
- Yeah, I would love to see some celebrities scanned 00:13:55.500 |
on my mind is blown, I'm literally at a loss for words. 00:14:03.820 |
How like, how I feel the emotion, how I feel the presence, 00:14:07.180 |
how I feel like the subtleties of the emotion 00:14:09.860 |
in terms of like work meetings or any kind of, 00:14:12.420 |
in terms of podcasts, this is like, this is awesome. 00:14:22.420 |
And part of the question is also, so you have the scan, 00:14:25.800 |
then it takes a certain amount of compute to go drive that, 00:14:29.340 |
both for the sensors on the headset and then rendering it. 00:14:33.580 |
So one of the things that we're working through 00:14:35.980 |
is what is the level of fidelity that is optimal, right? 00:14:45.420 |
But one of the things that we're thinking about is like, 00:14:51.820 |
a somewhat lower fidelity version of your body, 00:14:54.420 |
but still have the main kind of the major movements. 00:15:03.780 |
In terms of being able to read and express emotions. 00:15:06.860 |
I mean, like you said, if you move your eyebrows 00:15:10.220 |
like a millimeter, I mean, that really changes the expression 00:15:14.260 |
Whereas, I mean, moving your arm like an inch 00:15:21.100 |
So yeah, so I think that we do want to get all of that 00:15:27.900 |
- So you mentioned Quest 3, that's coming out. 00:15:34.620 |
So it's not just virtual reality, it's mixed reality. 00:15:38.380 |
it's going to be the first mainstream mixed reality device. 00:15:42.180 |
I mean, obviously we shipped Quest Pro last year, 00:15:46.900 |
And part of what I'm super proud of is, you know, 00:15:50.500 |
we try to innovate, not just on pushing the state of the art 00:15:54.940 |
but making it so it can be available to everyone. 00:15:57.580 |
And, you know, we have this and it's coming out, it's $500. 00:16:07.180 |
than what we're using right now in Quest Pro. 00:16:10.980 |
for being able to deliver that kind of an innovation 00:16:14.260 |
But, you know, some of this is just software, 00:16:26.100 |
So we got the resolution for the mixed reality cameras 00:16:29.300 |
and sensors to be multiple times better in Quest 3. 00:16:32.940 |
And we just figured that that made a very big difference 00:16:40.980 |
Qualcomm just came out with their next generation chip set 00:17:01.980 |
So I'm really excited to get this in people's hands. 00:17:10.020 |
It does it better because the display is better 00:17:12.620 |
and the chip is better, so you'll get better graphics. 00:17:16.260 |
It's 40% thinner, so it's more comfortable as well. 00:17:21.260 |
But the MR is really the big capability shift. 00:17:24.100 |
And part of what's exciting about the whole space right now 00:17:29.820 |
where companies put out a new smartphone every year 00:17:32.060 |
and you can almost barely tell the difference 00:17:36.260 |
Now for this, each time we put out a new headset, 00:17:44.580 |
The ability to basically take digital representations 00:17:47.380 |
of people or objects and superimpose them on the world. 00:17:53.380 |
there's one version of this is you're gonna kind of 00:17:56.980 |
have these augments or holograms and experiences 00:18:01.140 |
that you can kind of bring into your living room 00:18:11.220 |
is that there are a lot of VR experiences today 00:18:20.500 |
sometimes people get worried about swinging their arms 00:18:22.500 |
around, like, am I gonna hit a lamp or something? 00:18:28.500 |
So having that in mixed reality actually is just 00:18:32.860 |
You kind of still get the immersion and the 3D experience 00:18:35.980 |
and you can have an experience that just wouldn't be 00:18:40.220 |
but by being anchored to and being able to see 00:18:46.180 |
And I think a lot of people are really gonna enjoy that too. 00:18:48.260 |
So yeah, I'm really excited to see how people use it. 00:18:50.740 |
But yeah, Quest 3 coming out later this fall. 00:18:54.300 |
- Yeah, and I got to experience it with other people 00:18:56.660 |
sitting around and there's a lot of furniture. 00:19:03.780 |
enjoy the ridiculousness of you, like, swinging your arms. 00:19:14.540 |
That's a really fundamentally different experience 00:19:16.220 |
than just pure VR, with zombies coming out of walls. 00:19:30.100 |
in a room with no windows, you can add windows to it. 00:19:33.020 |
And you can look outside as the zombies run towards you, 00:19:35.780 |
but like, it's still nice view outside, you know? 00:19:41.300 |
by having cameras on the outside of the headset 00:19:51.260 |
You basically need to, you need multiple cameras 00:19:58.700 |
And then it's a pretty complex compute problem, 00:20:01.700 |
an AI problem to map that to your perspective, right? 00:20:05.020 |
Because the cameras aren't exactly where your eyes are 00:20:08.340 |
you're not going to be in exactly the same place. 00:20:21.180 |
And then superimpose whatever digital objects 00:20:24.780 |
So it's, yeah, it's a very interesting technical challenge. 00:20:35.820 |
because I think Quest 3 is going to be the first device 00:20:38.180 |
like this with, that millions of people are going to get. 00:20:41.660 |
And it's only when you have millions of people 00:20:45.380 |
the whole developer community really starting to experiment 00:20:49.580 |
because now there are going to be people who actually use it. 00:20:53.780 |
you know, we got some of that flywheel going with Quest Pro, 00:21:05.380 |
So this camera, the cameras aren't just doing the pass through 00:21:14.660 |
in order to use that for like gesture recognition, 00:21:17.940 |
Yeah, we've been able to get way further on hand recognition 00:21:24.740 |
I don't know, did you see the demo experience 00:21:30.220 |
- Like, yeah, the piano, learning to play piano? 00:21:37.620 |
and like how well it matches physical reality 00:21:48.380 |
with no latency, integrating physical reality 00:21:53.100 |
Obviously that connects exactly to this Kodak avatar, 00:21:56.580 |
which is in parallel, allows us to have ultra realistic 00:22:05.100 |
So it's all converging towards like an incredible 00:22:11.820 |
To me, obviously I love the intimacy of conversation. 00:22:16.100 |
But do you have other ideas of what this unlocks 00:22:28.460 |
- Well, there's what you can do with avatars overall 00:22:39.740 |
So I think we're moving towards a world where 00:22:44.900 |
we're gonna have something that looks like normal glasses, 00:22:53.300 |
And in that world, I think that they're gonna be 00:22:57.420 |
not too far off, maybe by the end of this decade, 00:23:03.220 |
kind of as many holograms when you walk into a room 00:23:07.700 |
And it really raises this interesting question about 00:23:14.980 |
a lot of people have this phrase where they call 00:23:23.860 |
but I actually think the real world is the combination 00:23:26.460 |
of the physical world and the digital worlds coming together. 00:23:29.140 |
But until this technology, they were sort of separate. 00:23:33.900 |
It's like you access the digital world through a screen. 00:23:36.620 |
And maybe it's a small screen that you carry around 00:23:39.220 |
or it's a bigger screen where you sit down at your desk 00:23:43.300 |
but they're kind of fundamentally divorced and disconnected. 00:23:47.100 |
And I think part of what this technology is gonna do 00:23:49.860 |
is bring those together into a single coherent experience 00:23:58.620 |
So the physical world is always gonna be super important. 00:24:01.660 |
But increasingly, I think a lot of the things 00:24:04.180 |
that we kind of think of can be digital holograms. 00:24:08.620 |
I mean, any screen that you have can be a hologram, 00:24:15.740 |
can basically be just as effective as a hologram 00:24:18.940 |
as a physical object, any game that you're playing, 00:24:21.540 |
a board game or any kind of physical game, cards, 00:24:29.100 |
'cause you could just kind of snap your fingers 00:24:36.580 |
in your living room, but then you can snap your fingers 00:24:41.740 |
So I think that it's actually an amazing thought experiment 00:24:49.100 |
that could actually be better as interactive holograms. 00:24:52.860 |
But then beyond that, I think the most important thing, 00:25:06.020 |
you're wearing glasses or a headset in the very near term, 00:25:09.580 |
but hopefully by, over the next five years, glasses or so. 00:25:20.740 |
and it feels like it's them who are right there. 00:25:36.900 |
So I think you'll also have meetings in the future 00:25:48.300 |
who's an engineer on your team, who's helping with things. 00:25:50.780 |
And he can now be embodied as a realistic avatar as well 00:25:58.660 |
So I think that that's gonna be pretty compelling as well. 00:26:03.220 |
So then, okay, so what can you do with photorealistic avatars 00:26:11.060 |
Well, I think a lot of this actually comes down 00:26:17.300 |
And because all of the stuff that we're doing, 00:26:23.060 |
the motion of your eyes, the cheeks and all of that, 00:26:26.620 |
there's actually no reason why you couldn't do that 00:26:32.100 |
but you can make a cartoon version of yourself 00:26:41.020 |
between the current state of most of our interactions 00:26:45.020 |
in the physical world and where we're getting in the future 00:26:47.780 |
with this kind of hybrid physical and digital world, 00:26:51.420 |
where I think it's gonna be a lot easier for people 00:26:54.420 |
to kind of take some of these experiences seriously 00:27:04.340 |
I could see a world where people stick to the photorealistic 00:27:10.500 |
but maybe fundamentally we like photorealistic things. 00:27:13.580 |
But I can also see a world that once people get used 00:27:22.060 |
that I actually think that there could be a world 00:27:24.900 |
where people actually prefer being able to express themselves 00:27:38.460 |
And I don't know, in a bunch of our internal experiments 00:27:45.820 |
is people have no issues blending photorealistic stuff 00:27:53.460 |
that we're in now, we happen to sort of being in a dark room. 00:28:00.980 |
I think was based on the way you like to do your podcasts. 00:28:11.140 |
but photorealistic people who you're talking to. 00:28:13.780 |
And we seem to, like people just seem to just think 00:28:24.660 |
is basically you have a photorealistic avatar 00:28:27.380 |
that you're talking to, and then right next to them, 00:28:29.700 |
you have an expressive kind of cartoon avatar. 00:28:32.300 |
And that actually is pretty normal too, right? 00:28:37.740 |
To basically being interacting with different people 00:28:55.580 |
if that kind of includes both the digital and physical, 00:28:58.340 |
or if this is like the long-term way that it stays. 00:29:03.860 |
for both the expressive and the photorealistic over time. 00:29:06.040 |
I just don't know what the balance is gonna be. 00:29:12.500 |
the photorealistic is amazing to where I would prefer, 00:29:19.940 |
to see a buddy of mine remotely on a chair next to me 00:29:32.400 |
I don't want any, just chilling with a friend, 00:29:45.220 |
it's just a high-quality experience of friendship. 00:29:52.180 |
in the same kind of realism I'm seeing right now. 00:29:57.420 |
I can see myself sticking with this for a long time. 00:30:06.320 |
It's like being able to pull this off is like, 00:30:11.500 |
it's just this kind of like awesome experience. 00:30:33.880 |
it's like, okay, so my hair is a little shorter in this 00:30:46.100 |
I could still have this photorealistic avatar 00:31:00.220 |
where the avatar is realistic in the sense of, 00:31:04.340 |
this is kind of what you looked like at the time of capture, 00:31:07.440 |
but it's not necessarily temporarily accurate 00:31:10.580 |
to exactly what you look like in this moment. 00:31:15.480 |
a bunch of questions that come from that over time 00:31:22.380 |
- You mean just like the nature of identity of who we are? 00:31:26.620 |
you know how people do like summer beach body 00:31:31.700 |
they'll try to lose some weight and look their best 00:31:33.860 |
and sexiest with the nice hair and everything like that. 00:31:55.820 |
So I think there's a similar question of like, 00:32:02.180 |
I mean, and it's interesting identity question 00:32:07.660 |
it's like weight fluctuates or things like that. 00:32:14.020 |
I don't know, it's an interesting psychological question. 00:32:18.660 |
maybe a lot of people do think about themselves 00:32:26.180 |
And then it's like what you are on a day-to-day basis 00:32:35.420 |
there will definitely be a bunch of social scientists 00:32:45.540 |
about how our perception of ourselves and others 00:32:55.260 |
but one of the first feelings I had experiencing this 00:33:01.820 |
And the next question I have is I would love to talk 00:33:04.340 |
to people who are no longer here that are loved ones. 00:33:16.500 |
talk to your father, talk to your grandfather 00:33:18.500 |
and grandmother and a mother once they pass away. 00:33:29.620 |
- Yeah, I think that there are a lot of norms 00:33:33.940 |
and things that people have to figure out around that. 00:33:36.420 |
There's probably some balance where, you know, 00:33:38.660 |
if someone has lost a loved one and is grieving, 00:33:46.580 |
being able to interact or relive certain memories 00:34:01.500 |
We have, you know, a fair amount of experience 00:34:09.860 |
through social media already, unfortunately, right? 00:34:12.180 |
Where there's, you know, unfortunately, you know, 00:34:22.540 |
And we have whole protocols that we go through 00:34:32.500 |
So that way the account doesn't just go away immediately. 00:34:34.900 |
But then there are other things that are, you know, 00:34:36.460 |
important kind of private things that that person has. 00:34:38.860 |
Like we're not going to give the family access 00:34:42.740 |
So, yeah, I think that there's some best practices, 00:34:55.100 |
Another version of this is how this intersects with AIs, 00:35:05.980 |
you know, we want the world to evolve in a way 00:35:08.580 |
where there isn't like a single AI super intelligence, 00:35:11.740 |
but where, you know, a lot of people are empowered 00:35:30.220 |
I know you'd love to cultivate that community 00:35:40.300 |
and I'm sure you'd love to interact with the community more, 00:35:43.380 |
but you just are limited by the number of hours in the day. 00:35:48.140 |
making it so that you could build an AI version of yourself 00:36:02.460 |
to interact with you and your desire to build a community. 00:36:05.700 |
And there's a lot of interesting questions around that. 00:36:13.060 |
I think, you know, we'd want to make that work, 00:36:15.740 |
you know, across all the messaging platforms, 00:36:18.220 |
you know, WhatsApp and Messenger and Instagram Direct. 00:36:24.740 |
an avatar version of yourself in the metaverse 00:36:27.780 |
and you could define that sort of an AI version, 00:36:33.620 |
that it's not, you know, the kind of physical version of you, 00:36:36.860 |
but maybe that AI, even if they know it's an AI, 00:36:39.340 |
is the next best thing, because they're probably 00:36:45.140 |
I think that that could be a really compelling experience. 00:36:48.220 |
There's a lot of things that we need to get right about it, 00:36:51.660 |
that, you know, we're not ready to release the version 00:36:55.260 |
that a creator can kind of build a version of themselves yet, 00:37:03.060 |
that people can interact with in different ways. 00:37:05.300 |
And I think that that is also just going to be 00:37:08.460 |
a very powerful, you know, set of capabilities 00:37:21.580 |
where you can talk to them across the meta apps 00:37:24.420 |
and have like interesting, unique kind of conversations. 00:37:34.740 |
- Yeah, so, I mean, a lot of the vision comes from 00:37:37.820 |
this idea that, you know, I don't think we necessarily 00:37:41.260 |
want there to be like one big super intelligence. 00:37:44.060 |
We want to empower everyone to both, you know, 00:37:46.900 |
have more fun, accomplish their business goals, 00:37:49.260 |
you know, just everything that they're trying to do. 00:37:52.340 |
And, you know, we don't tend to have, you know, 00:37:56.180 |
And I don't think in the future we're going to have, 00:37:59.860 |
I think you're going to want a variety of these. 00:38:07.580 |
If some will be kind of more assistant oriented, 00:38:10.340 |
there's a sort of the kind of plain and simple one 00:38:15.620 |
It's simple, you can chat with it in any of your threads. 00:38:35.740 |
So let's say you want to contact a small business, 00:38:41.060 |
that business probably doesn't want to have to 00:38:45.460 |
And you probably don't want to wait on the phone 00:38:46.700 |
to talk to someone, but you're having someone 00:38:48.620 |
who you can just like talk to in a natural way 00:38:50.500 |
who can help you if you're having an issue with a product 00:39:05.060 |
is gonna be really good, both for the businesses 00:39:07.580 |
and for normal people who want to interact with businesses. 00:39:20.340 |
I think that there will be AIs that can tell jokes. 00:39:23.700 |
So you can put them into chat thread with friends. 00:39:37.780 |
to facilitate connection between two people or more, 00:39:57.660 |
just having an AI that you play together with. 00:40:04.260 |
but it could deeply enrich the gaming experience. 00:40:12.260 |
So that's a separate thing that I'm pretty excited about. 00:40:15.100 |
But yeah, I mean, one of the AIs that we've built 00:40:21.380 |
people have loved the most is like a adventure, 00:40:36.100 |
but we've gotten some real kind of cultural figures 00:40:41.780 |
and be the embodiment and the avatar of them. 00:40:58.500 |
a copy, or not a copy, maybe inspired by Snoop. 00:41:03.500 |
What are some of the technical challenges of that? 00:41:06.700 |
What does that experience look like for Snoop 00:41:11.140 |
- So starting off, creating new personas is easier 00:41:24.900 |
It's like it's just a new character that we created. 00:41:45.260 |
there's a whole set of things that you need to do 00:41:47.540 |
to make sure that that AI is not going to say things 00:42:04.940 |
So I think that it's less of a question around 00:42:16.780 |
it's like, well, we have our kind of V1 of that 00:42:24.780 |
But a lot of this is really just about continuing 00:42:35.180 |
or predictable in terms of what they'll communicate. 00:42:37.100 |
So that way, when you want to create the Lex assistant AI 00:42:45.100 |
you don't program them like normal computers, 00:42:53.820 |
but you want to get it to be predictable enough 00:42:57.100 |
so that way you can set some parameters for it. 00:43:06.500 |
So that's a lot of what I think we need to nail 00:43:12.500 |
And that's why that one's actually a much harder problem, 00:43:19.620 |
So that one I think will probably start releasing 00:43:32.140 |
and experimenting with some small businesses. 00:43:36.020 |
I think that that stuff will be ready to do this year 00:43:38.060 |
and we're rolling it out basically right after Connect. 00:43:42.140 |
- Yeah, I'm deeply entertained by the possibility 00:43:46.860 |
"Hey man, you need to stop the dad jokes," or whatever. 00:44:02.660 |
of a Kodak Avatar, being able to freeze yourself, 00:44:08.060 |
So everything you do, you get to see yourself do it. 00:44:12.340 |
That feels like if I was like an ape looking in a mirror 00:44:15.740 |
for the first time, realizing like, oh, that's you. 00:44:18.700 |
But then freezing that and being able to look around 00:44:26.500 |
but it just feels like a fundamentally new experience. 00:44:29.060 |
Like I'm seeing maybe color for the first time. 00:44:32.300 |
I'm experiencing a new way of seeing the world 00:44:38.460 |
Because it's physical reality, but it's digital. 00:44:42.140 |
And realizing that that's possible, it's just, 00:44:50.260 |
'Cause I lived most of my life before the internet 00:44:55.540 |
experiencing voice communication and video communication. 00:44:59.780 |
You think like, well, there's a ceiling to this, 00:45:02.580 |
but this is making me feel like, oh, there might not be. 00:45:05.740 |
There might be that blend of physical reality 00:45:14.900 |
It feels like the early days of a totally new way of living. 00:45:19.900 |
And there's a lot of people that kind of complain, 00:45:22.060 |
well, you know, the internet, that's not reality. 00:45:26.900 |
You need to turn all that off and go in nature. 00:45:30.220 |
But this feels like this will make those people happy, 00:45:37.500 |
- Yeah, well, I mean, a big part of how we're trying 00:45:45.020 |
Right, I think that's a big part of the issue 00:45:46.660 |
with computers and TVs and even phones is like, 00:45:50.980 |
yeah, I mean, maybe you can interact with them 00:45:52.420 |
in different places, but they're fundamentally, 00:45:56.660 |
And I mean, people are just not meant to be that way. 00:45:59.100 |
I mean, I think you and I have this shared passion 00:46:02.220 |
for sports and martial arts and doing stuff like that. 00:46:06.420 |
It's like so much of what makes us people is like, 00:46:09.140 |
you move around, we're not just like a brain in a tank. 00:46:12.420 |
It's the where, the human experience is a physical one. 00:46:16.580 |
And so it's not just about having the immersive expression 00:46:31.100 |
Where the digital is, there's too much digital 00:46:33.140 |
at this point for it to just be siloed to a small screen, 00:46:37.540 |
So you don't want to just sit down all day long at a desk. 00:46:46.180 |
This is, I think the kind of philosophical way 00:46:50.260 |
that I would want the world to work in the future 00:46:52.060 |
is a much more coherently blended physical and digital world. 00:46:59.620 |
and unethical questions we have to figure out as a society. 00:47:19.260 |
You know, in video games, we allow all kinds of crazy stuff. 00:47:25.060 |
And in physical reality, you know, a lot of that is illegal. 00:47:36.780 |
- Well, I think, I mean, there are content policies 00:47:40.340 |
In terms of what people are allowed to create. 00:47:42.820 |
But I mean, a lot of the rules around physical, 00:47:49.540 |
meaning that people can do as much of what they want 00:47:52.100 |
unless you're going to do damage to other people. 00:48:02.420 |
I mean, when I get into some world with my friends, 00:48:06.540 |
the first thing we start doing is shooting each other, 00:48:08.580 |
which obviously we would not do in the physical world 00:48:12.740 |
But in a game, that's like just, it's almost, 00:48:20.500 |
It's like, it's not even bearing on the game, 00:48:22.820 |
which is kind of like a funny sort of humorous thing to do. 00:48:30.340 |
it's not, you're not causing harm in that world. 00:48:38.020 |
is what are the ways where things could have been harmful 00:48:42.180 |
in the physical world that will now be freed from that? 00:48:44.380 |
And therefore there should be fewer restrictions 00:48:50.580 |
in which there could be harm in the digital world 00:48:55.740 |
It's, you know, when you show up to a restaurant 00:49:02.620 |
it's because, you know, you have one identity 00:49:13.300 |
But in a digital world where you can be anonymous 00:49:18.620 |
I think the incentive to act like a good citizen 00:49:21.380 |
can be a lot less and that causes a lot of issues 00:49:30.060 |
I think you wanna just look at what are the damages. 00:49:42.420 |
that are more about the laws of physics, right? 00:49:44.220 |
It's like, if you wanted to travel to see me in person, 00:49:50.740 |
and that would like, you know, take a few hours to get here. 00:49:53.740 |
Whereas, you know, we could just jump in a conference room 00:50:02.340 |
where we're, you know, it feels like we're together. 00:50:10.220 |
that previously would have defied the laws of physics 00:50:17.820 |
So, and one of the things that I'm curious about is, 00:50:21.220 |
you know, there are all these debates right now 00:50:25.820 |
And, you know, I think this gets us a lot closer 00:50:28.260 |
to being able to work physically in different places, 00:50:31.020 |
but actually have it feel like we're together. 00:50:53.540 |
And that would open up a lot of opportunities, right? 00:50:55.340 |
Because then people could live physically where they want 00:51:00.780 |
of being physically or kind of feeling like you're together 00:51:05.540 |
all the ways that that helps to build more culture 00:51:08.500 |
and build better relationships and build trust, 00:51:12.660 |
that if you're not seeing people in person ever. 00:51:27.580 |
all the good and the things that you need to mitigate. 00:51:30.540 |
So you try to do your best to kind of envision 00:51:34.380 |
and accentuate the things that they're going to be awesome 00:51:37.060 |
and hopefully mitigate some of the downside things. 00:51:41.620 |
that we're going to be building this out one year at a time. 00:51:45.540 |
So we're going to just get to see how it evolves 00:51:48.420 |
and what developers and different folks do with it. 00:51:53.940 |
this might be a bit of a very specific technical question, 00:52:02.340 |
There's already been a lot of exciting developments 00:52:06.980 |
Is there, what's your sense about its release? 00:52:17.860 |
we were talking about the debate that we were having 00:52:37.820 |
we spent a lot of time to get very rigorous assessment 00:52:48.860 |
to see how excited people have been about it. 00:53:23.620 |
all the different AIs and a bunch of different products 00:53:28.620 |
that we're basically building as consumer products. 00:53:30.740 |
'Cause Lama 2 by itself, it's not a consumer product, right? 00:53:45.180 |
and it's little, the branches that we built off of it 00:53:49.740 |
ready for consumer products that hopefully, you know, 00:53:53.940 |
will enjoy using those products in billions one day. 00:54:03.140 |
And I don't have anything new or news on that. 00:54:09.620 |
I don't know exactly when it's gonna be ready. 00:54:12.740 |
I think just like we had a debate around Lama 2 00:54:25.060 |
to open source this next version when it's ready too. 00:54:33.940 |
it's a thing that we're still somewhat early in working on. 00:54:39.500 |
for open sourcing Lama 2 and for being transparent 00:54:41.820 |
about all the exciting developments around AI. 00:54:48.580 |
to a really awesome conversation about where we go with AI. 00:54:54.140 |
to see all the same kind of technology integrated 00:54:56.500 |
into these personalized AI systems with the AI personas, 00:55:02.220 |
which I think when you put it in people's hands 00:55:05.020 |
and they get to have conversations with these AI personas, 00:55:08.140 |
you'll get to see like interesting failure cases, 00:55:15.060 |
or, and we get to learn as a society together, 00:55:17.300 |
what's too far, what's interesting, what's fun, 00:55:21.300 |
how much personalization is good, how much generic is good. 00:55:27.980 |
Like we have to all figure it out by using it, right? 00:55:31.420 |
- Yeah, I mean, part of what we're trying to do 00:55:32.740 |
with the initial AI's launch is having a diversity 00:55:39.420 |
just so that people can try different things. 00:55:53.380 |
or they can want to interact with historical figures? 00:56:11.500 |
and I am also excited to see, you know, the internet, 00:56:13.980 |
I don't know if you heard, can get kind of weird 00:56:19.300 |
- Yeah, so it'd be nice to see how weird they take it, 00:56:26.900 |
especially in this early stages of development, 00:56:31.620 |
it's good to learn by playing with those systems 00:56:34.900 |
and interacting with them at like a large scale, 00:56:49.460 |
over time anyone will be able to create one of these AIs, 00:56:53.660 |
almost like they create any other UGC content 00:57:00.980 |
we're not gonna see the full potential of this 00:57:06.740 |
of the whole community being able to build stuff. 00:57:08.700 |
But there's a lot of stuff that we need to get right. 00:57:26.700 |
but the extent to which we're building it out 00:58:05.980 |
So I keep forgetting for long stretches of time 00:58:40.500 |
we might not even predict change civilization. 00:58:52.780 |
but I do think that there's a lot to process about it. 00:58:59.340 |
We're just not, we're not physically in the same place. 00:59:02.220 |
It's not like you're not talking to an AI, right? 00:59:30.580 |
about what is the nature of kind of the modern real world? 00:59:35.860 |
it really is this combination of a physical world 00:59:48.460 |
and all of the innovation that's getting created there. 00:59:54.620 |
because I mean, the digital world is just increasing 00:59:57.380 |
in its capability and our ability to do awesome things, 01:00:07.620 |
So I don't think we want to run away from that 01:00:19.820 |
And it's, you know, even though the current version of this 01:00:24.980 |
I just think this is going to be by far the most human 01:00:29.060 |
and social computing platform that has ever existed. 01:00:43.220 |
maybe shifting it towards the sort of consciousness. 01:00:54.860 |
versus necessarily being in the same physical space. 01:00:58.140 |
'Cause it feels like we're in the same physical space. 01:01:05.460 |
not like that the space time, like the physics of it, 01:01:15.220 |
It's just whatever's going on inside my head. 01:01:17.460 |
- But there are a lot of social and psychological things 01:01:23.460 |
that was previously only physical presence, right? 01:01:26.300 |
I think that there's like an intimacy, a trust, 01:01:32.740 |
because so much of communication is nonverbal 01:01:34.660 |
and is based on expressions that you're kind of, 01:01:43.460 |
And before those things would have only been possible, 01:01:46.060 |
you know, had I gotten on a plane and flown to Austin 01:01:49.900 |
and sat physically with you in the same place. 01:01:56.620 |
those laws of physics and delivering the social 01:02:00.860 |
and psychological benefits of being able to be present 01:02:04.180 |
and feel like you're there with another person, 01:02:06.100 |
which I think are real benefits to anyone in the world. 01:02:12.380 |
I mean, I think that is going to be a very profound thing. 01:02:16.620 |
that's the promise of the metaverse and what, you know, 01:02:20.340 |
why I just, why I think that that's the next frontier 01:02:25.620 |
You know, I started working on social networks 01:02:29.500 |
where the first version of Facebook, your profile, 01:02:31.900 |
you know, you had one photo and the rest of it 01:02:33.780 |
was like lists of things that you were interested in. 01:02:43.580 |
but there's a clear trend where, you know, over time, 01:02:52.940 |
about the world around us gets increasingly just richer 01:03:03.060 |
or the people who you care about is, from my perspective, 01:03:12.340 |
It requires building up these new computing platforms 01:03:14.540 |
and completely new software stacks to deliver that. 01:03:26.260 |
And so for me, this photo realism is really, really exciting. 01:03:35.900 |
Thanks to you and thanks to the amazing Meta teams 01:03:46.740 |
And thank you, Mark, for talking to me inside the Metaverse. 01:03:54.140 |
I would love to measure my heart rate this whole time. 01:04:02.340 |
- Okay, well, I'm at a beach and not wearing any pants. 01:04:14.020 |
It's one of the most incredible experiences of my life.