back to indexJeffrey Shainline: Neuromorphic Computing and Optoelectronic Intelligence | Lex Fridman Podcast #225
Chapters
0:0 Introduction
0:44 How are processors made?
20:2 Are engineers or physicists more important
22:31 Super-conductivity
38:18 Computation
42:55 Computation vs communication
46:36 Electrons for computation and light for communication
57:19 Neuromorphic computing
82:11 What is NIST?
85:28 Implementing super-conductivity
93:8 The future of neuromorphic computing
112:41 Loop neurons
118:57 Machine learning
133:23 Cosmological evolution
140:32 Cosmological natural selection
157:53 Life in the universe
165:40 The rare Earth hypothesis
00:00:00.000 |
The following is a conversation with Jeff Shainlein, 00:00:08.440 |
We have a deep technical dive into computing hardware 00:00:14.080 |
I urge you to hop onto this rollercoaster ride 00:00:24.360 |
Jeff is a great communicator of technical information. 00:00:27.200 |
And so it was truly a pleasure to talk to him 00:00:34.440 |
please check out our sponsors in the description. 00:00:39.880 |
and here is my conversation with Jeff Shainlein. 00:00:43.360 |
I got a chance to read a fascinating paper you authored 00:00:52.040 |
So maybe we can start by talking about this paper 00:01:13.380 |
in conjunction with electronic circuits for computation. 00:01:22.000 |
is focused on superconducting electronics for computation. 00:01:27.720 |
but that might make a little more sense in context 00:01:31.420 |
if we first describe what that is in contrast to, 00:02:01.100 |
but let's talk about the basic building blocks, 00:02:30.780 |
and they move under the influence of voltages. 00:02:33.080 |
So you apply voltages, electrons move around, 00:02:38.400 |
and you can represent information in that way. 00:02:52.600 |
that can move around by putting different elements, 00:03:02.200 |
which means all the atoms that comprise the material 00:03:12.200 |
and you go along what are called the lattice vectors, 00:03:14.660 |
you get to another atom and another atom and another atom, 00:03:23.800 |
but you can intentionally replace a silicon atom 00:03:29.240 |
and then you can change the number of free electrons 00:03:33.980 |
that has that excess of what are called dopants. 00:03:42.040 |
and if you apply a voltage between those two, 00:03:44.160 |
you can cause electrical current to flow between them. 00:03:53.800 |
between the left and right terminal and that third voltage, 00:03:58.720 |
So what's commonly done in digital electronic circuits 00:04:02.280 |
is to leave a fixed voltage from left to right, 00:04:09.080 |
at what's called the gate, the gate of the transistor. 00:04:18.880 |
and you do this by changing the concentration 00:04:21.520 |
of different dopants in the lattice spatially. 00:04:24.800 |
And then when you apply a voltage to that gate, 00:04:27.880 |
you can either cause current to flow or turn it off. 00:04:45.520 |
that have really had a profound influence on our society. 00:04:51.600 |
Can you give a sense of what scale we're talking about 00:04:56.520 |
being able to mass manufacture these kinds of gates? 00:05:01.560 |
- Yeah, so scale in a number of different senses. 00:05:07.520 |
the distance between two atoms there is half a nanometer. 00:05:16.520 |
I think it's some six orders of magnitude smaller 00:05:20.020 |
than the width of a human hair, something on that order. 00:05:44.560 |
So for example, I said you have a left contact, 00:05:52.240 |
That's called the channel width or the channel length. 00:05:56.000 |
And what has enabled what we think of as Moore's law 00:06:07.940 |
that feature size ever smaller, ever smaller. 00:06:14.800 |
I mean, that feature size has decreased consistently 00:06:23.760 |
And that was what Moore predicted in the 1960s. 00:06:27.160 |
He thought it would continue for at least two more decades, 00:06:30.800 |
And so that is why we've been able to fit ever more devices, 00:06:35.800 |
ever more transistors, ever more computational power 00:06:41.640 |
So a user sits back and does essentially nothing. 00:06:47.680 |
so they get faster, they get more energy efficient, 00:06:56.400 |
about what we're doing as say a software designer 00:07:01.120 |
I absolutely don't mean to say that there's no innovation 00:07:12.460 |
of continued performance improvement through this scaling 00:07:22.900 |
That power consumption has not continued to scale 00:07:27.500 |
but nevertheless, we had a really good run there for a while 00:07:31.420 |
and now we're down to gates that are seven nanometers, 00:07:39.780 |
I can't keep up with where the predictions are 00:07:56.800 |
would think you can't go much further than that 00:08:01.060 |
in the way we think about the physics of our devices. 00:08:09.500 |
So how have we been able to make those transistors 00:08:14.020 |
Well, companies like Intel, Global Foundries, 00:08:17.900 |
they invest a lot of money in the lithography. 00:08:27.840 |
So you start with sort of a pristine silicon crystal 00:08:34.240 |
which is a technique where you can pattern different shapes 00:08:37.560 |
using light, you can define which regions of space 00:08:41.560 |
you're going to implant with different species of ions 00:08:49.700 |
So by using ever shorter wavelengths of light 00:08:54.540 |
and different kinds of lithographic techniques, 00:09:01.140 |
you can just simply shrink that feature size down. 00:09:04.860 |
Well, the wavelength of light that's being used 00:09:14.760 |
Well, there's an extraordinary amount of innovation 00:09:21.940 |
And now the question is, can you make it smaller? 00:09:25.960 |
do you still continue to get performance improvements? 00:09:34.320 |
So, okay, you picture a chip that has a processor on it. 00:09:48.360 |
multiple layers, tens, probably a hundred some layers 00:10:04.320 |
and it has maybe a thousand processors on it. 00:10:11.800 |
because the manufacturing process was so streamlined. 00:10:19.360 |
has to have that kind of manufacturing scalability, 00:10:29.220 |
It's not, I mean, of course there's human ingenuity 00:10:31.780 |
that goes into it, but at least from my side, where I sit, 00:10:36.100 |
it sure looks like the physics of our universe 00:10:41.140 |
And we've discovered how more so than we've invented it. 00:11:04.620 |
you're able to have electrons not interfere with each other 00:11:08.500 |
in such a way that you could still have gates. 00:11:22.760 |
So starting with the silicon material itself, 00:11:44.540 |
And depending on the voltage you apply there, 00:11:47.260 |
you pull more carriers into the conduction channel 00:11:50.020 |
or push them away so it becomes more or less conductive. 00:11:54.700 |
without just sucking those carriers right into that contact, 00:11:59.260 |
And part of scaling has been to gradually decrease 00:12:06.060 |
so that you can use a roughly similar voltage 00:12:09.060 |
and still have the same current voltage characteristics. 00:12:14.600 |
or I should say was initially used to do that, 00:12:18.720 |
which just naturally grows on the silicon surface. 00:12:21.820 |
So you expose silicon to the atmosphere that we breathe. 00:12:29.740 |
but nevertheless, what's called a native oxide 00:12:43.660 |
And that has to do with nothing but the physics 00:12:46.260 |
of the interaction between silicon and oxygen. 00:12:53.040 |
they could not perform in nearly the degree of capability 00:12:58.800 |
And that has to do with the way that the oxide grows, 00:13:05.760 |
its insulation, meaning essentially its energy gaps. 00:13:16.880 |
Silicon is a semiconductor in an elemental sense. 00:13:28.580 |
That opens you up to lots of defects that can occur 00:13:31.680 |
where one atom's not sitting quite at the lattice site it is 00:13:45.840 |
that can produce these very short wavelengths of light. 00:13:50.880 |
Well, you actually put this polymer on top of your wafer 00:13:56.200 |
And then you use a aqueous chemical processing 00:14:00.080 |
to dissolve away the regions that were exposed to light 00:14:13.600 |
where the polymer splits where a photon hits. 00:14:26.160 |
where very sophisticated technologies can be devised 00:14:39.760 |
precision of whatever we said, seven nanometers. 00:14:52.160 |
but in this whole journey from a history perspective, 00:14:58.440 |
at the intersection of engineering and physics 00:15:03.280 |
that we talked about with silicon and photolithography, 00:15:22.000 |
Like, what do you think is the most beautiful 00:15:26.800 |
- The most beautiful is a little difficult to answer. 00:15:34.760 |
about looking at the history of silicon microelectronics 00:15:39.760 |
is that, so when quantum mechanics was developed, 00:15:44.580 |
people quickly began applying it to semiconductors 00:15:50.760 |
and people cared about them for their basic physics, 00:15:55.040 |
And then the transistor was invented in the late 40s 00:16:04.240 |
into the semiconductor and that was ingenious. 00:16:08.000 |
These people were able to make it work, you know? 00:16:12.960 |
But so what I wanna get to that really strikes me 00:16:19.300 |
there were a number of different semiconductors 00:16:22.080 |
They had different properties, different strengths, 00:16:24.880 |
Most people thought germanium was the way to go. 00:16:28.440 |
It had some nice properties related to things 00:16:33.640 |
about how the electrons move inside the lattice, 00:16:37.280 |
but other people thought that compound semiconductors 00:16:41.060 |
also had really, really extraordinary properties 00:16:44.920 |
that might be conducive to making the best devices. 00:16:50.060 |
So there were different groups exploring each of these 00:16:56.120 |
But then what I find striking is why is it that silicon won? 00:17:02.120 |
Because it's not that germanium is a useless material 00:17:08.060 |
They're both doing exciting and important things, 00:17:14.360 |
whereas silicon is the semiconductor material 00:17:20.060 |
for digital computing, which has transformed our world. 00:17:24.160 |
It's because of a remarkable assemblage of qualities 00:17:47.200 |
without having to do a lot of materials development. 00:17:53.400 |
So in a semiconductor, there's an important parameter, 00:17:59.220 |
There are sort of electrons that fill up to one level 00:18:02.280 |
in the energy diagram, and then there's a gap 00:18:05.560 |
where electrons aren't allowed to have an energy 00:18:08.320 |
and then there's another energy level above that. 00:18:11.300 |
And that difference between the lower sort of filled level 00:18:16.860 |
that tells you how much voltage you have to apply 00:18:22.120 |
So with germanium, that's about 0.75 electron volts. 00:19:08.480 |
So just that slight extra energy in that band gap 00:19:12.780 |
really puts it in an ideal position to be operated 00:19:17.040 |
in the conditions of our ambient environment. 00:19:22.480 |
errors decrease exponentially with the voltage. 00:19:27.480 |
So it's funny 'cause this error thing comes up 00:19:32.020 |
when you start talking about quantum computing. 00:19:45.380 |
is based on the assumption that it's extremely low. 00:19:55.700 |
the assumption is stuff is gonna fail all over the place 00:19:59.100 |
and we somehow have to still be robust to that. 00:20:05.820 |
of our conversation where you're gonna make some enemies. 00:20:09.040 |
'cause we've been talking about physics and engineering. 00:20:22.580 |
some of the beautiful things that we've been talking about, 00:20:30.260 |
and he talks down to all the amazing engineering 00:20:33.260 |
that we're doing in the artificial intelligence 00:20:51.180 |
sort of the great accomplishments of society without both. 00:20:55.860 |
Physics tends to play a key role earlier in the development 00:20:59.660 |
and then engineering, optimization, these things take over. 00:21:12.860 |
that allowed the invention of the transistor, 00:21:20.260 |
But once you have understood and demonstrated 00:21:32.140 |
than old supercomputers in each of our phones, 00:21:37.460 |
And I think I would be quite foolish to say that, 00:21:55.700 |
to understand that it has all these nice properties 00:22:03.820 |
would you put that in a category of engineering? 00:22:09.940 |
it is applied physics, it's material science, 00:22:35.660 |
So what does that have to do with what we're talking about? 00:22:44.260 |
you can sort of induce currents by applying voltages 00:22:52.280 |
that you would expect from some kind of a conductor. 00:22:57.060 |
they don't just flow perfectly without dissipation. 00:23:00.360 |
If an electron collides with an imperfection in the lattice 00:23:11.420 |
In a superconductor, something different happens. 00:23:22.500 |
Well, it happens at low temperature and this is crucial. 00:23:35.780 |
I'm gonna be talking about conventional superconductors 00:23:45.120 |
And so those materials have to be at a temperature 00:23:51.960 |
I mean, their critical temperature might be 10 Kelvin, 00:23:55.400 |
but you wanna operate them at around four Kelvin, 00:24:01.640 |
at very low temperatures in certain materials 00:24:15.100 |
that the electrons can settle into this very special state. 00:24:18.580 |
It's sometimes referred to as a macroscopic quantum state 00:24:22.260 |
because if I had a piece of superconducting material here, 00:24:26.560 |
let's say niobium is a very typical superconductor. 00:24:33.620 |
and we cooled it below its critical temperature, 00:24:36.700 |
all of the electrons in that superconducting state 00:24:46.500 |
is described in terms of all of the particles simultaneously 00:24:49.740 |
but it extends across macroscopic dimensions, 00:24:59.440 |
let's try to be a little bit light on the technical details, 00:25:03.480 |
but essentially the electrons coordinate with each other. 00:25:06.500 |
They are able to, in this macroscopic quantum state, 00:25:24.980 |
it can just sort of almost miraculously avoid that defect 00:25:37.100 |
So you can get a current that flows without dissipation 00:25:42.920 |
That's sort of just very much scratching the surface 00:25:55.600 |
but it turns out that when you have this material, 00:26:00.480 |
you can do usual things like make wires out of it 00:26:21.680 |
diverse in its utility component is a Josephson junction. 00:26:26.880 |
It's not analogous to a transistor in the sense that 00:26:31.440 |
it changes how much current flows from left to right, 00:26:37.160 |
it's the go-to component that a circuit engineer 00:26:41.020 |
is going to use to start to build up more complexity. 00:26:50.600 |
I'm not sure how concerned to be with semantics, 00:26:56.520 |
but let me just briefly say what a Josephson junction is 00:26:59.440 |
and we can talk about different ways that they can be used. 00:27:02.240 |
Basically, if you have a superconducting wire 00:27:09.680 |
that's not superconducting, an insulator or normal metal, 00:27:13.520 |
and then another superconducting wire on the other side, 00:27:20.320 |
So you have this superconducting state on one side 00:27:35.440 |
it has very unusual current voltage characteristics. 00:27:45.120 |
So you can imagine, suppose you had a loop setup 00:27:47.560 |
that had one of those weak links in the loop. 00:27:53.200 |
independent even if you hadn't applied a voltage to it 00:27:57.000 |
So the fact that there's this phase difference 00:28:02.160 |
from one side of the tunneling barrier to the other 00:28:09.580 |
Now picture if I have a current bias coming down 00:28:14.920 |
and there's a Josephson junction right in the middle of it. 00:28:21.800 |
So I have a loop here, a superconducting loop. 00:28:27.960 |
by exceeding the critical current of that Josephson junction. 00:28:33.960 |
it can carry this super current that I've described, 00:28:37.680 |
this current that can propagate without dissipation 00:28:41.840 |
And if you try and pass more current than that 00:28:51.120 |
So in the Josephson junction, the same thing happens. 00:28:58.200 |
it's going to add a quantized amount of current 00:29:16.860 |
you would say you pop a flux on into the loop. 00:29:31.200 |
amounts of current that you can add to a loop. 00:29:36.600 |
but I think it's sufficient for our purposes. 00:29:47.000 |
Because we'll be talking about light a little bit. 00:29:51.080 |
- The speed is important, that's an excellent question. 00:30:00.800 |
- Matrix 4 is coming out, so maybe that's related. 00:30:10.680 |
Anyway, so what's the speed of these packets? 00:30:16.600 |
- So the speed of the packet is actually these flux ons, 00:30:29.720 |
maybe something like a third of the speed of light. 00:30:32.840 |
So one of the reasons why Joseph's injunctions are appealing 00:30:37.240 |
is because their signals can propagate quite fast 00:30:43.400 |
What I mean by switch is perform that operation 00:30:46.040 |
that I described where you add current to the loop. 00:30:49.400 |
That can happen within a few tens of picoseconds. 00:31:04.920 |
to the one gigahertz range, maybe three gigahertz 00:31:08.320 |
seems to be kind of where those speeds have leveled out. 00:31:30.160 |
The physics of Joseph's injunctions is understood well. 00:31:37.640 |
silicon microelectronics in conventional digital computing, 00:31:41.720 |
I think are more related to what I was alluding to before 00:31:45.040 |
about the myriad practical, almost mundane aspects 00:31:52.080 |
You can make a transistor ever smaller and smaller 00:31:55.880 |
and it will still perform its digital function quite well. 00:31:58.840 |
The same is not true of a Joseph's injunction. 00:32:02.400 |
it's not the same thing that there's this feature 00:32:06.280 |
and it'll keep performing the same operations. 00:32:08.200 |
This loop I described, any Joseph's in circuit, 00:32:19.440 |
or the way they perform whatever function it is 00:32:24.560 |
it depends on an interplay between the junction 00:32:50.360 |
to the densities that silicon microelectronics can. 00:32:56.960 |
about the various superconducting materials involved? 00:33:08.400 |
that also require to be super cold for Kelvin and so on. 00:33:12.040 |
- Yes, so let's dissect a couple of those different things. 00:33:22.200 |
- Yeah, what kind of cooling system can achieve four Kelvin? 00:33:26.320 |
And so liquid helium is expensive, it's inconvenient. 00:33:36.560 |
is impracticable for, it's not going in your cell phone. 00:33:40.080 |
So you can picture holding your cell phone like this 00:33:49.560 |
So if you're trying to make this in consumer devices, 00:33:54.140 |
electronics that are ubiquitous across society, 00:33:57.040 |
superconductors are not in the race for that. 00:34:03.280 |
maybe the thing we're focused on is computing systems 00:34:11.400 |
So then you can contrast what's going on in your cell phone 00:34:14.800 |
with what's going on at one of the supercomputers. 00:34:18.240 |
Colleague Katie Schumann invited us out to Oak Ridge 00:34:26.120 |
So these are some high performance supercomputers 00:34:29.480 |
out in Tennessee and those are filling entire rooms 00:34:35.920 |
okay, there you're already putting a lot of power 00:34:37.760 |
into cooling, cooling is part of your engineering task 00:34:45.600 |
that cooling to four Kelvin is out of the question. 00:34:49.480 |
It has not happened yet and I can speak to why that is 00:34:57.640 |
I don't think superconductors are gonna replace 00:35:13.400 |
scaling down to feature sizes, all that stuff, 00:35:16.040 |
semiconductors work better at the system level. 00:35:25.500 |
Is there some power to the momentum of an industry 00:35:28.160 |
that's mass manufacturing using a certain material? 00:35:46.500 |
And you know, I don't know the answer to that, 00:35:54.660 |
of the superconducting logic goes back to the 70s. 00:36:02.360 |
And they made some choices about their devices 00:36:06.080 |
and their architectures and things that in hindsight 00:36:10.980 |
And I don't mean any disrespect for the people that did it. 00:36:14.260 |
But then another generation of superconducting logic 00:36:29.920 |
that are doing digital computing based on logic gates 00:36:36.560 |
And they showed how it could go hundreds of times faster 00:36:47.000 |
but later when I went back and read the literature, 00:36:58.240 |
is because silicon already had so much momentum 00:37:04.280 |
because it had the simple way to keep getting better. 00:37:11.760 |
I don't think it would have to be that much better 00:37:15.400 |
But the problem is it's just not better than silicon. 00:37:17.760 |
It might be better than silicon in one metric, 00:37:21.400 |
or power consumption of a switching operation. 00:37:26.000 |
is a lot more than just that elemental operation. 00:37:31.000 |
including the manufacturing, including the packaging, 00:37:33.280 |
including the various materials aspects of things. 00:37:47.440 |
you can see how we could build an entire family 00:37:49.860 |
of digital electronic circuits based on these components. 00:38:20.800 |
When you talk about computation, how do you think about it? 00:38:28.900 |
or do you think something a little bit larger scale, 00:38:36.940 |
that are then required to do the kind of computation 00:38:42.160 |
'Cause when we talk about the speed of computation, 00:38:46.960 |
or is there some bigger picture that you're thinking about? 00:38:49.160 |
- Well, all right, so maybe we should disambiguate. 00:38:52.320 |
There are a variety of different kinds of computation. 00:38:57.140 |
in the theory of computation or anything like that. 00:39:00.140 |
I guess it's important to differentiate though 00:39:05.760 |
which represents information as a series of bits, 00:39:22.820 |
like we'll get into more the way your brain works, 00:39:42.600 |
that's usually referred to as analog computing. 00:39:54.200 |
you have a concept of what's called arithmetic depth, 00:40:07.640 |
And those kinds of computations in digital systems 00:40:10.880 |
are highly serial, meaning that data streams, 00:40:16.480 |
You do, you have to pull some information over there 00:40:18.880 |
and access memory from here and stuff like that. 00:40:27.720 |
In the brain, you're always drawing information 00:40:39.160 |
So one of the other things about a digital system 00:40:41.640 |
is you're performing these operations on a clock. 00:40:51.560 |
It builds its own time scales based on its internal activity. 00:41:00.920 |
where it's actually really trivial, simple computers, 00:41:04.600 |
just a huge number of them, and they're networked. 00:41:11.980 |
little processors, and there's a huge number of them. 00:41:21.960 |
Yes, there's computation happening within a neuron. 00:41:25.680 |
- Right, so I would say to think of a transistor 00:41:29.520 |
as the building block of a digital computer is accurate. 00:41:32.360 |
You use a few transistors to make your logic gates. 00:41:49.320 |
To make the analogy to the neuro side of things, 00:42:00.220 |
but they are lower on the information processing hierarchy 00:42:08.200 |
but neurons are entire processors in and of themselves 00:42:13.600 |
that can take in many different kinds of inputs 00:42:16.280 |
on many different spatial and temporal scales 00:42:20.840 |
so that they can perform different computations 00:42:24.880 |
- So this is where it enters this distinction 00:42:30.760 |
So you can think of neuron as performing computation 00:42:34.140 |
and the networking, the interconnectivity of neurons 00:42:40.920 |
And you see this with very large server systems 00:42:43.480 |
I mentioned offline, I've been talking to Jim Keller 00:42:46.160 |
whose dream is to build giant computers that, 00:42:48.580 |
you know, the bottom like there's often the communication 00:43:08.200 |
Maybe you can linger and say in this context, 00:43:11.040 |
what do you mean by computation and communication? 00:43:23.620 |
versus communication, I would say computation 00:43:34.680 |
and producing new, hopefully more useful information. 00:43:39.040 |
So for example, imagine you have a picture in front of you 00:43:44.040 |
and there is a key in it and that's what you're looking for 00:43:56.540 |
and the output might be the coordinates where the key is. 00:43:59.040 |
So you've reduced the total amount of information you have, 00:44:12.680 |
That could be how your system is performing the computation 00:44:23.720 |
it depends on, that's a very simplified example, 00:44:35.960 |
It might be interactive, there might be some recurrence 00:44:44.440 |
that actually goes from the input to the output. 00:44:49.080 |
hopefully reducing the total amount of information 00:44:53.800 |
Communication is then getting that information 00:44:59.440 |
because information is physical as Landauer emphasized 00:45:05.120 |
and you need to get that information to another place 00:45:13.440 |
and you're all trying to solve the same problem 00:45:15.000 |
but neuron A over here just deduced something 00:45:20.000 |
based on its inputs and it's now sending that information 00:45:31.840 |
- Rolf Landauer, not to be confused with Lev Landau. 00:45:38.000 |
to our understanding of the reversibility of information 00:45:43.000 |
and this concept that energy has to be dissipated 00:45:46.680 |
in computing when the computation is irreversible 00:45:55.080 |
but if you do expend energy to perform a computation 00:45:59.680 |
there's sort of a minimal amount that you have to do 00:46:07.680 |
and that the universe is an information process 00:46:21.960 |
Communication ideally does not change the information. 00:46:33.760 |
So, then the electron versus light distinction 00:46:44.560 |
- Yes, there's a lot that goes into it I guess 00:46:48.860 |
but just try to speak to the simplest part of it. 00:46:51.720 |
Electrons interact strongly with one another. 00:47:10.680 |
so you can cause it to be spatially localized. 00:47:38.140 |
that I can adjust to change where those electrons are 00:48:02.560 |
through the interaction with that medium that they're in 00:48:10.360 |
we can assume that photons don't interact with each other. 00:48:19.160 |
If I wanna send, if I have a communication channel 00:48:26.040 |
It doesn't change what those other ones were doing at all. 00:48:38.840 |
and they can branch really easily and things like that 00:48:42.720 |
because it's very hard for this packet of light 00:48:46.360 |
to change what this packet of light is doing. 00:48:50.200 |
So in computation, you want to change information 00:48:53.300 |
and if photons don't interact with each other, 00:49:01.440 |
Is there also something about the way they travel 00:49:12.600 |
So this gets back to electrons interact with each other 00:49:18.160 |
So say I'm trying to get a packet of information 00:49:22.440 |
from me to you and we have a wire going between us. 00:49:25.840 |
In order for me to send electrons across that wire, 00:49:29.040 |
I first have to raise the voltage on my end of the wire 00:49:32.240 |
and that means putting a bunch of charges on it 00:49:34.640 |
and then that charge packet has to propagate along the wire 00:49:41.280 |
That wire is gonna have something that's called capacitance 00:49:59.120 |
and the energy required to charge up that line 00:50:22.120 |
so is the number of connections I'm trying to make. 00:50:27.640 |
If I want to now send it to 10,000 other friends, 00:50:43.560 |
and the number 10,000 is not randomly chosen, 00:50:50.640 |
So a neuron in your brain needs to send 10,000 messages 00:50:56.780 |
You can't do that if you're trying to drive electrons 00:51:02.080 |
The brain does it in a slightly different way, 00:51:04.880 |
- How can light achieve the 10,000 connections 00:51:07.040 |
and why is it better in terms of the energy use 00:51:29.880 |
And I just have to inject photons into that waveguide 00:51:35.800 |
independent of how many different connections I'm making, 00:51:39.640 |
it doesn't change the voltage or anything like that 00:51:55.040 |
That makes sense, but I don't have a capacitive penalty. 00:51:58.840 |
Sometimes these are called wiring parasitics. 00:52:01.280 |
There are no parasitics associated with light 00:52:07.480 |
but how do I catch a photon on the other end? 00:52:14.080 |
you were talking about for a different application 00:52:34.280 |
has built his career around these superconducting 00:52:38.400 |
So if you're going to try to sort of reach a lower limit 00:52:45.240 |
superconductors come back into our conversation. 00:53:01.000 |
And on your loop branch, you have a little ammeter, 00:53:16.560 |
all the current goes through the superconducting branch. 00:53:18.920 |
Now a photon comes in, strikes that superconductor. 00:53:25.760 |
That's going to be destroyed by the energy of that photon. 00:53:28.480 |
So now that branch of the circuit is resistive too. 00:53:33.760 |
so that the resistance on that superconducting branch 00:53:38.400 |
Now, all of your current's going to go that way. 00:53:40.760 |
Your ammeter says, oh, I just got a pulse of current. 00:53:54.280 |
This is a very powerful superconducting device 00:53:59.200 |
that allows us to understand quantum states of light. 00:54:12.000 |
I mean, so what happens when you just barrage it with photons? 00:54:25.400 |
- So you have to be careful how many photons you send. 00:54:28.000 |
Like you have to be very precise with your communication. 00:54:34.840 |
to use these detectors for, that's a feature. 00:54:39.880 |
if a neuron sends one photon to a synaptic connection, 00:54:46.120 |
and one of these superconducting detectors is sitting there, 00:54:54.000 |
then I'm gonna do what I do when there's a synapse event, 00:54:56.040 |
I'm gonna perform computations, that kind of thing. 00:54:58.640 |
But if accidentally you send two there, or three, or five, 00:55:04.200 |
- And so this is how, in the system that we're devising here, 00:55:12.760 |
And that's what I tried to emphasize a second ago. 00:55:15.040 |
Communication should not change the information. 00:55:27.040 |
So that's a noise filtering property of those detectors. 00:55:33.120 |
where you'd rather know the exact number of photons. 00:55:36.120 |
That can be very useful in quantum computing with light. 00:55:41.800 |
around another kind of superconducting sensor 00:55:46.640 |
that Adrian Alita in our group does a lot of work on that. 00:55:52.560 |
based on the amplitude of the current pulse you divert, 00:56:04.700 |
in quantum states of light is in the number of photons. 00:56:09.320 |
And a number state will have a well-defined number 00:56:14.400 |
of your quantum computation encodes its information 00:56:20.460 |
So if you have a detector that is sensitive to that, 00:56:43.280 |
And so yes, optical clocks, optical clock distribution 00:56:51.140 |
I don't know the state of that field right now, 00:56:53.180 |
but I imagine that if you're trying to distribute a clock 00:56:55.620 |
across any appreciable size computational system, 00:57:00.360 |
- Yeah, I wonder how these giant systems work, 00:57:15.520 |
Like there's some kind of locking mechanisms or something. 00:57:20.060 |
Let's zoom in at this very particular question 00:57:40.060 |
is in the paper on optoelectronic intelligence. 00:57:44.700 |
Are we talking about something that starts to look a lot 00:57:47.300 |
like the human brain or does it still look a lot 00:58:01.140 |
What are you thinking about when you're thinking 00:58:07.340 |
There's this concept of neuromorphic computing 00:58:09.900 |
where what that broadly refers to is computing 00:58:15.440 |
based on the information processing principles of the brain. 00:58:26.440 |
people are considering architectural advances, 00:58:30.840 |
more distributed parallel network architectures and stuff. 00:58:34.200 |
And so there's this continuum of neuromorphic 00:58:37.420 |
from things that are pretty similar to digital computers, 00:58:45.720 |
and the way they send messages is a little bit more 00:58:52.780 |
But for the most part, it's still digital electronics. 00:59:03.220 |
instead of in a digital way, in an analog way. 00:59:12.100 |
quite a bit more on the neuromorphic side of things. 00:59:19.120 |
you're trying to get them to perform operations 00:59:22.460 |
that are highly analogous to the operations in the brain. 00:59:26.680 |
in neuromorphic computing, people like Yakima Wendoveri 00:59:29.160 |
and Gert Kauenberg, Jennifer Hasler, countless others. 00:59:39.440 |
And then all the way on the other extreme of the continuum 00:59:44.220 |
is where you say, I'll give up anything related 00:59:48.540 |
to transistors or semiconductors or anything like that. 00:59:57.220 |
And instead, what I wanna do is try and understand 01:00:00.660 |
at the kind of information processing it does. 01:00:05.500 |
about what hardware is best going to enable us 01:00:10.500 |
to capture those information processing principles 01:00:17.380 |
That's where I'm doing my exploration these days. 01:00:38.360 |
- Okay, so the brain is notoriously complicated. 01:00:48.680 |
Starting with the devices, neurons, as I said before, 01:00:54.620 |
they're sophisticated devices in and of themselves. 01:00:58.220 |
They can change their state based on the activity. 01:01:16.080 |
the spacing between spikes that come from neurons, 01:01:21.540 |
Also relevant, perhaps, I think the most important thing 01:01:28.100 |
that's guided my thinking is the network structure 01:01:33.880 |
- Which can also be adjusted on different scales. 01:01:41.360 |
you're changing the spatial distribution of them, 01:01:44.120 |
although spatial distribution doesn't change that much 01:01:49.400 |
But that network structure is really crucial. 01:01:55.380 |
You can't talk about the brain without emphasizing 01:02:02.880 |
or the prefrontal cortex, the part of the brain 01:02:04.840 |
that we think is most responsible for high-level reasoning 01:02:11.400 |
So you have this network that is highly interconnected. 01:02:15.560 |
And I think it's safe to say that one of the primary reasons 01:02:23.200 |
is that allows information to be communicated very rapidly 01:02:26.880 |
from any spot in the network to any other spot 01:02:38.480 |
that are related to fractals and scale invariance, 01:02:46.720 |
what spatial scale you're looking at in the brain 01:02:52.320 |
you see the same general statistical pattern. 01:02:54.960 |
So if I draw a box around some region of my cortex, 01:03:02.280 |
within that box make are gonna be within the box 01:03:06.200 |
And that's sort of called clustering, loosely speaking. 01:03:18.400 |
and you also have a non-vanishing probability 01:03:28.680 |
of a neuron making a connection as a function of distance. 01:03:38.660 |
and it would drop off up to some certain radius. 01:03:41.820 |
The probability would be reasonable close to one. 01:03:44.880 |
And then beyond that characteristic length, R zero, 01:03:51.280 |
And so that would mean that the neurons in your brain 01:03:53.560 |
are really localized, and that's not what we observe. 01:03:58.560 |
Instead, what you see is that the probability 01:04:02.700 |
it does drop off, but it drops off as a power law. 01:04:05.760 |
So the probability that you're gonna have a connection 01:04:08.400 |
at some radius R goes as R to the minus some power. 01:04:13.160 |
And that's what we see with forces in nature, 01:04:24.400 |
I love that there's a fractal dynamics of the brain 01:04:31.440 |
and you increase that box by certain step sizes, 01:04:54.880 |
to the way the physics of the universe works. 01:05:01.400 |
that I hope to be able to spend the rest of my life studying. 01:05:04.160 |
- You think you need to solve, understand this, 01:05:13.320 |
Yes, I think power laws are right at the heart of it. 01:05:19.400 |
the same thing happens in the temporal domain. 01:05:26.000 |
were always oscillating at the same frequency. 01:05:34.520 |
around that certain characteristic frequency. 01:05:37.880 |
The probability of finding neurons oscillating 01:05:40.240 |
or pulsing, producing spikes at a certain frequency 01:05:58.280 |
and that is limited by communication and other things. 01:06:10.840 |
which is physiologically limited by our devices. 01:06:17.160 |
all the way up to the lifetime of the organism. 01:06:27.480 |
you have thoughts that span organisms, right? 01:06:37.200 |
then that too on a spatial and temporal scale, 01:06:41.120 |
And then if you look at not just the human species, 01:06:46.480 |
as an organism with thoughts that are occurring, 01:06:49.640 |
that are greater and greater sophisticated thoughts, 01:06:51.640 |
there's a different spatial and temporal scale there. 01:06:58.120 |
Before we're done, I just wanna just tie the bow 01:07:01.000 |
and say that the spatial and temporal aspects 01:07:10.360 |
is more likely to happen on this faster timescale 01:07:17.200 |
more of your cortices, different modules in the brain 01:07:20.320 |
are gonna be engaged in information processing 01:07:25.320 |
So there's this concept of information integration 01:07:43.680 |
And so that happens through these fractal nested oscillations 01:07:47.600 |
that occur across spatial and temporal scales. 01:07:49.440 |
I think capturing those dynamics in hardware, 01:07:53.680 |
to me, that's the goal of neuromorphic computing. 01:08:15.840 |
- Well, it depends on what you're trying to use it for. 01:08:28.040 |
And it's also sometimes not the most helpful question. 01:08:31.840 |
What if what you wanna do with it is study it? 01:08:39.240 |
in order to observe these dynamical principles? 01:08:41.880 |
- And also, I ask myself that question every day, 01:08:56.960 |
We're not trying to get it to make us any money 01:09:00.200 |
Maybe we'll be able to do that, but that's not our goal. 01:09:02.600 |
Our goal is to see if we can get the same types 01:09:06.240 |
of behaviors that we observe in our own brain. 01:09:10.440 |
what I mean, the behaviors of the components, 01:09:14.280 |
the neurons, the network, that kind of stuff. 01:09:15.960 |
I think there's another element that I didn't really hit on 01:09:22.920 |
They have to do with the hierarchical modular construction 01:09:30.200 |
the main point that I think is relevant there, 01:09:33.680 |
let me try and illustrate it with a cartoon picture 01:09:50.640 |
and it would be about the size of a pizza sitting there. 01:10:01.640 |
it processes the what of what's going on around you. 01:10:08.920 |
and that network is structured entirely differently. 01:10:13.740 |
10 billion neurons in there, so numbers matter here. 01:10:17.760 |
And they're organized in that sort of power law distribution 01:10:34.080 |
keeping track of your position in space and time. 01:10:42.080 |
it almost doesn't even drop off as a function of distance. 01:10:48.420 |
But there are only about 100 million neurons there. 01:10:51.440 |
So you can have that huge, densely connected module 01:10:57.720 |
And the neocortex, or the cortex and the hippocampus, 01:11:05.360 |
And that communication is largely facilitated 01:11:16.020 |
So this thalamus is coordinating the activity 01:11:32.040 |
And it seems like building something like that 01:11:40.540 |
because those responsibilities, those separate modules, 01:11:46.160 |
That's gotta be central to achieving these states 01:11:51.160 |
of efficient information integration across space and time. 01:11:55.260 |
- By the way, I am able to achieve this state 01:12:06.920 |
they've created these incredible visual illustrations 01:12:10.360 |
of visual stimulation from the eye or something like that 01:12:48.880 |
It's like, whoa, because it looks kind of like the universe. 01:13:00.960 |
is the thalamocortical complex and just visualization. 01:13:05.840 |
You can find that yourself on YouTube, but it's beautiful. 01:13:14.440 |
Because all the signals sending back and forth, 01:13:20.880 |
but that's kind of like processing of inputs and outputs 01:13:30.920 |
But is there a storage mechanism that you think about 01:13:37.760 |
You have to have a way that you can store memories. 01:13:41.520 |
And there are a lot of different kinds of memory 01:13:45.560 |
That's yet another example of how it's not a simple system. 01:13:56.040 |
Usually starts in the context of Hopfield networks. 01:13:59.040 |
You were lucky to talk to John Hopfield on this program. 01:14:10.360 |
And you can think of a certain pattern of activity 01:14:14.720 |
as an attractor, meaning if you put in some signal 01:14:19.640 |
that's similar enough to other previously experienced 01:14:24.000 |
signals like that, then you're going to converge 01:14:27.760 |
to the same network dynamics and you will see these neurons 01:14:31.720 |
participate in the same network patterns of activity 01:14:37.600 |
So you can talk about the probability that different inputs 01:14:41.260 |
will allow you to converge to different basins of attraction 01:14:44.240 |
and you might think of that as, oh, I saw this face 01:14:48.000 |
and then I excited this network pattern of activity 01:14:53.080 |
I was at some movie and that's a famous person 01:15:00.560 |
but crucial to the ability to imprint those memories 01:15:04.420 |
in your brain is the ability to change the strength 01:15:08.560 |
of connection between one neuron and another, 01:15:16.620 |
of neuroscience and neuromorphic computing as well. 01:15:19.580 |
So there are two poles to that on that spectrum. 01:15:24.580 |
Okay, so more in the language of machine learning, 01:15:28.900 |
we would talk about supervised and unsupervised learning. 01:15:33.980 |
to neuromorphic computing, I will use a definition 01:15:37.040 |
of supervised learning, which basically means 01:15:40.660 |
the external user, the person who's controlling 01:15:44.140 |
this hardware has some knob that they can tune 01:15:50.420 |
depending on whether or not the network's doing 01:15:58.380 |
those synaptic weights are dynamically changing 01:16:02.340 |
in your network based on nothing that the user's doing, 01:16:05.040 |
nothing that there's no wire from the outside 01:16:09.080 |
The network itself is reconfiguring those synaptic weights 01:16:35.020 |
the useful network patterns and makes it less likely 01:16:37.720 |
that random noise, useless communication events 01:16:41.460 |
will have an important effect on the network activity. 01:16:50.940 |
of something that's not often done in machine learning, 01:17:05.000 |
of brain development, where a newborn is born 01:17:16.820 |
as opposed to growing new long distance connections. 01:17:19.700 |
It is possible in the brain to grow new neurons 01:17:26.100 |
but it doesn't seem to be the primary mechanism 01:17:37.020 |
and I learn from you and I can remember things 01:17:47.380 |
It's plasticity mechanisms in the synaptic connections 01:17:52.180 |
between neurons that enable me to learn on that time scale. 01:18:02.980 |
- I would say weight updates are a big part of it. 01:18:05.060 |
I also think there's more because broadly speaking, 01:18:10.440 |
our networks, say we're talking about feed forward, 01:18:14.140 |
the temporal domain is not really part of it. 01:18:20.400 |
and you're gonna do that as fast as possible. 01:18:23.140 |
but time is not part of the essence of this thing really. 01:18:33.340 |
and they're intimately intertwined as I've tried to say. 01:18:54.200 |
or at least not on the time scale of many action potentials, 01:19:00.180 |
So a synapse can become much less efficacious. 01:19:08.780 |
That can second, third, fourth action potential 01:19:13.080 |
So that's what's called short-term synaptic plasticity, 01:19:17.620 |
You're learning that I'm getting too much stimulus 01:19:23.400 |
There's also another really important mechanism in learning. 01:19:44.580 |
and my, this is a potentially terrible cartoon example, 01:19:51.960 |
So my brain will release more, perhaps, dopamine 01:19:58.380 |
the rate at which synaptic plasticity occurs. 01:20:05.340 |
more sensitive to overriding previous information, 01:20:10.340 |
And finally, as long as I'm rattling off the list, 01:20:13.180 |
I think another concept that falls in the category 01:20:16.480 |
of learning or memory adaptation is homeostasis 01:20:27.780 |
So if one neuron is just like blasting way too much, 01:20:38.500 |
And we see that's captured in deep neural networks 01:20:41.660 |
where you don't just change the synaptic weights, 01:20:47.580 |
- And so to achieve the spiking neural networks, 01:21:09.260 |
you can communicate across much greater distances 01:21:21.020 |
superconducting hardware for neuromorphic computing. 01:21:33.460 |
that you outline in optoelectronic intelligence? 01:21:47.100 |
By us, I'm talking about my team of colleagues at NIST, 01:22:14.640 |
and where is this amazing group of people located? 01:22:18.040 |
- NIST is the National Institute of Standards and Technology. 01:22:21.700 |
The larger facility is out in Gaithersburg, Maryland. 01:22:29.680 |
NIST is a federal agency under the Department of Commerce. 01:22:36.200 |
We do a lot with, by we, I mean other people at NIST, 01:22:42.620 |
making sure that we understand the system of units, 01:22:46.080 |
international system of units, precision measurements. 01:22:49.320 |
There's a lot going on in electrical engineering, 01:23:03.400 |
a place where there's a lot of brilliant people 01:23:18.080 |
- Mostly physicists and electrical engineers, 01:23:40.280 |
more of a superconducting electronics background, 01:23:56.480 |
which requires all the disciplines that you mentioned. 01:24:04.360 |
- I would actually call it more than inspiration. 01:24:20.760 |
I mean, we're very much following the concepts 01:24:25.360 |
that the cognitive sciences have laid out for us, 01:24:33.060 |
it's often stated that we just don't understand the brain, 01:24:37.960 |
because we just don't know what's going on there. 01:24:50.480 |
and I was just taken by the exact opposite sense. 01:24:53.640 |
I can't believe how much they know about this. 01:24:59.280 |
and sort of theoretically complete a lot of the concepts are. 01:25:04.200 |
That's not to say we understand consciousness 01:25:06.660 |
or we understand the self or anything like that, 01:25:13.560 |
Neuroscientists have a lot of answers to those questions. 01:25:15.980 |
So there's a lot, if you're a hardware designer 01:25:19.440 |
whoa, it's pretty clear which direction to go in, I think. 01:25:43.400 |
You know for sure that the ability of each neuron 01:25:47.540 |
to communicate to many thousands of colleagues 01:25:52.360 |
I take that as a core principle of my architecture, 01:26:03.560 |
okay, we're gonna use light for communication. 01:26:10.840 |
I mean, if you think about radio, that's light. 01:26:12.720 |
It's long wavelengths, but it's electromagnetic radiation. 01:26:17.620 |
obeying exactly the same Maxwell's equations. 01:26:27.780 |
but the way you send messages across the ocean 01:26:33.180 |
So using light for communication is not a stretch. 01:26:38.940 |
So you might ask, well, why don't you use light 01:26:41.500 |
for communication in a conventional microchip? 01:26:45.260 |
And the answer to that is, I believe, physical. 01:27:04.000 |
Oh, you need a light source at every single point. 01:27:13.040 |
and nice light source, your definition of neuron 01:27:29.520 |
- I mean, I can explain more about that, but-- 01:27:34.680 |
'cause people often say the light of consciousness. 01:27:38.240 |
So that consciousness is that which is conscious. 01:27:49.480 |
Yours is in optics, mine in light, mine's in darkness. 01:28:26.920 |
silicon is the material of choice for transistors 01:28:35.200 |
Silicon is notoriously bad at emitting light, 01:28:39.120 |
and that has to do with the immutable properties 01:28:43.280 |
of silicon itself, the way that the energy bands 01:28:53.520 |
without doing very exotic things that degrade its ability 01:29:00.000 |
So that's like one of these things where it's, 01:29:11.520 |
So, well, okay, you do give us a light source, 01:29:14.000 |
compound semiconductors, like we talked about 01:29:16.120 |
back at the beginning, an element from group three 01:29:20.320 |
where every other lattice site switches which element it is. 01:29:23.680 |
Those have much better properties for generating light. 01:29:37.440 |
- However, I say it's physics, not engineering 01:29:52.760 |
So you got to make all of your transistors first 01:29:55.700 |
and then put the compound semiconductors on top of there. 01:30:08.000 |
The spacing between atoms is different enough 01:30:11.680 |
So nature does not seem to be telling us that, 01:30:35.440 |
that we can use light and photons for communication? 01:30:43.560 |
You can use light for communication in digital systems. 01:30:46.880 |
Just the light sources are not intimately integrated 01:31:01.020 |
And then you put those together at the package level. 01:31:06.720 |
I would say a great deal of architectural limitations 01:31:14.480 |
package level integration as opposed to monolithic 01:31:20.120 |
And that's where I had done some work previously 01:31:24.640 |
There's a project led by Vladimir Stoyanovich 01:31:28.440 |
that now spun out into a company called IR Labs, 01:31:40.880 |
maybe it's some other control purpose, something. 01:31:43.620 |
And the silicon chip drives the light source chip 01:31:49.760 |
So you can get data out of the package on an optical fiber. 01:31:52.640 |
And that still gives you tremendous advantages in bandwidth 01:32:05.160 |
that they have to be integrated at this package level. 01:32:09.760 |
Those are my colleagues that I respect a great deal. 01:32:28.200 |
it's easier than the alternative, unfortunately. 01:32:33.280 |
and make sure that I'm not taking the wrong way. 01:32:36.800 |
of integrating compound semiconductors with silicon 01:32:45.680 |
they're doing it and they're making progress. 01:32:53.660 |
just the standard monolithic light source on silicon process. 01:33:00.020 |
- Yeah, so nature kind of points the way usually. 01:33:05.700 |
- And it's going to be expensive and not scalable. 01:33:11.300 |
Let's imagine this gigantic neuromorphic computing system 01:33:34.180 |
trashing light source integration with silicon, 01:33:36.140 |
let me now say why I'm basing my entire life, 01:33:41.060 |
on integrating light sources with electronics. 01:33:47.020 |
when you're talking about superconducting electronics. 01:33:57.960 |
those compound semiconductor light sources with silicon. 01:34:05.240 |
that you're using semiconducting electronics. 01:34:08.840 |
you're still going to start with a silicon wafer, 01:34:10.840 |
but it's just the bread for your sandwich in a lot of ways. 01:34:15.800 |
in precisely the same way for the electronics. 01:34:17.720 |
You're now depositing superconducting materials 01:34:34.260 |
to be intimately integrated with the transistors. 01:34:37.900 |
They don't need to be lattice matched to the silicon, 01:34:43.600 |
that you can take those compound semiconductor light sources, 01:34:49.140 |
and then grow your superconducting electronics 01:35:03.920 |
and the light sources receive signals that say, 01:35:09.800 |
send it out to all your downstream synaptic connections. 01:35:12.460 |
Those are, again, superconducting electronics. 01:35:16.220 |
Perform your computation, and you're off to the races. 01:35:22.620 |
so what are the limitations or the challenges 01:35:28.940 |
when we think about constructing these kinds of systems? 01:35:44.020 |
- Okay, one other thing about the light sources. 01:35:45.740 |
I said that silicon is terrible at emitting photons. 01:36:00.260 |
silicon is not obviously a terrible light source. 01:36:03.420 |
It's still not as efficient as compound semiconductors, 01:36:05.860 |
but it might be good enough for this application. 01:36:08.680 |
The final thing that I'll mention about that is, again, 01:36:15.380 |
superconducting detectors can receive one single photon. 01:36:21.260 |
that semiconductors can also receive photons. 01:36:23.460 |
That's the primary mechanism by which it's done. 01:36:26.220 |
A camera in your phone that's receptive to visible light 01:36:32.220 |
or you can make it in different semiconductors 01:36:43.580 |
Now, when you're using a superconducting detector, 01:36:50.860 |
So the fact that your synapses can now be based 01:37:06.540 |
You can have the world's worst light sources. 01:37:08.780 |
As long as they spit out maybe a few thousand photons 01:37:21.700 |
To me, optoelectronic integration, it's just so enticing. 01:37:26.980 |
electronics for computation, light for communication, 01:37:30.380 |
working with silicon microelectronics at room temperature 01:37:35.020 |
And I hope that when we move to the superconducting domain, 01:37:47.660 |
maybe optoelectronic integration comes to us. 01:37:53.680 |
So in these kind of hybrid heterogeneous systems, 01:37:59.560 |
What are some of the constraints of the operation here? 01:38:01.600 |
Does it all have to be at four Kelvin as well? 01:38:06.840 |
- Okay, so what are the other engineering challenges 01:38:09.720 |
of making this kind of optoelectronic systems? 01:38:14.320 |
- Let me just dwell on that four Kelvin for a second 01:38:21.540 |
And to me, that's very earth-centric, species-centric. 01:38:27.260 |
so we want our technologies to operate there too. 01:38:45.660 |
the temperature of the cosmic microwave background 01:38:52.980 |
So when the universe dies out, it'll be colder than 4K. 01:38:59.380 |
In the expanses, you don't have to get that far 01:39:07.780 |
- So what you're saying is the aliens that live 01:39:13.280 |
are using superconducting material for their computation. 01:39:16.420 |
- They don't have to live at the edge of the universe. 01:39:21.040 |
in their solar system are doing this in their asteroid belt. 01:39:27.780 |
- Oh, because they can get to that temperature easier there? 01:39:31.660 |
- Sure, yeah, all you have to do is reflect the sunlight 01:39:42.900 |
- Well, okay, so what I want to say about temperature 01:39:47.900 |
is that if you can swallow that, if you can say, 01:39:53.940 |
all right, I give up applications that have to do 01:39:56.760 |
with my cell phone and the convenience of a laptop 01:40:11.000 |
Instead, I'm thinking about scientific questions. 01:40:13.380 |
Well, it's just not that bad to have to work at 4K. 01:40:25.540 |
usually has to work at something like 100 millikelvin, 01:40:29.640 |
So now you're talking of another factor of 100 01:40:32.100 |
even colder than that, a fraction of a degree. 01:40:35.100 |
And everybody seems to think quantum computing 01:40:46.580 |
And yet it's not stopping people from investing 01:40:50.260 |
And by investing, I mean, putting their research into it 01:40:55.740 |
- So based on the energy of what you're commenting on, 01:40:59.580 |
I'm getting a sense that's one of the criticism 01:41:01.940 |
of this approach is 4K, 4Kelvin is a big negative. 01:41:12.880 |
I'm not saying that that's not a consideration. 01:41:21.480 |
In the academic world, suppose you spent your whole life 01:41:24.440 |
learning about silicon microelectronic circuits. 01:41:26.800 |
You send a design to a foundry, they send you back a chip 01:41:33.020 |
And now I'm saying, here, now learn how to use 01:41:35.720 |
all these cryogenics so you can do that at 4Kelvin. 01:41:42.060 |
- It's the old momentum, the Titanic of the turning. 01:41:45.640 |
- But you're saying that's not too much of a, 01:41:48.380 |
when we're looking at large systems and the gain 01:41:53.460 |
- And when you wanna answer the scientific question 01:41:55.180 |
about what are the physical limits of cognition? 01:42:04.620 |
orders of magnitude beyond any room temperature technology, 01:42:07.680 |
but you gotta get cold to do it, you're gonna do it. 01:42:10.440 |
And to me, that's the interesting application space. 01:42:18.920 |
So I personally am not going to let low temperature 01:42:22.560 |
stop me from realizing a technological domain 01:42:33.060 |
everything else that I'm looking for in my hardware. 01:42:37.640 |
Is there other kind of engineering challenges 01:42:43.120 |
because I haven't really described what I mean 01:42:45.760 |
by a neuron or a network in this particular hardware. 01:42:49.000 |
- Yeah, do you wanna talk about loop neurons? 01:43:01.080 |
- Right, so let me say big picture based on optics, 01:43:35.340 |
The amount of current that that adds to a loop 01:43:46.440 |
it hits a superconducting single photon detector, 01:44:15.640 |
Is this a funny word for something deeply technical? 01:44:18.880 |
- No, let's try to avoid using the word fluxon 01:44:48.540 |
- When you're using superconductors for this, 01:45:07.120 |
you would say that's your postsynaptic signal. 01:45:09.200 |
You have this current being stored in a loop. 01:45:13.520 |
Most likely, you're gonna have it decay exponentially. 01:45:21.020 |
and that's determined by putting some resistor 01:45:30.140 |
strikes a detector, adds current to that loop, 01:45:35.580 |
Then you can process that in a dendritic tree. 01:45:47.060 |
a lot of plasticity mechanisms you can implement 01:45:51.480 |
You have this one simple building block circuit 01:45:54.460 |
that you can use for a synapse, for a dendrite, 01:45:57.260 |
for the neuron cell body, for all the plasticity functions. 01:46:03.780 |
- So this basic building block has both an optical 01:46:12.500 |
You're not at fault for thinking that that's what I meant. 01:46:14.980 |
What I should say is that if you want it to be a synapse, 01:46:18.780 |
you tack a superconducting detector onto the front of it. 01:46:28.380 |
optics in the front, electrical stuff in the back. 01:46:42.500 |
It's all electrical in the superconducting domain. 01:46:44.900 |
For anybody who's up on their superconducting circuits, 01:46:48.540 |
it's just based on a DC squid, the most ubiquitous, 01:46:52.460 |
which is a circuit composed of two Josephson junctions. 01:46:55.140 |
So it's a very bread and butter kind of thing. 01:46:58.700 |
And then the only place where you go beyond that 01:47:06.860 |
or however you've structured that particular unique neuron. 01:47:34.660 |
and produce sufficient voltage to drive current 01:47:39.180 |
So that light source is the semiconducting part of a neuron. 01:47:43.180 |
And that, so the neuron has reached threshold. 01:47:47.660 |
That light then fans out across a network of wave guides 01:47:51.580 |
to reach all the downstream synaptic terminals 01:48:02.340 |
'cause a lot of listeners aren't gonna know that. 01:48:04.780 |
Look up the papers by Jeff Chiles on this one. 01:48:07.140 |
But basically, light can be guided in a simple, 01:48:11.380 |
basically wire of usually an insulating material. 01:48:20.020 |
just like in a fiber optic, it's glass, silicon dioxide. 01:48:26.100 |
So we use different materials like silicon nitride, 01:48:28.020 |
but basically just imagine a rectangle of some material 01:48:39.940 |
that target different sub regions of the network. 01:48:44.980 |
So now we're talking about building in the third dimension, 01:48:58.740 |
One of the things that I believe we have to grapple with 01:49:08.740 |
For the number of neurons that are in our brain, 01:49:13.500 |
as it would have to if we're gonna be biological organisms 01:49:16.180 |
that are resource limited and things like that. 01:49:20.940 |
is almost certainly gonna be much bigger than that 01:49:26.140 |
even whether it's based on silicon transistors. 01:49:30.700 |
that doesn't mean a semiconductor-based neuron 01:49:41.540 |
They end up being on the order of 100 microns 01:49:45.900 |
and it's difficult to get them down any smaller than that. 01:49:48.260 |
The same is true for superconducting neurons, 01:49:52.100 |
if we're trying to use light for communication. 01:49:54.020 |
Even if you're using electrons for communication, 01:50:00.060 |
okay, the size of an electron might be angstroms, 01:50:26.780 |
which is 10 nanometers in diameter, something like that, 01:50:34.620 |
but grappling with the size of these structures 01:50:46.020 |
by scale here I mean number of interconnected neurons, 01:50:59.580 |
Active, I mean superconducting electronic circuits 01:51:07.380 |
that are routing the optical signals to different places, 01:51:11.700 |
If you can get to something like 10 planes of each of those, 01:51:15.260 |
or maybe not even 10, maybe five, six, something like that, 01:51:19.740 |
Now you can get millions of neurons on a wafer, 01:51:22.860 |
but that's not anywhere close to the brain scale. 01:51:26.380 |
In order to get to the scale of the human brain, 01:51:27.980 |
you're gonna have to also use the third dimension 01:51:42.020 |
and that's when you can get to some 10 billion neurons 01:51:46.660 |
to the optoelectronic approach that we're taking. 01:51:51.400 |
where you're trying to reach commensurate scale 01:51:57.540 |
So stacking on the wafer, and stacking of the wafers, 01:52:07.700 |
You're exactly right, because that's the only way 01:52:12.500 |
from a small point to across that whole network. 01:52:21.480 |
Once you're at this scale, to me it's just obvious. 01:52:23.700 |
Of course you're using light for communication. 01:52:25.580 |
You have fiber optics given to us from nature, so simple. 01:52:33.860 |
any kind of electrical communication just doesn't, 01:52:47.360 |
- Yeah, the term loop neurons comes from the fact, 01:52:50.880 |
that they rely heavily on these superconducting loops. 01:52:54.140 |
So even in a lot of forms of digital computing 01:53:01.980 |
in a superconducting loop is a primary technique. 01:53:09.420 |
So the strength of a synaptic weight is gonna be set 01:53:13.580 |
by the amount of current circulating in a loop 01:53:18.620 |
So memory is implemented as current circulating 01:53:24.900 |
The coupling between, say, a synapse and a dendrite 01:53:29.940 |
occurs through loop coupling through transformers. 01:53:57.940 |
better than superconducting optoelectronic neurons. 01:54:23.220 |
It's just four Josephson junctions or something like that, 01:54:27.100 |
depending on how much complexity you wanna add. 01:54:29.180 |
So you can just directly simulate each component in SPICE. 01:54:35.860 |
- It's Standard Electrical Simulation Software, basically. 01:54:41.480 |
the differential equations that describe the circuit elements. 01:54:44.340 |
- And then you can stack these things together 01:54:46.200 |
in that simulation software to then build circuits. 01:54:48.860 |
- You can, but that becomes computationally expensive. 01:54:55.760 |
to more things you can do at home in your basement 01:54:59.380 |
or whatever, and one of them was computational modeling. 01:55:12.540 |
the circuit equations, which for Josephson junctions 01:55:15.860 |
usually needs to be done on like a picosecond timescale 01:55:21.220 |
So it results in a lot of differential equations 01:55:26.320 |
We were looking for a way to simulate these circuits 01:55:29.400 |
that is scalable up to networks of millions or so neurons 01:55:36.600 |
So we were able to analyze the behavior of these circuits. 01:55:40.780 |
And as I said, it's based on these simple building blocks. 01:55:46.320 |
And if you get a good model of that, boom, it tiles. 01:55:52.800 |
but it's all based on now it's one differential equation 01:55:57.100 |
So one differential equation for every synapse, 01:56:05.300 |
it's just a simple leaky integrate and fire model, 01:56:13.420 |
So I'm really fascinated by how this one simple component 01:56:18.420 |
can be used to achieve lots of different types 01:56:24.420 |
And to me, that's where scalability comes from. 01:56:39.360 |
and then emergent new behavior that was hard to predict 01:56:44.700 |
And that's exactly what we're working with here. 01:56:50.380 |
and from a hardware manifestation perspective 01:56:52.740 |
where we can hopefully start to have this test bed 01:57:03.500 |
that connected to other physics like critical phenomenon, 01:57:08.660 |
So you were asking how we simulate these circuits. 01:57:14.540 |
and we've got the simple spice circuit stuff. 01:57:21.740 |
based on this more efficient leaky integrator. 01:57:45.780 |
- And just like simulating quantum mechanical systems 01:57:49.860 |
So the goal here is to understand such systems. 01:57:58.940 |
I'm not drawn towards turning this into an enterprise 01:58:04.440 |
- Short-term applications that obviously make a lot of money 01:58:12.240 |
If you're interested in short-term making money, 01:58:14.020 |
go with deep learning, use silicon microelectronics. 01:58:16.500 |
If you wanna understand things like the physics 01:58:23.020 |
or if you wanna understand something more along the lines 01:58:26.220 |
of the physical limits of what can be achieved, 01:58:32.680 |
superconducting electronics, extremely exciting. 01:58:35.940 |
- What if I wanna use superconducting hardware 01:58:43.260 |
That's the reason I wanted to talk to you today. 01:59:02.340 |
- So what, like if you look at the scale of five, 10, 20 years 01:59:07.340 |
is it possible to, before we understand the nature 01:59:12.100 |
of human intelligence and general intelligence, 01:59:14.620 |
do you think we'll start falling out of this exploration 01:59:19.260 |
of neuromorphic systems ability to solve some 01:59:23.260 |
of the problems that the machine learning systems 01:59:34.140 |
Also, I don't really understand machine learning 01:59:47.420 |
what your input is and also what your goal is. 01:59:57.300 |
I mean, of course, a lot of times that's the case, 02:00:00.180 |
you know, there's a picture and there's a horse in it, 02:00:09.500 |
it's almost defined by the ability to handle problems 02:00:13.200 |
where you don't know what your inputs are going to be 02:00:18.600 |
I mean, I'm not sure what I'm trying to accomplish 02:00:48.620 |
that was very difficult to predict, things like that. 02:00:55.540 |
if they found themselves working on this project 02:00:59.260 |
what are all the different ways we can use this 02:01:09.300 |
particularly in the superconducting side of things, 02:01:11.680 |
using the incredible speed, power efficiency, 02:01:14.880 |
also Ken Segal at Colgate, other people working 02:01:17.680 |
on specifically the superconducting side of this 02:01:20.480 |
for machine learning and deep feed-forward neural networks. 02:01:28.760 |
- Yeah, so that's less on the nature of intelligence 02:01:31.840 |
and more on various characteristics of this hardware. 02:01:39.440 |
- Yeah, one of the things-- - And communication. 02:01:40.640 |
- One of the things that Mike Schneider's working on 02:01:44.640 |
At a relatively small scale, I think he's targeting 02:01:52.340 |
you put in a nine pixel image and you classify it 02:02:00.120 |
to see what happens there because if you can show 02:02:04.160 |
that even at that scale, you just put these images in 02:02:06.940 |
and you get it out and he thinks he can do it. 02:02:14.000 |
It's probably less, it's probably 100 picoseconds 02:02:21.560 |
the electronic circuit is extremely power efficient. 02:02:29.240 |
But when you have to cool it down to four Kelvin, 02:02:31.440 |
you pay a huge overhead just for keeping it cold, 02:02:45.120 |
It's just, it's gonna have to get that performance. 02:02:48.000 |
And this is sort of what you were asking about before 02:02:49.720 |
is like how much better than silicon would it need to be? 02:02:54.120 |
I think if it's just overall better than silicon 02:02:57.240 |
at a problem that a lot of people care about, 02:03:03.960 |
maybe it's monitoring credit transactions, I don't know. 02:03:17.680 |
to the various systems like Tesla recently announced DOJO, 02:03:22.680 |
which is a large scale machine learning training system. 02:03:46.720 |
In the long term, it could be the whole thing. 02:04:09.560 |
so let me explain to you the way things work in-- 02:04:26.440 |
a very, very large scale modular machine learning ASIC, 02:04:36.060 |
that are small and big working on this kind of problem. 02:04:39.360 |
The question is how to do it in a modular way 02:04:45.640 |
The interesting aspect of Tesla is you have a company 02:04:49.520 |
that, at least at this time, is so singularly focused 02:04:54.660 |
on solving a particular machine learning problem 02:04:57.740 |
and is making obviously a lot of money doing so 02:05:02.220 |
happens to be involved with autonomous driving. 02:05:05.180 |
And so you have a system that's driven by an application. 02:05:09.780 |
And that's really interesting because you have maybe Google 02:05:17.740 |
You have all these other companies with ASICs. 02:05:21.460 |
They're usually more kind of always thinking general. 02:05:25.980 |
So I like it when it's driven by a particular application 02:05:37.300 |
you may not always get to the right solutions. 02:05:40.300 |
It's nice to couple that sometimes with specific, 02:05:51.140 |
Sometimes just having that focus on that application 02:05:57.700 |
I think that, so one of the things that's appealing 02:06:13.180 |
Yeah, so I guess, let me just try to understand, 02:06:32.320 |
- No, it's straight up retraining a large neural network 02:06:38.580 |
- So you have to do it once for every new car? 02:06:44.820 |
which I think is a process for machine learning, 02:07:07.820 |
that updates some weights in a table and yeah, okay. 02:07:13.460 |
- And each of those cars run into trouble rarely, 02:07:34.740 |
and then the network has to become smart enough 02:07:37.540 |
to now be able to perform in those edge cases 02:07:41.820 |
There's clever ways of retraining different parts 02:07:46.260 |
I think they prefer to retrain the entire thing. 02:07:58.580 |
is to have a very large machine learning focused, 02:08:10.340 |
But like, so they're not limiting themselves completely 02:08:29.980 |
and that iterative process achieves something 02:08:37.980 |
So it has to do with training neural networks fast 02:08:54.540 |
where the scale of superconducting optoelectronics, 02:08:58.940 |
the way that, so when you talk about the weaknesses, 02:09:02.540 |
like I said, okay, well, you have to cool it down. 02:09:05.780 |
Because that's not too much of an added cost. 02:09:12.980 |
And also you have one centralized kind of cognitive hub, 02:09:24.340 |
a superconducting system in a car, that's questionable. 02:09:35.700 |
But since this is sort of a central supreme intelligence 02:09:39.060 |
or something like that, and it needs to really have 02:09:42.600 |
this massive data acquisition, massive data integration, 02:09:49.140 |
spiking neural networks with vast communication 02:09:55.740 |
There's a lot of development that needs to be done. 02:09:58.240 |
But we have to be patient with self-driving cars 02:10:02.260 |
We were all optimistic that they would be here by now. 02:10:06.460 |
but if we're thinking five or 10 years down the line, 02:10:14.780 |
that getting into self-driving cars and technologies 02:10:21.520 |
Elham Tabassi is leading up a much larger effort 02:10:35.020 |
So when you're going to deploy this neural network 02:10:38.900 |
in every single automobile with so much on the line, 02:10:45.260 |
So now how do we know that we can trust that? 02:10:48.020 |
How do we know that we can trust the self-driving car 02:10:58.200 |
I mean, you're familiar with the problem and all that. 02:11:01.820 |
But there's a fascinating dance in engineering 02:11:14.800 |
for them to be provably correct or provably safe. 02:11:32.180 |
like robustness to imperfection, be part of our systems, 02:11:37.180 |
which is a difficult thing for engineers to contend with. 02:11:53.660 |
along the lines of, I can give you an intelligent system 02:12:00.900 |
And it's in sort of creativity and abstract thinking 02:12:11.360 |
that perform exactly the same way every time. 02:12:13.780 |
- This is where like the disagreement I have with, 02:12:16.020 |
not disagreement, but a different view on the world. 02:12:26.460 |
colleagues that are roboticists, the goal is perfection. 02:12:31.460 |
And to me is like, no, I think the goal should be 02:12:40.980 |
And through the interaction between humans and robots, 02:12:44.180 |
that imperfection becomes a feature, not a bug. 02:12:56.420 |
but the robot itself is not perfect in any way. 02:13:08.700 |
a human robot interaction problem, not a robotics problem. 02:13:14.340 |
That's actually an open and fascinating question, 02:13:18.620 |
whether humans can be removed from the loop completely. 02:13:24.460 |
We've talked about a lot of fascinating chemistry 02:13:31.260 |
and we're always running up against this issue 02:13:33.700 |
that nature seems to dictate what's easy and what's hard. 02:13:44.500 |
"Does Cosmological Evolution Select for Technology?" 02:13:53.180 |
that seem to define the way our universe works, 02:13:56.220 |
that physics works, that if it worked any differently, 02:14:01.740 |
So it seems like the parameters are very fine tuned 02:14:16.820 |
it may be that the universe has also fine tuned 02:14:33.420 |
Let me just try to say a few things in my language. 02:14:41.660 |
So physicists have spent centuries trying to understand 02:14:46.540 |
the system of equations that govern the way nature behaves, 02:14:51.540 |
the way particles move and interact with each other. 02:14:55.060 |
And as that understanding has become more clear over time, 02:15:00.060 |
it became sort of evident that it's all well adjusted 02:15:05.820 |
to allow a universe like we see, very complex, 02:15:16.500 |
And so one answer to that is, well, of course it is, 02:15:21.540 |
But I don't know, that's not very satisfying. 02:15:29.220 |
We can only observe a universe that is fit for us to live in. 02:15:38.420 |
it is based partially on coming up with equations 02:16:01.140 |
of different couplings, how strongly does a charged particle 02:16:05.200 |
couple to the electromagnetic field or masses? 02:16:07.660 |
How strongly does a particle couple to the Higgs field 02:16:19.780 |
but the relative importance of different terms, 02:16:25.260 |
as the structure of the equations themselves. 02:16:31.200 |
and trying to see, okay, if I adjust the parameter, 02:16:48.940 |
And somebody had the phrase, this looks like a put-up job, 02:16:52.920 |
meaning every one of these parameters was dialed in. 02:16:57.060 |
It's arguable how precisely they have to be dialed in, 02:17:10.020 |
So, okay, maybe I think the majority position 02:17:17.020 |
it has to be that way in order for us to exist. 02:17:22.780 |
And I don't know, for a while, that never sat well with me, 02:17:26.100 |
but I just kind of moved on because there are things to do 02:17:33.780 |
but as I started working more with technology, 02:17:39.340 |
getting into the more recent years of my career, 02:17:43.740 |
after having worked with silicon for a long time, 02:17:49.020 |
but then when I switched over to superconductors, 02:17:57.380 |
that our universe gives us superconductivity. 02:18:00.580 |
It's one of the most beautiful physical phenomena, 02:18:02.500 |
and it's also extraordinarily useful for technology. 02:18:07.940 |
has to have the parameters it does for us to exist 02:18:14.800 |
Why does it give us silicon that has this ideal oxide 02:18:22.500 |
That can't be explained by the same anthropic reasoning. 02:18:30.380 |
I mean, a slight natural extension of that question is, 02:18:39.420 |
if we would simply have just another set of paint brushes 02:18:50.900 |
but would nevertheless have incredible complexity, 02:18:54.540 |
which is if you sort of zoom out and start defining things, 02:19:03.580 |
but more like how much complexity is within the system 02:19:07.300 |
- Well, yeah, you can start to quantify things. 02:19:26.660 |
That would be a shameful waste of resources, it seems. 02:19:43.460 |
that allow our particular kind of universe to arise 02:19:49.780 |
- See, I wonder if nature in this kind of discussion, 02:20:03.980 |
is it just make it super easy to do awesome stuff 02:20:08.020 |
but we could still do awesome stuff in other ways, 02:20:31.740 |
- So who is Lee Smolin and what kind of ideas does he have? 02:20:35.620 |
- Okay, Lee Smolin is a theoretical physicist 02:20:51.380 |
So his paper was called "Did the Universe Evolve?" 02:20:54.460 |
And I gave myself the liberty of titling my paper, 02:20:59.460 |
"Does Cosmological Evolution Select for Technology?" 02:21:14.620 |
unifying quantum mechanics with general relativity, 02:21:19.280 |
as you can read about in his most recent book, I believe, 02:21:29.380 |
because I think that is one of the core ideas 02:21:35.300 |
of how the universe got here, our role in it, 02:21:47.280 |
And what that means is if you look back in time 02:21:52.540 |
you see that everything used to be at one point, 02:21:58.820 |
There was an era in the evolutionary process of our universe 02:22:05.420 |
and this idea was developed primarily by Alan Guth 02:22:08.820 |
and others, Andre Linde and others in the '80s. 02:22:27.580 |
where it just accelerates incredibly rapidly. 02:22:33.740 |
this tells us that this should produce matter 02:22:39.980 |
lithium too, lithium also, and other things too. 02:22:48.500 |
have stood up extremely well to empirical verification, 02:22:52.540 |
the cosmic microwave background, things like this. 02:22:59.540 |
think that the origin of our universe is the Big Bang. 02:23:08.040 |
I'm just laying this out there so that people understand 02:23:14.180 |
not a replacement of existing well-founded ideas. 02:23:31.540 |
a Big Bang is inextricably linked with a black hole. 02:23:39.240 |
is mathematically indistinguishable from a black hole. 02:23:44.340 |
And Lee Smolin based his thinking on that idea, I believe. 02:23:49.340 |
I don't mean to speak for him, but this is my reading of it. 02:24:00.700 |
And this allows us to have progeny, offspring. 02:24:14.180 |
I think this is potentially one of the great ideas 02:24:18.620 |
that when a black hole forms, it's not a classical entity, 02:24:29.800 |
The properties, what we're calling the parameters 02:24:48.380 |
They're close, but they're a little bit different. 02:24:50.420 |
And so now you have a mechanism for evolution, 02:24:59.660 |
and then if you think about the DNA of the universe 02:25:03.260 |
are the basic parameters that govern its laws. 02:25:11.540 |
from an evolutionary process that can be traced back 02:25:17.660 |
Initially, there was something like a vacuum fluctuation 02:25:24.220 |
a universe that was able to reproduce just once. 02:25:28.860 |
And then over time, it was able to make more and more 02:25:31.020 |
until it evolved into a highly structured universe 02:25:35.300 |
with a very long lifetime, with a great deal of complexity. 02:25:40.220 |
And importantly, especially importantly for Lee Smolin, 02:25:47.220 |
Therefore, we should expect our universe to be optimized, 02:25:59.500 |
So we expect the physics of our universe to have evolved 02:26:03.780 |
to maximize fecundity, the number of offspring. 02:26:19.820 |
this is back to our fractal view of everything 02:26:27.180 |
That is very compelling and a very powerful idea 02:26:36.060 |
and perhaps the origin of ideas and intelligence. 02:26:42.180 |
the evolution of those and then the evolution 02:27:08.580 |
Different people use the word multiverse in different ways. 02:27:11.500 |
In the multiverse that I think is relevant to picture 02:27:30.200 |
and it evanesces back into the quantum vacuum. 02:27:33.980 |
And ideas of Guth before that and Andre Linde 02:27:38.980 |
with eternal inflation aren't that different, 02:28:00.760 |
this universe, what we can measure and interact with 02:28:10.780 |
And there are countless, potentially infinity, 02:28:13.600 |
other universes, other entire evolutionary trajectories 02:28:24.060 |
of achieving complexity and reproduction and all that stuff. 02:28:29.460 |
is a funnel towards this end point, not at all. 02:28:34.220 |
Just like the biological evolutionary process 02:28:44.980 |
No, we have extraordinary diversity around us. 02:28:52.180 |
it might feel like we're at the center of this process. 02:29:04.220 |
I mean, I just can't understand how with this idea, 02:29:25.660 |
in truly understanding the concept of infinity. 02:29:31.940 |
But also the concept of a lot is pretty difficult. 02:29:41.820 |
and you can't even get down to your toes, it's like. 02:29:44.460 |
- It's like, all right, 1,000, fine, a million. 02:29:51.740 |
- So this particular, so when we say technology, by the way, 02:29:54.860 |
I mean, there's some, not to over-romanticize the thing, 02:30:00.660 |
but there is some aspect about this branch of ours 02:30:04.180 |
that allows us to, for the universe to know itself. 02:30:08.980 |
- So to have like little conscious cognitive fingers 02:30:13.980 |
that are able to feel, like to scratch the head. 02:30:19.660 |
- To be able to construct E equals MC squared 02:30:22.060 |
and to introspect, to start to gain some understanding 02:30:32.060 |
Okay, I'm just human, but it feels like that, 02:30:35.620 |
if I were to build a system that does this kind of thing, 02:30:39.260 |
that evolves laws of physics, that evolves life, 02:30:42.100 |
that evolves intelligence, that my goal would be 02:30:56.060 |
I mean, I'm spending my career designing things 02:30:59.780 |
so that you and I aren't too far apart on that one. 02:31:02.700 |
- But then maybe that problem is a lot harder 02:31:07.820 |
- Let's not get too far, because I want to emphasize 02:31:10.840 |
something that what you're saying is, isn't it fascinating 02:31:14.620 |
that the universe evolved something that can be conscious, 02:31:25.740 |
Lee Smolin has argued, I think, right on almost 02:31:30.340 |
every single way that cosmological natural selection 02:31:35.340 |
could lead to a universe with rich structure, 02:31:43.180 |
to make a lot of stars so that they can make black holes. 02:31:46.140 |
But that doesn't explain what we're doing here. 02:31:48.380 |
In order for that to be an explanation of us, 02:31:51.380 |
what you have to assume is that once you made that universe 02:31:55.380 |
that was capable of producing stars, life, planets, 02:31:59.220 |
all these other things, we're along for the ride. 02:32:02.180 |
We're kind of arising, growing up in the cracks, 02:32:09.540 |
And I don't necessarily have a philosophical opposition 02:32:20.220 |
- So it seems like whatever we got going on here to you, 02:32:25.660 |
seems like a thing you might want to select for 02:32:36.820 |
it only cares about making offspring universes 02:32:39.800 |
because then there's gonna be the most of them 02:32:45.160 |
which is the set of all possible universes, let's say. 02:32:52.860 |
You know they have to be made by black holes. 02:32:57.980 |
The Big Bang tells us that black holes make universes. 02:33:19.260 |
that can efficiently compress matter into a singularity, 02:33:23.380 |
it turns out that if you can compress about 10 kilograms 02:33:30.580 |
highly probable to inflate into its own offspring universe. 02:33:34.820 |
This is according to calculations done by other people 02:33:40.260 |
and I hope I am grasping what they're telling me correctly. 02:33:52.700 |
which is that what might have happened is that, 02:33:56.300 |
okay, this particular branch on the vast tree of evolution, 02:34:01.160 |
cosmological evolution that we're talking about, 02:34:03.240 |
not biological evolution within our universe, 02:34:10.940 |
Got to the stage where stars were making lots of black holes, 02:34:15.840 |
but then continued to evolve and somehow bridge that gap 02:34:21.220 |
and intelligence capable of devising technologies 02:34:25.680 |
intelligent species working in conjunction with technologies 02:34:32.140 |
- Yeah, more efficiently, more faster and better 02:34:36.860 |
Then you start to have different kind of mechanisms 02:34:38.820 |
and mutation perhaps, all that kind of stuff. 02:34:40.820 |
- And so if you do a simple calculation that says, 02:34:45.300 |
we know roughly how many core collapse supernovae 02:35:00.620 |
that it would be possible for a technological civilization 02:35:07.420 |
with the energy and matter at their disposal. 02:35:09.980 |
And so one of the calculations in that paper, 02:35:15.660 |
is that if you take a relatively common asteroid, 02:35:20.660 |
something that's about a kilometer in diameter, 02:35:33.360 |
then you would have made at least a trillion black holes 02:35:44.820 |
So now if you envision an intelligent species 02:35:46.880 |
that would potentially have been devised initially by humans, 02:35:58.980 |
They just have to get out to the asteroid belt. 02:36:05.280 |
the rate at which stars are producing offspring universes. 02:36:13.120 |
- Yeah, so you have to somehow become masters of gravity? 02:36:20.640 |
but any force that can make the energy density 02:36:28.320 |
a great enough energy density can form a singularity. 02:36:44.240 |
recently blasted a pellet with 100 really bright lasers 02:37:05.900 |
about gravitational waves and the fact that us humans 02:37:11.720 |
one of the most impressive technological accomplishments 02:37:17.380 |
being able to precisely detect gravitational waves. 02:37:25.020 |
that other alien civilizations from very far distances 02:37:29.540 |
communicate with gravity, with gravitational waves. 02:37:34.340 |
Because as you become greater and greater master of gravity, 02:37:37.780 |
which seems way out of reach for us right now, 02:37:40.580 |
maybe that seems like a effective way of sending signals, 02:37:44.060 |
especially if your job is to manufacture black holes. 02:37:56.420 |
'cause we tend to think other alien civilizations 02:37:59.980 |
but if we think of alien civilizations out there 02:38:19.160 |
- Well, okay, let me ask, okay, this is great. 02:38:29.180 |
which is how many alien civilizations are out there? 02:38:42.300 |
the parameters of our universe have been selected 02:38:56.460 |
let's define that as the ability to produce black holes, 02:39:12.700 |
But there's also not a super strong motivation, well, 02:39:24.060 |
So one of the things that I try to emphasize in that paper 02:39:27.540 |
is that given this idea of how our parameters 02:39:35.100 |
it's clear that it's a series of trade-offs, right? 02:39:39.260 |
If you make, I mean, in order for intelligent life 02:39:42.020 |
of our variety or anything resembling us to occur, 02:39:51.060 |
but you also need water to have certain properties. 02:39:58.740 |
like the Earth to be within the habitable zone, 02:40:00.620 |
all these things that you start talking about 02:40:18.980 |
or to make rocky planets like Earth the most numerous. 02:40:32.780 |
and I've laid out a few that I think are promising, 02:40:40.860 |
makes it more or less likely that stars would form 02:40:47.300 |
are likely to form, all these different things. 02:40:54.580 |
And the prediction would be that we kind of sit 02:40:57.140 |
at this central point where if you move the parameters 02:41:03.140 |
or life doesn't form, or technology's infeasible, 02:41:14.140 |
We don't have the, well, I'm speaking for myself, 02:41:23.820 |
but we don't know, we have not been able to quantify yet 02:41:28.820 |
how finely adjusted the parameters would need to be 02:41:33.820 |
in order for silicon to have the properties it does. 02:41:35.580 |
Okay, this is not directly speaking to what you're saying, 02:41:39.500 |
which is where are they, where are the life forms out there, 02:41:44.580 |
What I'm trying to argue is that if this framework 02:41:49.100 |
is on the right track, a potentially correct explanation 02:41:52.480 |
for our existence, it doesn't necessarily predict 02:41:56.340 |
that intelligent civilizations are just everywhere, 02:41:59.300 |
because even if you just get one of them in a galaxy, 02:42:10.140 |
- Yeah, and I wonder, once you start generating 02:42:15.420 |
how that has effect on the, what kind of effect does it have 02:42:19.940 |
on the other candidates, civilizations within that universe? 02:42:35.800 |
- It could, but I guess if you want me to put my chips 02:42:41.300 |
on the table or whatever, I think I come down more 02:42:44.380 |
on the side that intelligent life civilizations are rare. 02:42:57.140 |
And also, there's a lot of papers coming out recently 02:43:00.980 |
in the field of astrobiology that are seeming to say, 02:43:05.980 |
on some modified Drake equation or something like that. 02:43:12.940 |
You shouldn't be surprised that an intelligent species 02:43:17.940 |
But if you think there's one the next solar system over, 02:43:23.860 |
the probability of finding a civilization in a galaxy, 02:43:28.180 |
maybe it's most likely that you're gonna find 02:43:36.600 |
Because does that mean in the entire lifetime of the galaxy 02:43:40.700 |
before it, so for in our case, before we run into Andromeda, 02:43:45.700 |
I think it's highly probable, I shouldn't say I think, 02:43:52.940 |
it's tempting to believe that it's highly probable 02:43:59.980 |
you're gonna get at least one intelligent species, 02:44:09.980 |
to think that they're going to coincide in time 02:44:14.980 |
- And also, if you look at the span of life on Earth, 02:44:44.780 |
Most of it, most of it was boring bacteria type of stuff. 02:44:48.660 |
- Well, bacteria are fascinating, but I take your point. 02:44:54.420 |
This kind of speaking from the perspective of your paper 02:44:57.660 |
of something that's able to generate technology 02:45:03.400 |
relative to that full history of life on Earth. 02:45:28.420 |
everything just goes to like, it accelerates very quickly. 02:45:33.360 |
- Yeah, we just don't have enough data to really say, 02:45:36.520 |
but I find this whole subject extremely engaging. 02:45:41.680 |
I think it's called the rare Earth hypothesis, 02:45:49.040 |
after the Hadean era where we were being bombarded. 02:45:52.140 |
Well, after, yeah, bombarded by comets, asteroids, 02:45:54.920 |
things like that, and also after the moon formed. 02:46:03.660 |
And it could have been, we don't know exactly when, 02:46:05.320 |
it could have been remarkably brief that that took. 02:46:12.120 |
I think that alone is sort of a checker on the scale 02:46:34.060 |
for some two and a half billion years or something 02:46:37.260 |
after the metabolic process that releases oxygen 02:46:42.780 |
Then basically the planet's just sitting there 02:46:50.140 |
that you can build these large, complex organisms. 02:47:00.960 |
in any planet that's roughly in the habitable zone 02:47:04.120 |
and has some water on it's probably gonna have those. 02:47:09.280 |
that happened some between five and 600 million years ago, 02:47:19.040 |
So if you say how much life is in our galaxy, 02:47:29.280 |
And then, but the Cambrian explosion kind of went like that 02:47:32.700 |
where within a couple tens or a hundred million years, 02:47:48.680 |
And we've just been shuffling around since then. 02:47:51.600 |
So then what caused humans to pop out of that? 02:47:54.800 |
I mean, that could be another extremely rare threshold 02:47:59.800 |
that a planet roughly in the habitable zone with water 02:48:10.200 |
like the humans cannot possibly be the most amazing thing 02:48:13.040 |
that such, if you look at the entirety of the system, 02:48:17.780 |
that cannot possibly be the most amazing thing 02:48:23.680 |
what's the equivalent in the cosmological evolution 02:48:28.960 |
the equivalent of the human eye or the human brain. 02:48:37.760 |
They're able to just do some incredible generation 02:48:42.080 |
of complexity fast on scale, like much more than, 02:48:47.480 |
it's like most of our universe is pretty freaking boring. 02:48:53.360 |
and there's some like apes that are just like doing podcasts 02:49:02.840 |
If you think about like the amazing thing in the human eye, 02:49:05.960 |
the visual cortex can do, the brain, the nervous, 02:49:15.480 |
Like if there's an equivalent of that for universes, 02:49:19.320 |
like the richness of physics that could be expressed 02:49:31.040 |
I'm a, so from a computer science perspective, 02:49:35.600 |
which is a nice sort of pretty visual way to illustrate 02:49:42.780 |
in drastically different levels of complexity. 02:49:52.080 |
is able to generate pretty triangles and squares 02:49:54.560 |
and therefore we achieve general intelligence. 02:49:57.640 |
And then there'll be like some bad-ass Chuck Norris type, 02:50:01.820 |
like universal Turing machine type of cellular automata. 02:50:06.520 |
They're able to generate other cellular automata 02:50:08.920 |
and that does any arbitrary level of computation off the bat. 02:50:16.440 |
And then we're just like, we'll be forgotten is the story. 02:50:20.840 |
This podcast just entertains a few other apes 02:50:26.880 |
- Well, I'm kind of surprised to hear your cynicism. 02:50:32.080 |
- I usually think of you as like one who celebrates humanity 02:50:41.000 |
I mean, okay, we've been here for 13.7 billion years 02:50:48.480 |
Some other universe could have kicked our butt by now, 02:51:10.520 |
of some plasma and nothing interesting going on there. 02:51:18.880 |
that a lot of really interesting evolutionary processes 02:51:26.240 |
I just mean taking place over extended periods of time 02:51:32.020 |
And then it took that first generation of stars 02:51:44.720 |
So we might still be pretty quick to the game here. 02:51:53.520 |
On those solar systems, you have rocky worlds, 02:51:57.760 |
you have gas giants, like all this complexity. 02:52:03.760 |
through the evolutionary process in life forms 02:52:16.080 |
They're icy, but there might be water underneath. 02:52:33.880 |
is fundamental to the whole reason we enjoy everything. 02:52:38.200 |
I just think that all of us conscious beings, 02:52:44.560 |
at every scale, will be completely forgotten. 02:52:49.320 |
And that would go back to maybe something more like Lao Tzu, 02:52:53.880 |
where it's like, yes, there is nothing but change. 02:53:00.720 |
But I just, in this picture of this hierarchy 02:53:05.520 |
that now it gets to us and that's the pinnacle. 02:53:10.560 |
the story I'm trying to tease out in my research 02:53:13.780 |
is about, okay, well, so then what's the next level 02:53:17.120 |
And if it's, okay, we're kind of pretty smart. 02:53:23.880 |
and Alan Guth, Max Tegmark, okay, we're really smart. 02:53:28.520 |
we can find our way to the grocery store or whatever. 02:53:32.800 |
You know, I mean, what if there's another level 02:53:43.560 |
Imagine these cognitive systems far more capable than us 02:54:07.380 |
And they just live in space with these massive 02:54:09.960 |
collection optics so that they, what do they do? 02:54:13.660 |
They just look out and experience that vast array 02:54:40.160 |
- So in some sense, we're like all what players on a stage 02:54:47.320 |
for this superconducting cold computing system out there. 02:54:56.680 |
The fact that you're actually designing systems 02:54:59.320 |
here on Earth that are trying to push this technological, 02:55:01.560 |
at the very cutting edge, and also thinking about 02:55:15.920 |
It's like the ultimate rigorous application of philosophy 02:55:46.640 |
at all levels of the stack, which I deeply appreciate. 02:55:53.560 |
It's an honor that you spend your valuable time with me. 02:55:57.240 |
- It's an honor that you would spend your time with me. 02:56:05.240 |
To support this podcast, please check out our sponsors 02:56:12.060 |
from the great John Carmack, who surely will be a guest 02:56:20.100 |
anything that an extremely clever graphics programmer 02:56:26.040 |
by a merely competent programmer some number of years later. 02:56:29.380 |
Thank you for listening, and hope to see you next time.