back to indexDmitry Korkin: Evolution of Proteins, Viruses, Life, and AI | Lex Fridman Podcast #153
Chapters
0:0 Introduction
1:57 Proteins and the building blocks of life
9:0 Spike protein
15:48 Coronavirus biological structure explained
20:45 Virus mutations
27:16 Evolution of proteins
37:2 Self-replicating computer programs
44:38 Origin of life
52:11 Extraterrestrial life in our solar system
54:8 Joshua Lederberg
60:7 Dendral
63:1 Why did expert systems fail?
65:12 AlphaFold 2
86:50 Will AI revolutionize art and music?
93:49 Multi-protein folding
98:16 Will AlphaFold 2 result in a Nobel Prize?
100:47 Will AI be used to engineer deadly viruses?
115:54 Book recommendations
125:37 Family
128:15 A poem in Russian
00:00:00.000 |
The following is a conversation with Dmitry Korkin, 00:00:09.740 |
where he specializes in bioinformatics of complex disease, 00:00:22.060 |
plus he is Russian and recites a poem in Russian 00:00:27.780 |
What else could you possibly ask for in this world? 00:00:32.960 |
Brave Browser, NetSuite Business Management Software, 00:00:42.920 |
So the choice is browsing privacy, business success, 00:00:50.660 |
And if you wish, click the sponsor links below 00:00:53.640 |
to get a discount and to support this podcast. 00:01:04.020 |
are the biologists who study viruses without an agenda, 00:01:14.500 |
and exploration of the mysteries within viruses. 00:01:43.780 |
If you enjoy this thing, subscribe on YouTube, 00:01:45.820 |
review it on Apple Podcasts, follow on Spotify, 00:01:53.220 |
And now, here's my conversation with Dmitry Korkin. 00:02:00.720 |
and the amino acid residues that make them up 00:02:25.820 |
and comparing the proteins across different species, 00:02:41.740 |
And so what I mean by that is an average protein 00:02:57.480 |
And so you can imagine a protein as a string of beads, 00:03:22.120 |
So many proteins have more than one function. 00:03:25.880 |
And those protein functions are often carried out 00:03:56.800 |
is the globular part of this protein is a protein domain. 00:04:14.600 |
- And the protein domains are made up of amino acid residues. 00:04:37.640 |
where it's the most, the cleanest element block 00:04:46.220 |
in different kinds of ways to form complex function. 00:04:50.880 |
why is that not talked about as often in popular culture? 00:04:55.160 |
- Well, there are several perspectives on this. 00:04:58.240 |
And one, of course, is the historical perspective, right? 00:05:07.760 |
to structurally resolve to obtain the 3D coordinates 00:05:17.400 |
And smaller proteins tend to be a single domain protein. 00:05:21.000 |
So we have a protein equal to a protein domain. 00:05:24.000 |
And so because of that, the initial suspicion 00:05:27.080 |
was that the proteins, they have globular shapes, 00:05:31.720 |
and the more of smaller proteins you obtain structurally, 00:05:36.720 |
the more you became convinced that that's the case. 00:05:45.720 |
alternative approaches, so the traditional ones 00:05:52.920 |
are X-ray crystallography and NMR spectroscopy. 00:06:13.600 |
to get into the 3D shapes of much larger molecules, 00:06:32.760 |
of a SARS-CoV-2 protein was the cryo-EM structure 00:06:53.960 |
- How many domains is the, is it more than one domain? 00:06:57.480 |
- Oh yes, oh yes, I mean, so it's a very complex structure. 00:07:06.480 |
of a single protein, right, so this structure 00:07:13.720 |
So it needs to form a trimer in order to function properly. 00:07:18.720 |
- So a complex is agglomeration of multiple proteins. 00:07:22.880 |
And so we can have the same protein copied in multiple, 00:07:32.120 |
and forming something that we called a homo-oligomer. 00:07:38.120 |
So in this case, so the spike protein is an example 00:07:50.880 |
- We have these three chains, the three molecular chains 00:07:55.000 |
coupled together and performing the function. 00:07:58.480 |
That's what, when you look at this protein from the top, 00:08:04.760 |
- So, but other, you know, so other complexes 00:08:08.280 |
are made up of, you know, different proteins. 00:08:16.920 |
The hemoglobin molecule, right, so it's actually, 00:08:36.000 |
about the evolution of this, you know, of this molecule. 00:08:40.640 |
And perhaps, so one of the hypothesis is that, you know, 00:08:43.980 |
in the past, it was just a homotetrimer, right? 00:08:48.280 |
So four identical copies, and then it became, you know, 00:08:53.160 |
sort of modified, it became mutated over the time 00:09:00.200 |
- Can we linger on the spike protein for a little bit? 00:09:16.160 |
to understand the structural basis of this virus 00:09:51.960 |
it's a part that is buried into the membrane of the virus 00:10:01.600 |
And it also has a lot of unstable structures around it. 00:10:10.200 |
- Yeah, so people are still trying to understand. 00:10:36.080 |
So because there are three different copies of this chains, 00:10:46.080 |
So this is the receptor binding domains, RBDs, 00:10:56.840 |
And now they are not necessarily going in a sync mode. 00:11:17.160 |
we typically see just one of the arms going out 00:11:21.840 |
and getting ready to be attached to the ACE2 receptors. 00:11:50.640 |
They actually solved the mutated structure of the spike. 00:11:56.560 |
And they showed that actually, because of these mutations, 00:12:08.860 |
And so now, so the frequency of two arms going up 00:12:24.320 |
because now you have two possible opportunities 00:12:31.140 |
It's a very complex molecular process, mechanistic process. 00:12:41.500 |
of the spike trimer, to the human ACE2 receptor. 00:12:54.720 |
what triggers the whole process of encapsulation. 00:12:58.880 |
- If this was dating, this would be the first date. 00:13:05.660 |
- So is it possible to have the spike protein 00:13:10.620 |
Or does it need that interactability with the membrane? 00:13:19.020 |
But when you get this thing attached on the surface, 00:13:32.220 |
again, where people use the cryo-lactone microscopy 00:13:35.780 |
to get the first glimpse of the overall structure. 00:13:43.820 |
about the surface, about what is happening inside, 00:13:47.040 |
because we have literally no clue until recent work 00:14:05.020 |
and it's protected by another protein, N-protein, 00:14:31.660 |
like developing a vaccine or some kind of drug 00:14:36.080 |
- So, I mean, there are many different implications to that. 00:14:41.100 |
it's important to understand the virus itself, right? 00:14:49.620 |
what is the overall mechanistic process of this virus, 00:14:56.540 |
replication of this virus, proliferation to the cell, right? 00:15:03.020 |
The other aspect is designing new treatments, right? 00:15:12.500 |
And so some nanoparticles that will resemble the viral shape 00:15:19.540 |
and essentially would act as a competitor to the real virus 00:15:26.700 |
and thus preventing the real virus entering the cell. 00:15:45.980 |
So there are, you know, to give you a, you know, 00:16:05.900 |
E, envelope protein, that acts as a pentamer, 00:16:17.940 |
It forms dimers, and actually it forms beautiful lattice. 00:16:21.700 |
And this is something that we've been studying 00:16:25.700 |
It actually forms a very nice grid or, you know, 00:16:36.260 |
and they naturally, when you have a bunch of copies 00:16:38.300 |
of each other, they form an interesting lattice. 00:16:41.060 |
And, you know, if you think about this, right, 00:16:49.500 |
needs to be organized somehow, self-organized somehow. 00:16:53.900 |
So, you know, if it was a completely random process, 00:16:57.940 |
you know, you probably wouldn't have the envelope shell 00:17:03.740 |
You know, you would have something, you know, 00:17:16.740 |
get attached to each other in a very specific, directed way. 00:17:24.300 |
We are now, we've been working in the past six months 00:17:32.140 |
on trying to understand the overall structure 00:17:43.580 |
- So the envelope is essentially is the outer shell 00:17:54.860 |
- But get that, the N is likely to interact with M. 00:18:05.660 |
they occur in different copies on the viral particle. 00:18:13.980 |
we only have two or three maybe per each particle. 00:18:26.580 |
that makes up the entire, you know, outer shell. 00:19:06.380 |
so you see these, the antibodies that target, 00:19:11.380 |
you know, spike protein, certain parts of the spike protein, 00:19:15.180 |
but there could be some, also some treatments, right? 00:19:17.940 |
So these are, you know, these are small molecules 00:19:35.620 |
is actually targeting the M-dimer of the protein, 00:19:40.620 |
targeting the proteins that make up this outer shell. 00:19:44.260 |
Because if you're able to destroy the outer shell, 00:19:47.740 |
you're essentially destroying the viral particle itself. 00:19:52.260 |
So preventing it from, you know, functioning at all. 00:20:05.260 |
Is, or like, that's a promising attack vector? 00:20:09.380 |
So I mean, there's still tons of research needs to be, 00:20:20.460 |
from our analysis, from other evolutionary analysis, 00:20:39.980 |
from the evolutionary perspective so drastically 00:20:47.940 |
about mutations of the virus in the United Kingdom. 00:20:55.460 |
You just kind of mentioned about stability and so on. 00:21:02.780 |
and which aspects, if mutated, become more dangerous? 00:21:09.260 |
what are your thoughts and knowledge and ideas 00:21:15.340 |
Are you worried about it from a biological perspective? 00:21:18.460 |
Are you worried about it from a human perspective? 00:21:21.260 |
- So I mean, you know, mutations are sort of a general way 00:21:28.620 |
So it's essentially, this is the way they evolve. 00:21:55.860 |
because every time it jumps, it also mutates, right? 00:21:59.460 |
So when it jumps to the species and jumps back, 00:22:04.460 |
so it acquires some mutations that are sort of driven 00:22:16.300 |
And it's different from the human environment. 00:22:21.420 |
that are acquired in the new species are neutral 00:22:26.180 |
with respect to the human host or maybe damaging. 00:22:33.620 |
But so are you worried about, I mean, it seems like 00:22:42.280 |
and especially with a vaccine just around the corner 00:22:49.140 |
is there some worry that this puts evolutionary pressure, 00:22:53.020 |
selective pressure on the virus for it to mutate? 00:23:02.660 |
in the scientist's mind, what will happen, right? 00:23:12.500 |
about sort of the arms race between the ability 00:23:22.580 |
than the virus essentially becomes resistant to the vaccine. 00:23:41.740 |
simply because there is not that much evidence to that. 00:23:51.140 |
Obviously there are mutations around the vaccine. 00:23:59.860 |
against the season of flu. - Because there's mutations. 00:24:02.740 |
- But I think it's important to study it, no doubts. 00:24:08.860 |
So I think one of the, to me, and again, I might be biased 00:24:15.220 |
because we've been trying to do that as well. 00:24:22.900 |
in understanding the virus is to understand its evolution 00:24:26.580 |
in order to sort of understand the mechanisms, 00:24:30.220 |
the key mechanisms that lead the virus to jump, 00:24:34.140 |
the Nordic viruses to jump from species to another, 00:24:41.700 |
to become resistant to vaccines, also to treatments. 00:24:55.500 |
the future evolutionary traces of this virus. 00:24:58.100 |
- I mean, what, from a biological perspective, 00:25:02.180 |
but is there parts of the virus that if souped up 00:25:07.940 |
through mutation could make it more effective 00:25:12.620 |
We're talking about this specific coronavirus. 00:25:16.860 |
like the membrane, the M protein, the E protein, 00:25:38.140 |
have the greatest impact, potentially damaging impact 00:25:43.460 |
- So it's actually, it's a very good question. 00:25:46.580 |
Because, and the short answer is we don't know yet. 00:25:50.140 |
But of course there is capacity of this virus 00:25:58.700 |
so if you look at the virus, I mean, it's a machine, right? 00:26:01.820 |
So it's a machine that does a lot of different functions. 00:26:05.500 |
And many of these functions are sort of nearly perfect, 00:26:09.820 |
And those mutations can make those functions more perfect. 00:26:14.100 |
For example, the attachment to ACE2 receptor, right? 00:26:36.300 |
That will make this attachment sort of stronger, 00:26:41.300 |
or, you know, something more, in a way more efficient 00:26:46.900 |
from the point of view of this virus functioning. 00:27:20.420 |
let's zoom back out and look at the evolution of proteins. 00:27:29.980 |
on the, quote, "Ongoing expansion of the protein universe." 00:27:51.860 |
And from that, there's now, you know, what is it? 00:27:55.980 |
3.5 billion years later, there's now millions of proteins. 00:28:18.120 |
- So I think, you know, if I were to pick a single keyword 00:28:23.120 |
about protein evolution, I would pick modularity, 00:28:29.340 |
something that we talked about in the beginning. 00:28:54.140 |
It's actually going all the way back to the gene, 00:29:00.180 |
And so, you know, again, these protein domains, 00:29:05.040 |
they are not only functional building blocks. 00:29:18.940 |
and functionally building blocks were discovered, 00:29:22.220 |
they essentially, they stay, those domains stay as such. 00:29:28.220 |
So that's why if you start comparing different proteins, 00:29:31.760 |
you will see that many of them will have similar fragments. 00:29:36.760 |
And those fragments will correspond to something 00:29:44.220 |
because you still have mutations and, you know, 00:29:48.220 |
different mutations are attributed to, you know, 00:29:54.460 |
diversification of the function of this, you know, 00:29:59.020 |
However, you don't, you very rarely see, you know, 00:30:07.980 |
this domain into fragments because, and it's, you know, 00:30:12.160 |
once you have the domain split, you actually, 00:30:18.180 |
you know, you can completely cancel out its function 00:30:26.580 |
And that's not, you know, efficient from the point of view 00:30:32.860 |
So the protein domain level is a very important one. 00:30:56.380 |
And those linkers are completely flexible, you know, 00:31:08.060 |
- So we do have tails, so they're called termini, 00:31:14.060 |
on one and another ends of the protein sequence. 00:31:22.560 |
So they attributed to very specific interactions 00:31:28.560 |
- But you're referring to the links between domains. 00:31:49.060 |
you have the domains that are extremely flexible 00:31:59.640 |
just this linker itself, because it's so flexible, 00:32:03.760 |
it actually can adapt to a lot of different shapes. 00:32:15.520 |
All right, so these things also evolve, you know, 00:32:18.920 |
and they in a way have different sort of laws of, 00:32:23.920 |
the driving laws that underlie the evolution, 00:32:30.500 |
because they no longer need to preserve certain structure, 00:32:41.500 |
you have something that is even less studied. 00:32:53.260 |
So alternative splicing, so it's a very cool concept. 00:32:56.940 |
It's something that we've been fascinated about 00:33:46.100 |
functionally, you know, active protein products. 00:33:57.920 |
The reason it happens is that if you look at the gene, 00:34:10.680 |
So we have a block that will later be translated, 00:34:15.060 |
Then we'll have a block that is not translated, cut out. 00:34:47.180 |
And sometimes we will throw out some of the exons. 00:34:51.400 |
And the remaining protein product will become-- 00:34:56.540 |
- Right, so now you have fragments of the protein 00:35:09.840 |
- So there's some flexibility in this process. 00:35:12.640 |
- So that creates a whole new level of complexity. 00:35:20.880 |
We, and this is where I think now the appearance 00:35:31.280 |
next generation sequencing techniques such as RNA-Seq, 00:35:41.300 |
It's a dynamic event that happens in response to disease, 00:35:45.680 |
or in response to certain developmental stage of a cell. 00:36:05.420 |
And now we have this interplay between what happening, 00:36:12.740 |
and what is happening in the gene and RNA world. 00:36:27.600 |
coincide with the boundaries of the protein domains. 00:36:30.360 |
Right, so there is this close interplay to that. 00:36:45.000 |
And obviously the evolution will pick up this complexity 00:36:57.560 |
And makes this question more complex, but more exciting. 00:37:02.280 |
- As a small detour, I don't know if you think about this 00:37:14.360 |
which are, I don't know if you're familiar with these things, 00:37:23.300 |
the whole purpose of the program is to copy itself. 00:37:31.000 |
That's a very kind of crude, fun exercise of, 00:37:35.440 |
can we sort of replicate these ideas from cells, 00:37:40.000 |
can we have a computer program that when you run it, 00:37:47.080 |
and does it in different programming languages and so on. 00:37:54.920 |
so it was essentially one of the sort of main stages 00:37:59.920 |
in informatics Olympiads that you have to reach 00:38:16.680 |
And so the task then becomes even sort of more complicated. 00:38:24.040 |
And of course it's a function of a programming language, 00:38:27.480 |
but yeah, I remember a long, long, long time ago 00:38:38.640 |
there's an entire site called Code Golf, I think, 00:39:00.440 |
And it makes you actually, people should check it out 00:39:03.600 |
there's some weird programming languages out there. 00:39:26.320 |
And then there's humans that replicate themselves, right? 00:39:40.680 |
So I'm not talking about evolutionary algorithms, 00:39:57.120 |
So we think about machine learning as a system 00:39:59.240 |
that gets smarter and smarter and smarter and smarter. 00:40:01.440 |
At least the machine learning systems of today are like, 00:40:09.120 |
as opposed to throwing a bunch of little programs out there 00:40:12.800 |
and letting them multiply and mate and evolve 00:40:31.000 |
the sort of the area of intelligent agents, right? 00:40:34.480 |
Which are essentially the independent sort of codes 00:40:38.720 |
that run and interact and exchange the information, right? 00:40:45.200 |
I mean, it could be sort of a natural evolution 00:40:53.000 |
- I think it's kind of an interesting possibility. 00:41:00.680 |
we have social networks with millions of people 00:41:03.840 |
I think it's interesting to inject into that, 00:41:05.720 |
there's already injecting into that bots, right? 00:41:14.720 |
It's interesting to think that there might be bots 00:41:23.960 |
that are operating first in the digital space. 00:41:32.600 |
there's robotics labs that take as a fundamental task 00:42:02.320 |
You can imagine like a building that self assembles. 00:42:23.200 |
to go from a single origin protein building block 00:42:34.600 |
So you mentioned the evolutionary algorithm, right? 00:42:38.520 |
and maybe sort of the goal is in a way different, right? 00:42:43.520 |
So the goal is to essentially to optimize your search. 00:42:53.040 |
So people recognize that the recombination events 00:42:58.040 |
lead to global changes in the search trajectories, 00:43:03.280 |
the mutations event is a more refined step in the search. 00:43:08.280 |
Then you have other sort of nature inspired algorithm, right? 00:43:15.040 |
So one of the reason that I think it's one of the funnest one 00:43:24.840 |
So I think the first was introduced by the Japanese group 00:43:30.200 |
where it was able to solve some pretty complex problems. 00:43:35.200 |
So that's, and then I think there are still a lot of things 00:44:08.280 |
Maybe there's other inspirations to be discovered 00:44:10.920 |
in the brain or other aspects of the various systems, 00:44:15.920 |
even like the immune system, the way it interplays. 00:44:20.160 |
I recently started to understand that the immune system 00:44:23.560 |
has something to do with the way the brain operates. 00:44:26.040 |
Like there's multiple things going on in there, 00:44:38.980 |
I'm not sure if you're familiar with the Drake equation 00:44:43.720 |
that estimate, I just did a video on it yesterday 00:44:46.720 |
'cause I wanted to give my own estimate of it. 00:44:49.280 |
It's an equation that combines a bunch of factors 00:44:52.360 |
to estimate how many alien civilizations are. 00:45:01.160 |
you know, it's like how many stars are born every year, 00:45:10.720 |
for this, how many habitable planets are there. 00:45:14.280 |
And then the one that starts being really interesting 00:45:17.560 |
is the probability that life emerges on a habitable planet. 00:45:44.440 |
Okay, first at a high level for the Drake equation, 00:45:55.060 |
do you have thoughts about how life might have started? 00:46:07.460 |
there was a very exciting paper published in Nature 00:46:10.420 |
where they found one of the simplest amino acids, 00:46:23.260 |
So this is, and I apologize if I don't pronounce, 00:46:34.760 |
This is the comet where, and there was this mission 00:46:53.640 |
which makes up, it's one of the 20 basic amino acids 00:47:28.920 |
very well-established sort of group of molecular machines. 00:47:35.400 |
Right, so yeah, it's a very interesting question. 00:47:51.600 |
Like are we really lucky or is it inevitable? 00:48:02.320 |
but it's still like, damn, that's a pretty good chance. 00:48:10.560 |
I'm probably not the best person to do such estimations, 00:48:15.560 |
but I would, intuitively, I would probably put it lower. 00:48:30.500 |
- It's, I think that everything was right in a way, right? 00:48:35.500 |
So still, it's not, the conditions were not like ideal 00:48:39.740 |
if you try to look at what was several billions years ago 00:48:48.340 |
- So there is something called the rare Earth hypothesis 00:48:52.060 |
that in counter to the Drake equation says that 00:49:03.300 |
it's quite a special place, so special it might be unique 00:49:08.060 |
in our galaxy and potentially close to unique 00:49:19.540 |
is all those different conditions are essential for life. 00:49:33.260 |
I'm trying to remember to go through all of them, 00:49:43.800 |
but also the fact that it's like a perfect balance 00:49:53.660 |
I don't know, there's a bunch of different factors 00:50:10.500 |
you need to be very close to an Earth-like planet, 00:50:41.920 |
It's always, the optimums always reach somewhere in between. 00:50:50.040 |
I think that we're probably somewhere in between, 00:51:01.960 |
The problem is we don't know the other extremes. 00:51:05.240 |
I tend to think that we don't actually understand 00:51:08.040 |
the basic mechanisms of what this is all originated from. 00:51:13.040 |
It seems like we think of life as this distinct thing, 00:51:18.520 |
maybe the physics from which planets and suns are born 00:51:23.120 |
is a distinct thing, but that could be a very, 00:51:28.280 |
From simple rules emerges greater and greater complexity. 00:51:31.020 |
So I tend to believe that just life finds a way. 00:51:34.940 |
We don't know the extreme of how common life is, 00:51:43.660 |
Like so everywhere that it's almost like laughable, 00:51:56.280 |
it's like ants thinking that their little colony 00:51:59.460 |
is the unique thing and everything else doesn't exist. 00:52:03.200 |
I mean, it's also very possible that that's the extreme, 00:52:12.860 |
Just to stick on alien life for just a brief moment more, 00:52:16.560 |
there is some signs of life on Venus in gaseous form. 00:52:21.560 |
There's hope for life on Mars, probably extinct. 00:52:37.740 |
- Yeah, and then also I guess, there's a couple moons. 00:52:47.460 |
Are you, is that exciting or is it terrifying to you 00:52:57.940 |
I mean, it was very exciting to hear about this news 00:53:09.260 |
- It'd be nice to have hard evidence of something 00:53:11.620 |
which is what the hope is for Mars and Europa. 00:53:31.820 |
So it's, the moment we discover things outside Earth, 00:53:47.620 |
- I think that that would be another turning point 00:53:52.060 |
And if, especially if it's different in some very new way, 00:53:58.200 |
that's a definitive statement, not a definitive, 00:54:07.720 |
You brought up Joshua Lederberg in an offline conversation. 00:54:13.480 |
I think I'd love to talk to you about AlphaFold, 00:54:20.400 |
so he won the 1958 Nobel Prize in Physiology and Medicine 00:54:24.520 |
for discovering that bacteria can mate and exchange genes, 00:54:32.200 |
like we mentioned, helping NASA find life on Mars, 00:54:47.920 |
Do you, what do you find interesting about this guy 00:54:51.400 |
and his ideas about artificial intelligence in general? 00:54:55.000 |
- So I have a kind of personal story to share. 00:55:12.560 |
so it's different from the feature-based machine learning, 00:55:24.000 |
was to cheminformatics and computer-aided drug design. 00:55:31.360 |
as a part of my research, I developed a system 00:55:35.680 |
that essentially looked at chemical compounds 00:55:44.680 |
male hormones, right, and tried to figure out 00:55:53.160 |
the structural building blocks that are important, 00:56:20.560 |
you know, with some machine learning background, 00:56:32.200 |
asking lots of questions on one of the sort of 00:56:46.360 |
- Yeah, on the internet. - Yeah, so you essentially, 00:56:49.680 |
you have a bunch of people and you post a question 00:56:56.000 |
And back then, one of the most popular forums was CCL. 00:57:14.040 |
- Yes, I asked questions, also shared some, you know, 00:57:20.600 |
how we do and whether whatever we do makes sense. 00:57:25.080 |
And so, you know, and I remember that one of these posts, 00:57:31.400 |
I would call it desperately looking for a chemist's advice, 00:57:40.720 |
And so I post my question, I explained, you know, 00:57:48.960 |
and what kind of applications I'm planning to do. 00:58:06.880 |
He's like, "You won't believe who replied to you." 00:58:13.880 |
He said, "Well, you know, there is a message to you 00:58:18.040 |
And my reaction was like, "Who is Joshua Lederberg?" 00:58:29.720 |
Joshua wrote me that we had conceptually similar ideas 00:58:40.160 |
- And we should also, sorry, and this is a side comment, 00:58:57.640 |
responding to young whippersnappers on the CCL forum. 00:59:02.960 |
- And so back then he was already very senior. 00:59:05.840 |
I mean, he unfortunately passed away back in 2008. 00:59:12.560 |
he was a professor emeritus at Rockefeller University. 00:59:30.800 |
with the hope that, you know, that I could actually, 00:59:33.440 |
you know, have a chance to meet Joshua in person. 00:59:50.040 |
with the sort of sky scrapper that Rockefeller owns, 01:00:06.320 |
But so I started, you know, reading about Dendral, 01:00:40.960 |
the way you study the extraterrestrial molecules 01:00:51.680 |
gives you the ideas about the possible fragments, 01:00:59.800 |
pieces of this molecule that make up the molecule, right? 01:01:12.440 |
before, you know, became fragments, bits and pieces, right? 01:01:24.360 |
the idea of Lederberg was to connect chemistry, 01:01:47.160 |
and essentially try to sort of induce the molecule 01:02:01.520 |
was that, you know, it would provide a list of candidates 01:02:13.960 |
is to solve the entirety of this problem automatically. 01:02:37.400 |
of the modern bioinformatics, cheminformatics, 01:02:44.000 |
so every time you deal with projects like this, 01:03:04.200 |
Is there, and why they kind of didn't become successful? 01:03:12.520 |
where it does seem like there is a lot of expertise 01:03:18.280 |
is it possible to see that a system like this 01:03:36.440 |
my first two lectures are on the history of AI. 01:03:48.200 |
and so, you know, the question of why expert systems 01:03:58.520 |
And there are, you know, if you try to read the, 01:04:14.840 |
and so therefore they were replaced, you know, 01:04:21.160 |
The other one was that completely opposite one, 01:04:37.120 |
I mean, in both cases, sort of the outcome was the same. 01:04:46.040 |
- That's interesting. - If I look at this, right? 01:05:12.860 |
- So speaking of AlphaFold, so DeepMind's AlphaFold2 01:05:29.640 |
It's an incredible accomplishment from the looks of it. 01:05:39.000 |
- It's definitely a very exciting achievement. 01:05:41.880 |
To give you a little bit of perspective, right? 01:05:43.760 |
So in bioinformatics, we have several competitions. 01:05:59.560 |
you know, they call it bioinformatics Olympic games. 01:06:07.040 |
was the discipline in predicting the protein structure, 01:06:10.280 |
predicting the 3D coordinates of the protein. 01:06:16.760 |
predicting effects of mutations on protein functions, 01:06:21.480 |
then predicting protein-protein interactions. 01:06:28.080 |
or a critical assessment of protein structure. 01:06:40.000 |
during these competitions is, you know, scientists, 01:06:43.960 |
experimental scientists solve the structures, 01:06:48.360 |
but don't put them into the protein databank, 01:06:57.240 |
Instead, they hold it and release protein sequences. 01:07:05.400 |
is to predict the 3D structures of these proteins, 01:07:10.160 |
and then use the experimentally solved structures 01:07:12.920 |
to assess which one is the closest one, right? 01:07:19.520 |
And maybe you can also say, what is protein folding? 01:07:32.440 |
So I just added a little, yeah, just a bunch. 01:07:39.400 |
for the folks that might be outside of the field. 01:07:42.400 |
- So, yeah, so, you know, so the reason it's, you know, 01:07:45.920 |
it's relevant to our understanding of protein folding 01:07:54.160 |
how the folding mechanistically works, right? 01:08:19.720 |
and then the whole protein structure gets formed. 01:08:29.800 |
So these are, you know, elements that are structurally stable. 01:08:40.320 |
Because some of the secondary structure elements, 01:08:42.560 |
you have to have, you know, a fragment in the beginning 01:08:57.120 |
So it's still, you know, it's still a big enigma, 01:09:11.200 |
And it happens like the same way almost every time. 01:09:19.080 |
- It's, yeah, that's why it's such an amazing thing. 01:09:22.920 |
But most importantly, right, so it's, you know, 01:09:24.960 |
so when you see the translation process, right, 01:09:29.240 |
so when you don't have the whole protein translated, right, 01:09:41.200 |
you already see some structural, you know, fragmentation. 01:09:49.280 |
before the whole protein gets produced, right? 01:10:04.160 |
like that's not, that's bigger than the question of folding. 01:10:14.640 |
So, you know, so obviously if we are able to predict 01:10:27.640 |
sort of the mechanistics of the protein folding. 01:10:30.200 |
Because we can then potentially look and start probing 01:10:38.200 |
and what are not so critical parts of this process. 01:10:44.440 |
so in a way, this protein structure prediction algorithm 01:10:53.720 |
So you change the, you know, you modify the protein, 01:11:13.360 |
we typically have some sort of incremental advancement. 01:11:18.720 |
You know, each stage of this CASP competition, 01:11:22.600 |
you have groups with incremental advancement. 01:11:25.320 |
And, you know, historically, the top performing groups 01:11:29.840 |
were, you know, they were not using machine learning. 01:11:43.220 |
And that was, you know, that would enable them 01:11:47.360 |
to obtain protein structures of those proteins 01:11:52.360 |
that don't have any structurally solved relatives. 01:11:57.520 |
Because, you know, if we have another protein, 01:12:01.860 |
say the same protein, but coming from a different species, 01:12:10.440 |
and that's so-called homology or comparative modeling, 01:12:17.360 |
And that would help us tremendously in, you know, 01:12:25.400 |
But what happens when we don't have these relatives? 01:12:27.900 |
This is when it becomes really, really hard, right? 01:12:49.540 |
and all of a sudden, it's much better than everyone else. 01:12:58.760 |
- Oh, and the competition is only every two years, I think. 01:13:06.600 |
kind of of a shockwave to the bioinformatics community 01:13:10.160 |
that we have like a state-of-the-art machine learning system 01:13:20.760 |
so if you look at this, it actually predicts the context. 01:13:25.760 |
So, you know, so the process of reconstructing 01:13:29.480 |
the 3D structure starts by predicting the context 01:13:38.880 |
And the context is essentially the part of the proteins 01:13:40.960 |
that are in the close proximity to each other. 01:13:44.680 |
the machine learning part seems to be estimating, 01:13:51.080 |
but it seems to be estimating the distance matrix, 01:13:53.200 |
which is like the distance between the different parts. 01:14:00.600 |
the reconstruction is becoming more straightforward. 01:14:11.280 |
And now we started seeing in this current stage, right, 01:14:18.480 |
we started seeing the emergence of these ideas 01:14:38.640 |
- Yeah, there does seem to be also an incorporation. 01:14:44.880 |
There does seem to be an incorporation of this other thing. 01:14:48.120 |
I don't know if it's something that you could speak to, 01:14:50.160 |
which is like the incorporation of like other structures, 01:15:03.840 |
- Yes, so evolutionary similarity is something 01:15:08.360 |
that we can detect at different levels, right? 01:15:10.760 |
So we know, for example, that the structure of proteins 01:15:22.320 |
but the structural shape is actually still very conserved. 01:15:26.320 |
So that's sort of the intrinsic property that, you know, 01:15:37.800 |
But we know that, I mean, there've been multiple studies. 01:15:41.040 |
And, you know, ideally if you have structures, 01:15:48.560 |
However, sometimes we don't have this information. 01:15:54.880 |
So we have, you know, hundreds, thousands of, you know, 01:16:15.360 |
we can actually say a lot about sort of what is conserved 01:16:24.040 |
structure more stable, what is diverse in this protein. 01:16:27.240 |
So on top of that, we could provide sort of the information 01:16:30.880 |
about the sort of the secondary structure of this protein, 01:16:44.760 |
so you just have a protein sequence and nothing else, 01:16:48.200 |
the reality is such that we are overwhelmed with this data. 01:17:03.440 |
in the previous version of AlphaFold, they didn't, 01:17:17.880 |
like the features derived from the similarity. 01:17:22.000 |
It seems like there's some kind of, quote unquote, 01:17:24.640 |
iterative thing where it seems to be part of the learning 01:17:29.640 |
process is the incorporation of this evolutionary similarity. 01:17:34.240 |
- Yeah, I don't think there is a bioarchive paper, right? 01:17:51.800 |
but it could be, it's like interpreting scripture. 01:18:01.920 |
- So now, speaking about protein folding, right? 01:18:04.280 |
So, you know, in order to answer the question 01:18:09.440 |
So we need to go back to the beginning of our conversation, 01:18:13.280 |
you know, with the realization that, you know, 01:18:15.000 |
an average protein is that typically what the cusp 01:18:24.080 |
this competition has been focusing on the single, 01:18:27.200 |
maybe two domain proteins that are still very compact. 01:18:31.000 |
And even those ones are extremely challenging to solve, 01:18:37.640 |
an average protein that has two, three protein domains. 01:18:42.400 |
If you look at the proteins that are in charge of the, 01:18:47.480 |
you know, of the process with the neural system, right, 01:18:51.500 |
perhaps one of the most recently evolved sort of systems 01:19:09.000 |
So they are, you know, some of them have five, six, seven, 01:19:16.840 |
And, you know, we are very far away from understanding 01:19:22.400 |
- So the complexity of the protein matters here, 01:19:30.240 |
So you're saying solved, so the definition of solved here 01:19:50.280 |
- Well, I mean, you know, I do think that, you know, 01:19:54.680 |
especially with regards to the alpha fold, you know, 01:20:07.360 |
pretty big majority of the more compact proteins, 01:20:17.480 |
in order to understand how the overall protein, 01:20:49.760 |
like the size, I forget how many 3D structures 01:20:54.560 |
have been mapped, but the training data is very small, 01:20:59.040 |
maybe a one or two million, something like that, 01:21:02.920 |
but like, it doesn't seem like that's scalable. 01:21:09.360 |
it feels like you want to somehow 10X the data 01:21:30.680 |
based on the evolutionary information, right? 01:21:33.760 |
So you can, there is a potential to enhance this information 01:21:38.560 |
and use it again to empower the training set. 01:21:43.560 |
And it's, I think, I am actually very optimistic. 01:21:53.600 |
I think it's been one of these sort of, you know, 01:22:11.320 |
that is truly better than the sort of the more conventional 01:22:30.500 |
So like, okay, so let's see who's in the running. 01:22:39.960 |
beating the world champion at the game of Go. 01:22:48.240 |
or at least the AI community was highly skeptical. 01:22:51.380 |
Then you got like also Deep Blue original Kasparov. 01:22:56.960 |
maybe what would you say, the AlexNet image in that moment. 01:23:19.000 |
that whole space of transformers and language models, 01:23:27.160 |
of application of neural networks to language models. 01:23:48.920 |
I would say it's one of the greatest accomplishments 01:23:53.920 |
like mechanical engineering of robotics ever. 01:24:12.040 |
And I don't know if you can, what else is there? 01:24:18.440 |
So, and then AlphaFold, many people are saying 01:24:24.860 |
- Well, in terms of the impact on the science 01:24:38.480 |
- I mean, I'm probably not the best person to answer that. 01:24:56.400 |
but Kasparov, it was, I mean, it was a shock. 01:25:04.280 |
for the pretty substantial part of the world, 01:25:13.000 |
that especially people who have some experience 01:25:18.240 |
with chess, right, and realizing how incredibly human 01:25:25.080 |
this game, how much of a brain power you need 01:25:30.080 |
to reach those levels of grandmasters, right, level. 01:25:35.800 |
- Yeah, and it's probably one of the first time, 01:25:53.720 |
- Yes, yes, so that was, to me, that was like, 01:26:01.920 |
- As probably, like, we don't, it's hard to remember. 01:26:09.880 |
It's like, nah, you gotta put Muhammad Ali at number one. 01:26:21.520 |
and search is the integral part of AI, right? 01:26:25.400 |
- People don't think of it that way at this moment. 01:26:37.680 |
In fact, I mean, that's what neural networks are, 01:26:47.760 |
and you just have to become clever and clever 01:26:50.920 |
- And I also have another one that you didn't mention 01:27:08.840 |
the experts in Rembrandt painting in Netherlands, 01:27:15.280 |
and a group, an artificial intelligence group, 01:27:27.000 |
that never existed before in the style of Rembrandt. 01:27:55.840 |
You haven't been able to achieve superhuman level 01:28:21.680 |
So there hasn't been a moment where it's like, 01:28:24.400 |
oh, this is, we're now, I would say in the space of music, 01:28:28.760 |
what makes a lot of money, we're talking about serious money, 01:28:32.080 |
it's music and movies, or like shows and so on, 01:28:49.440 |
that is, that's sufficiently popular to make a ton of money. 01:28:56.120 |
- And that moment would be very, very powerful, 01:29:05.480 |
like even Premiere, audio editing, all the editing, 01:29:13.280 |
I wanna talk to those folks just 'cause I wanna nerd out, 01:29:15.560 |
it's called iZotope, I don't know if you're familiar with it. 01:29:18.080 |
They have a bunch of tools of audio processing, 01:29:26.360 |
like on the audio here, 'cause it's all machine learning. 01:29:43.560 |
but they also have all of this machine learning stuff, 01:29:46.000 |
like where you actually give it training data. 01:29:58.080 |
like the ability of it to be able to separate voice 01:30:03.320 |
and music, for example, or voice in anything, is incredible. 01:30:07.240 |
Like it just, it's clearly exceptionally good 01:30:11.160 |
at applying these different neural networks models 01:30:14.920 |
to separate the different kinds of signals from the audio. 01:30:28.280 |
that will sell millions, a piece of art, yeah. 01:30:41.160 |
and an integral part of this is the project, right? 01:30:47.320 |
because typically we have these project presentations 01:30:56.160 |
and it's sort of, it adds this cool excitement. 01:31:00.320 |
And every time, I'm amazed with some projects 01:31:08.720 |
And so, and quite a few of them are actually, 01:31:22.160 |
who designed an AI producing hokus, Japanese poems. 01:31:51.440 |
- Yes, yeah, it seems reason, so it's kind of cool. 01:31:57.840 |
where people tried to teach AI how to play like rock music, 01:32:16.600 |
Of course, if you look at the grandmasters of music, 01:32:39.040 |
that at least some style of this music could be picked up, 01:32:43.800 |
but then you have this completely different spectrum 01:32:46.960 |
of classical composers, and so it's almost like, 01:32:56.800 |
You just listen to it, and say, "Nah, that's not it, 01:33:17.560 |
a tortured soul behind the music, I don't know. 01:33:26.560 |
one day we'll have a song written by an AI engine 01:33:49.960 |
How hard is the multi-protein folding problem? 01:33:58.720 |
of greater and greater complexity of proteins? 01:34:03.280 |
is that basically become multi-protein complexes? 01:34:15.640 |
of protein folding and protein-protein interactions. 01:34:26.520 |
actually, they never form a stable structure. 01:34:51.500 |
is so-called post-synaptic density 95, PSD95 protein. 01:35:33.920 |
but the way it's organized itself, it's flexible, right? 01:35:49.340 |
So they start acting in the orchestrated manner, right? 01:35:54.340 |
So, and the type of the shape of this protein, 01:35:58.860 |
it's in a way, there are some stable parts of this protein, 01:36:13.180 |
- So do you think that kind of thing is also learnable 01:36:28.180 |
- To me, it's yet another level of complexity, 01:36:31.380 |
because when we talk about protein-protein interactions, 01:36:35.180 |
and there is actually a different challenge for this, 01:36:48.560 |
So, but it's, you know, there are different mechanisms 01:37:06.540 |
we participated for a few years in this competition. 01:37:11.540 |
We typically don't participate in competitions, 01:37:35.420 |
So the function that evaluates whether or not 01:37:38.100 |
your protein-protein interaction is supposed to look like 01:37:43.340 |
So the scoring function is very critical part 01:37:49.820 |
So we design it to be a machine learning one. 01:37:55.820 |
machine learning-based scoring function used in CAPRI. 01:38:06.580 |
what are the critical components contributing 01:38:10.540 |
- So this could be converted into a learning problem 01:38:17.020 |
- Do you think AlphaFold2 or something similar to it 01:38:24.300 |
will result in a Nobel Prize or multiple Nobel Prizes? 01:38:28.660 |
So like, you know, obviously, maybe not so obviously, 01:38:33.300 |
you can't give a Nobel Prize to a computer program. 01:38:38.020 |
You, at least for now, give it to the designers 01:38:54.880 |
Would it lead to discoveries at the level of Nobel Prizes? 01:39:08.700 |
to be evolving with the evolution of science. 01:39:14.500 |
that it now becomes like really multifaceted, right? 01:39:17.820 |
So where you don't really have like a unique discipline, 01:39:21.300 |
you have sort of the, a lot of cross-disciplinary talks 01:39:32.380 |
So I think, you know, the computational methods 01:39:46.860 |
they were first acknowledged back in 2013, right? 01:39:50.580 |
Where, you know, the first three people were, you know, 01:39:59.140 |
for study of the protein folding, right, the principle. 01:40:06.940 |
So, you know, that I think is unavoidable, you know. 01:40:16.560 |
The fact that, you know, alpha fold and, you know, 01:40:23.460 |
similar approaches, 'cause again, it's a matter of time 01:40:26.340 |
that people will embrace this, you know, principle. 01:40:36.940 |
But, you know, these methods will be critical 01:40:41.940 |
in a scientific discovery, no doubts about it. 01:40:47.380 |
- On the engineering side, maybe a dark question, 01:40:59.000 |
And the next question is something quite a few biologists 01:41:04.000 |
are against, some are for, for study purposes, 01:41:09.620 |
Do you think machine learning, like something 01:41:11.860 |
like alpha fold could be used to engineer viruses? 01:41:14.780 |
- So to answering the first question, you know, 01:41:16.980 |
it has been, you know, a part of the research 01:41:29.160 |
Of course, you know, one of the pioneers is David Baker 01:41:38.220 |
and was used to design new proteins, you know. 01:41:41.540 |
- And design of proteins means design of function. 01:41:44.200 |
So like when you design a protein, you can control, 01:41:58.160 |
you can look at the proteins from the functional perspective, 01:42:05.700 |
So if you want to have a building block of a certain shape, 01:42:22.040 |
one of the natural applications of these algorithms. 01:42:53.680 |
is we're trying to develop a machine learning algorithm 01:43:04.600 |
- Of the virus, I mean, so there are applications 01:43:07.720 |
to coronaviruses because we have strains of SARS-CoV-2, 01:43:14.600 |
but we also have strains of other coronaviruses 01:43:20.440 |
the common cold viruses and some other ones, right? 01:43:29.000 |
- Pathogenic meaning it's actually inflicting damage. 01:43:35.320 |
There are also some seasonal versus pandemic strains 01:43:41.760 |
And determining what are the molecular determinant, right, 01:43:45.520 |
so that are built in into the protein sequence, 01:43:58.000 |
- Oh, interesting, so like using machine learning, 01:44:17.480 |
you're saying we don't have enough data for that? 01:44:27.240 |
There was one work that appeared in bio-archive 01:44:31.680 |
by Eugene Kunin, who is one of these pioneers 01:44:35.480 |
in evolutionary genomics, and they tried to look at this, 01:44:45.080 |
supervised learning methods, and now the question is, 01:44:50.200 |
can you advance it further by using not so standard methods? 01:44:56.320 |
So, there's obviously a lot of hope in transfer learning, 01:45:02.680 |
where you can actually try to transfer the information 01:45:11.320 |
And so, there is some promise in going this direction, 01:45:16.320 |
but if we have this, it would be extremely useful, 01:45:22.960 |
the potential mutations that would make a current strain 01:45:27.560 |
- Anticipate them from a vaccine development, 01:45:31.120 |
for the treatment, antiviral drug development. 01:45:36.840 |
- But you could also use that system to then say, 01:45:50.240 |
I mean, you know, again, the hope is, well, several things. 01:45:55.240 |
So, one is that, even if you design a sequence, right? 01:46:06.760 |
So, to carry out the actual experimental biology, 01:46:30.440 |
that it's now becoming no longer a sort of a fun puzzle 01:46:37.840 |
- Yeah, so then there might be some regulation. 01:46:44.760 |
there was an issue on regulating the research 01:46:52.480 |
So, there were several groups use sort of mutation analysis 01:46:57.480 |
to determine whether or not this strain will jump 01:47:03.280 |
And I think there was like a half a year moratorium 01:47:24.980 |
It's like, let's watch this thing mutate for a while 01:47:33.760 |
I guess, I'm not so much worried about that kind of research 01:47:59.840 |
And that seems to be the most important value, actually, 01:48:51.680 |
would you be more worried about natural pandemics 01:49:08.840 |
I would still be worried about the natural pandemics, 01:49:25.320 |
engineering viruses as a weapon is a weird one, 01:49:31.520 |
but it seems very difficult to target a virus, right? 01:49:35.640 |
The whole point of a weapon, the way a rocket works, 01:49:42.360 |
to hit a target with a virus is very difficult. 01:50:04.560 |
- Well, I also hope that, I mean, that's what we see. 01:50:07.760 |
I mean, with the way we are getting connected, 01:50:14.440 |
I think it helps for the world to become more transparent. 01:50:27.120 |
I think it's one of the key things for the society 01:50:36.440 |
- This is something that people disagree with me on, 01:50:43.480 |
so you're kind of speaking more to the other aspects, 01:51:03.760 |
will become also a thing of the 20th century. 01:51:10.200 |
- I think nations will lose power in the 21st century, 01:51:13.240 |
like lose sufficient power towards secrecies. 01:51:16.000 |
Transparency is more beneficial than secrecy, 01:51:23.400 |
that the governments will become more transparent. 01:51:28.400 |
- So we last talked, I think, in March or April. 01:51:58.160 |
at how efficient the scientific community was. 01:52:03.120 |
I mean, and even just judging on this very narrow domain 01:52:12.520 |
understanding the structural characterization 01:52:16.600 |
of this virus from the components point of view, 01:52:28.620 |
less than 20, but close enough, 20 years ago. 01:52:38.500 |
what was sort of the response by the scientific community. 01:52:42.460 |
You see that the structural characterizations did occur, 01:52:56.880 |
So we see that, you know, the research pop up. 01:53:06.000 |
Never before we had a single virus sequenced so many times. 01:53:17.040 |
to trace very precisely the sort of the evolutionary nature 01:53:48.900 |
it's always a host pathogen co-evolution that, you know, 01:53:55.400 |
- It'd be cool if we also had a lot more data about, 01:54:05.120 |
for like contact tracing purposes for this virus, 01:54:13.600 |
But it's already nice that we have geographical data 01:54:19.080 |
No, I think contact tracing is obviously a key component 01:54:28.020 |
There is also, there is a number of challenges, right? 01:54:40.880 |
It's the prediction of the number of infections 01:54:47.880 |
So, and, you know, obviously the AI is the main topic 01:55:09.320 |
but like, it would be nice if it was like really rich. 01:55:16.760 |
I mean, he dreams that the community comes together 01:55:19.000 |
with like a weather map to where a viruses, right? 01:55:22.960 |
Like really high resolution sensors on like how, 01:55:27.840 |
from person to person, the viruses that travel, 01:55:36.800 |
that you've spoken about of the evolution of these viruses, 01:55:41.200 |
like day-to-day mutations that are occurring. 01:55:48.680 |
and from the perspective of being able to respond 01:55:56.440 |
Is there some three or whatever number of books, 01:56:07.760 |
and maybe some that you would recommend others? 01:56:11.360 |
- So I'll give you three very different books, 01:56:28.360 |
that sort of impacted my earlier stage of life, 01:56:32.480 |
and I'm probably not gonna be very original here. 01:56:41.280 |
- Well, not for a Russian, maybe it's not super original, 01:56:43.880 |
but it's a really powerful book for even in English. 01:57:07.320 |
What ideas, what insights did you get from it? 01:57:12.200 |
by the fact that you have those parallel lives 01:57:33.840 |
And, you know, of course, the romantic part of this book 01:57:41.760 |
it's like the romance empowered by sort of magic, right? 01:57:45.840 |
And maybe on top of that, you have some irony, 01:57:53.400 |
Because it was that, you know, the Soviet time. 01:57:58.560 |
So that's the wit, the humor, the pain, the love, 01:58:06.200 |
that kind of captures something about Russian culture 01:58:10.280 |
that people outside of Russia should probably read. 01:58:14.240 |
- So the second one is, again, another one that, 01:58:41.640 |
you know, Solzhenitsyn was diagnosed with cancer 01:58:44.760 |
when he was reasonably young and he made a full recovery. 01:58:54.480 |
who was sentenced for life in one of these, you know, camps. 01:59:24.820 |
being a, you know, a patient in the cancer clinic, 01:59:47.580 |
who described these, you know, the experiences, 01:59:51.780 |
in the book, by the patient as incredibly accurate, right? 01:59:58.740 |
So, you know, I read that there was some doctors saying 02:00:03.340 |
that, you know, every single doctor should read this book 02:00:10.660 |
But, you know, again, as many of the Solzhenitsyn's books, 02:00:19.540 |
And obviously, you know, if you look above the cancer 02:00:24.540 |
and the patient, I mean, the tumor that was growing 02:00:29.820 |
and then disappeared in his body with some consequences, 02:00:35.340 |
I mean, this is, you know, allegorically the Soviet, 02:00:48.020 |
you know, when he was asked, he said that this is 02:00:56.140 |
Him being a part of the, you know, of the Soviet regime, 02:01:08.020 |
And also someone who experienced cancer in his life. 02:01:12.700 |
You know, the Gulag Archipelago and this book, 02:01:25.940 |
I've read other, you know, books by Solzhenitsyn. 02:01:30.940 |
This one is, to me, is the most powerful one. 02:01:34.780 |
- And by the way, both this one and the previous one, 02:01:40.280 |
So now there is, the third book is an English book, 02:01:45.700 |
So, you know, we're switching the gears completely. 02:01:48.760 |
So this is the book, which it's not even a book, 02:02:24.320 |
is considered to be one of the biggest thinkers, right? 02:02:28.680 |
So his intellectual power was incredible, right? 02:02:32.500 |
And you can actually feel this power in this book 02:02:38.220 |
knowing that he will be, you know, he will die. 02:02:41.340 |
The book actually got published only after his death, 02:02:53.060 |
that, you know, he still, you know, hadn't realized. 02:02:58.060 |
And, you know, so this book is very difficult to read 02:03:04.780 |
because, you know, every single paragraph is just compact. 02:03:19.900 |
to put the parallels between the brain computing power, 02:03:24.900 |
the neural system, and the computers, you know, 02:03:29.580 |
- Do you remember what year he was working on this? 02:03:36.420 |
when he was diagnosed with cancer and he was essentially. 02:03:39.780 |
- Yeah, he's one of those, there's a few folks 02:03:42.740 |
people mention, I think Ed Witten is another, 02:03:49.080 |
they say he's just an intellectual powerhouse. 02:03:56.620 |
the reason I put it sort of in this separate section 02:03:59.500 |
because this is a book that I recently listened to. 02:04:20.440 |
the fossil plants and so she uses the fossil plants, 02:04:29.000 |
the chemical analysis to understand what was the climate 02:04:40.280 |
And so something that incredibly touched me by this book, 02:04:54.080 |
So certain parts of the book you could actually 02:05:03.920 |
anything like this, you know, reading the book. 02:05:12.720 |
And I think this is, you know, this is really a must read, 02:05:22.240 |
for anyone who wants to learn about sort of, you know, 02:05:48.120 |
Do you have some exciting things you're looking forward 02:05:56.120 |
maybe silly or fun, or something very important 02:06:11.160 |
towards, you know, things becoming normal, right? 02:06:27.340 |
it's called the School for Molecular and Theoretical Biology. 02:06:37.860 |
from all over the world, and they're incredibly bright. 02:06:41.020 |
It's like, every time I go there, it's like, you know, 02:06:50.980 |
so we did this school remotely, but it's different. 02:07:01.140 |
I also, I mean, you know, one of my personal resolutions, 02:07:05.900 |
I realized that being in-house and working from home, 02:07:20.460 |
spending time with my family, believe it or not. 02:07:24.420 |
So you typically, you know, with all the research 02:07:28.300 |
and teaching and everything related to the academic life, 02:07:38.280 |
And so, you know, you don't feel that, you know, 02:07:50.860 |
And, you know, this time I realized that, you know, 02:07:58.180 |
Spending your time with the family, with your kids. 02:08:05.420 |
in actually trying to spend as much time as possible. 02:08:15.660 |
I asked you if there's a Russian poem you could read 02:08:31.540 |
So yeah, so this poem was written by my namesake, 02:08:42.380 |
and it's called "Sorceress", "Vedma" in Russian. 02:08:52.940 |
So that's sort of another connotation of sorceress or witch. 02:09:23.220 |
we are talking now, so around New Year, around Christmas. 02:10:55.340 |
something that is happening, something that is far away, 02:11:06.180 |
- There's something magical about winter, isn't it? 02:11:24.540 |
but English doesn't capture some of the magic 02:11:28.220 |
that Russian seems to, so thank you for doing that. 02:11:35.560 |
It's contagious how much you love what you do, 02:11:40.780 |
So I really appreciate you taking the time to talk today. 02:11:47.820 |
with Dmitry Korkin, and thank you to our sponsors, 02:11:50.740 |
Brave Browser, NetSuite Business Management Software, 02:12:00.220 |
So the choice is browsing privacy, business success, 02:12:08.580 |
And if you wish, click the sponsor links below 02:12:11.140 |
to get a discount and to support this podcast. 02:12:24.100 |
Thank you for listening, and hope to see you next time.