back to indexLuís and João Batalha: Fermat's Library and the Art of Studying Papers | Lex Fridman Podcast #209
Chapters
0:0 Introduction
2:22 Backstories to research papers
17:13 Fermat's Library
37:14 Scientific publishing
60:54 How to read a paper
66:48 Taking good notes
75:27 Favorite papers on Fermat's Library
116:18 Fermat's Library on Twitter
125:50 What it takes to build a successful startup
134:46 Game of Thrones
137:34 Realism in science fiction movies
143:33 Greatest soccer player of all time
166:22 Advice for young people
00:00:00.000 |
The following is a conversation with Louise and João Botala, 00:00:03.920 |
brothers and co-founders of Fermat's Library, which is an incredible platform for annotating papers. 00:00:10.400 |
As they write on the Fermat's Library website, "Just as Pierre de Fermat scribbled his famous 00:00:16.880 |
last theorem in the margins, professional scientists, academics, and citizen scientists 00:00:21.920 |
can annotate equations, figures, ideas, and write in the margins." 00:00:26.400 |
Fermat's Library is also a really good Twitter account to follow. I highly recommend it. 00:00:32.000 |
They post little visual factoids and explorations that reveal the beauty of mathematics. I love it. 00:00:39.840 |
Quick mention of our sponsors. Skiff, SimpliSafe, Indeed, NetSuite, and 4Sigmatic. 00:00:47.760 |
Check them out in the description to support this podcast. 00:00:50.880 |
As a side note, let me say a few words about the dissemination of scientific ideas. 00:00:56.000 |
I believe that all scientific articles should be freely accessible to the public. They currently 00:01:02.320 |
are not. In one analysis I saw, more than 70% of published research articles are behind a paywall. 00:01:08.480 |
In case you don't know, the funders of the research, whether that's government or industry, 00:01:14.640 |
aren't the ones putting up the paywall. The journals are the ones putting up the paywall, 00:01:20.000 |
while using unpaid labor from researchers for the peer review process. 00:01:25.520 |
Where is all that money from the paywall going? In this digital age, the costs here should be 00:01:31.520 |
minimal. This cost can easily be covered through donation, advertisement, or public funding of 00:01:37.520 |
science. The benefit versus the cost of all papers being free to read is obvious, and the fact that 00:01:43.120 |
they're not free goes against everything science should stand for, which is the free dissemination 00:01:48.640 |
of ideas that educate and inspire. Science cannot be a gated institution. The more people can freely 00:01:56.560 |
learn and collaborate on ideas, the more problems we can solve in the world together, and the faster 00:02:02.160 |
we can drive old ideas out and bring new, better ideas in. Science is beautiful and powerful, 00:02:10.080 |
and its dissemination in this digital age should be free. This is the Lex Friedman Podcast, 00:02:17.200 |
and here's my conversation with Luis and Joao Batalla. Luis, you suggested an interesting idea. 00:02:24.960 |
Imagine if most papers had a backstory section, the same way that they have an abstract. So, 00:02:32.960 |
knowing more about how the authors ended up working on a paper can be extremely insightful. 00:02:38.720 |
And then you went on to give a backstory for the Feynman QED paper. This is all in a tweet, 00:02:43.760 |
by the way. We're doing tweet analysis today. How much of the human backstory do you think is 00:02:48.320 |
important in understanding the idea itself that's presented in the paper or in general? 00:02:53.680 |
- I think this gives way more context to the work of scientists. I think a lot of people have this 00:03:00.800 |
almost kind of romantic misconception that the way a lot of scientists work is almost as the sum of 00:03:08.320 |
eureka moments where all of a sudden they sit down and start writing two papers in a row, 00:03:13.200 |
and the papers are usually isolated. And when you actually look at it, the papers are chapters of a 00:03:19.440 |
way more complex story. And the Feynman QED paper is a good example. So, Feynman was actually going 00:03:26.800 |
through a pretty dark phase before writing that paper. He lost enthusiasm with physics and doing 00:03:33.840 |
physics problems. And there was one time when he was in the cafeteria of Cornell, and he saw a guy 00:03:39.600 |
that was throwing plates in the air. And he noticed that when the plate was in the air, there 00:03:44.720 |
were two movements there. The plate was wobbling, but he also noticed that the Cornell symbol was 00:03:50.480 |
rotating. And he was able to figure out the equations of motions of those plates. And that 00:03:58.240 |
led him to kind of think a little bit about electron orbits in relativity, which led to 00:04:08.080 |
about quantum electrodynamics. So, that kind of reignited his interest in physics, 00:04:13.840 |
and ended up publishing the paper that led to his Nobel Prize, basically. And I think 00:04:20.560 |
there are a lot of really interesting backstories about papers that readers never get to know. 00:04:26.160 |
For instance, we did a couple of months ago an AMA around a paper, a pretty famous paper, 00:04:34.640 |
the Gans paper with Ian Goodfellow. And so, we did an AMA where everyone could ask questions 00:04:40.160 |
about the paper, and Ian was responding to those questions. He was also telling the story of how 00:04:46.240 |
he got the idea for that paper in a bar. So, that was also an interesting backstory. 00:04:51.760 |
I also read a book by Cedric Villani. Cedric Villani is this mathematician, a Fields Medalist. 00:05:00.080 |
And in his book, he tries to explain how he got from a PhD student to the Fields Medal, 00:05:06.480 |
and he tries to be as descriptive as possible, every single step, how he got to the Fields Medal. 00:05:11.840 |
And it's interesting also to see just the amount of random interactions and discussions with other 00:05:17.200 |
researchers, sometimes over coffee, and how it led to fundamental breakthroughs in some of his 00:05:23.040 |
most important papers. So, I think it's super interesting to have that context of the backstory. 00:05:27.280 |
- Well, the Ian Goodfellow story is kind of interesting, and perhaps that's true for Feynman 00:05:31.680 |
as well. I don't know if it's romanticizing the thing, but it seems like just a few little 00:05:37.200 |
insights and a little bit of work does most of the leap required. Do you have a sense that for a lot 00:05:43.440 |
of the stuff you've looked at, just looking back through history, it wasn't necessarily the grind 00:05:50.880 |
of like Andrew Wiles or the Fermat's Last Theorem, for example. It was more like a brilliant 00:05:57.120 |
moment of insight. In fact, Ian Goodfellow has a kind of sadness to him almost, in that at that 00:06:03.760 |
time in machine learning, like at that time, especially for GANs, you could code something 00:06:11.680 |
up really quickly on a single machine and almost do the invention, go from idea to experimental 00:06:18.960 |
validation in like a single night, a single person could do it. And now there's kind of a sadness 00:06:24.000 |
that a lot of the breakthroughs you might have in machine learning kind of require large scale 00:06:28.320 |
experiments. So it was almost like the early days. So I wonder how many low hanging fruit there are 00:06:37.120 |
in science and mathematics and even engineering where it's like, you could do that little 00:06:43.200 |
experiment quickly. Like you have an insight in a bar. Why is it always a bar? But you have an 00:06:47.760 |
insight at a bar and then just implement and the world changes. It's a good point. I think it also 00:06:55.040 |
depends a lot on the maturity of the field. When you look at a field like mathematics, 00:07:00.400 |
like it's a pretty mature field. A field like machine learning, it's growing pretty fast. 00:07:06.400 |
And it's actually pretty interesting. I looked up like the number of new papers 00:07:14.640 |
on archive with the keyword machine learning and like 50% of those papers have been published 00:07:19.920 |
in the last 12 months. So you can see just the sense- 00:07:24.160 |
Five zero, 50%. So you can see the magnitude of growth in that field. And so I think like 00:07:31.440 |
as fields mature, like those types of moments, I think naturally are less frequent. It's just 00:07:39.040 |
a consequence of that. The other point that is interesting about the backstory is that it can 00:07:44.160 |
really make it more memorable in a way. And by making it more memorable, it kind of sediments 00:07:49.760 |
the knowledge more in your mind. I remember also reading the sort of the backstory to 00:07:54.960 |
Dijkstra's shortest path algorithm, right? Where he came up with it essentially while he was 00:08:02.320 |
sitting down at a coffee shop in Amsterdam. And he came up with that algorithm over 20 minutes. 00:08:09.680 |
And one interesting aspect is that he didn't have any pen or paper at the time. And so he had to do 00:08:13.920 |
it all in his mind. And so there's only so much complexity that he can handle if you're just 00:08:18.480 |
thinking about it in your mind. And that like when you think about the simplicity of Dijkstra's 00:08:24.000 |
shortest path finding algorithm, it's knowing that backstory helps sediment that algorithm 00:08:30.480 |
in your mind so that you don't forget about it as easily. 00:08:33.280 |
It might be from you that I saw a meme about Dijkstra. 00:08:37.520 |
It's like he's trying to solve it and he comes up with some kind of random path. And then 00:08:43.760 |
it's like my parents aren't home. And then he does, he figures out the algorithm for the shortest 00:08:49.280 |
path. He's trying through words to convey memes, but it's hilarious. I don't know if it's in post 00:08:57.520 |
that we construct stories that romanticize it. Apparently when Newton, there was no apple. 00:09:02.400 |
Especially when you're working on problems that have a physical manifestation or a visual 00:09:08.160 |
manifestation, it feels like the world could be an inspiration to you. So it doesn't have to be 00:09:16.160 |
completely on paper. Like you could be sitting at a bar and all of a sudden see something in a pattern 00:09:23.120 |
will spark another pattern and you can visualize it and rethink a problem in a particular way. 00:09:29.440 |
Of course, you can also load the math that you have on paper and always carry that with you. 00:09:34.560 |
So when you show up to the bar, some little inspiration could be the thing that changes it. 00:09:38.720 |
Is there any other people almost on the human side, whether it's physics with Feynman, 00:09:44.640 |
Dirac, Einstein, or computer science, Turing, anybody else? Any backstories that you remember 00:09:51.600 |
that jump out? Because I'm also referring to not necessarily these stories where something 00:09:57.600 |
magical happens, but these are personalities. They have big egos. Some of them are super friendly. 00:10:04.160 |
Some of them are self-obsessed. Some of them have anger issues. Some of them, how do I describe 00:10:11.360 |
Feynman? But he appears to have appreciation of the beautiful in all its forms. He has a wit 00:10:19.040 |
and a cleverness and a humor about him. So does that come into play in terms of the construction 00:10:24.400 |
of the science? - I think you brought up Newton. Newton is a good example also to think about his 00:10:30.000 |
backstory because there's a certain backstory of Newton that people always talk about, but then 00:10:35.600 |
there's a whole another aspect of him that is also a big part of the person that he was. But he was 00:10:41.920 |
really into alchemy. And he spent a lot of time thinking about that and writing about it. And he 00:10:48.720 |
took it very seriously. He was really into Bible interpretation and trying to predict things based 00:10:54.160 |
on the Bible. And so there's also a whole backstory then. And of course, you need to look at it in the 00:10:59.040 |
context and the time that when Newton lived, but it adds to his personality. And it's important to 00:11:07.280 |
also understand those aspects that maybe people are not as proud to teach to little kids, but 00:11:14.880 |
it's important. It was part of who he was and maybe without those, who knows what he would have 00:11:19.760 |
done otherwise. - Well, the cool thing about alchemy, I don't know how it was viewed at the time, 00:11:27.440 |
but it almost like to me symbolizes dreaming of the impossible. Like most of the breakthrough 00:11:34.400 |
ideas kind of seem impossible until they're actually done. It's like achieving human flight. 00:11:39.040 |
It's not completely obvious to me that alchemy is impossible or like putting myself in the mindset 00:11:45.120 |
of the time. And perhaps even still, everything that, some of the most incredible breakthroughs 00:11:54.640 |
would seem impossible. And I wonder the value of believing almost like focusing and dreaming 00:12:03.200 |
of the impossible such that it is actually is possible in your mind and that in itself manifests, 00:12:09.200 |
whether the accomplishing that goal or making progress in some unexpected direction. So alchemy 00:12:16.800 |
almost symbolizes that for me. - I distinctly remember having the same thought of thinking, 00:12:21.920 |
when I learned about atoms and that they have protons and electrons, I was like, okay, to make 00:12:26.240 |
gold, you just take whatever has an atomic weight below it and then shove another proton in there 00:12:30.800 |
and then you have a bunch of gold. So like, why don't people do that? It seemed like conceptually 00:12:35.920 |
is like, this sounds feasible. You might be able to do it. - And you can actually, it's just very, 00:12:40.800 |
very expensive. - Yeah, yeah, exactly. So in a sense, we do have alchemy and maybe even back 00:12:47.440 |
then it wasn't as crazy that he was so into it, but people just don't like to talk about that as 00:12:52.800 |
much. Yeah, but Newton in general was a very interesting fella. - Anybody else come to mind? 00:12:57.520 |
In terms of people that inspire you, in terms of people that you just are happy that they have once 00:13:06.880 |
or still exist on this earth. - I think, I mean, Freeman Dyson for me. Yeah, Freeman Dyson was, 00:13:15.360 |
I've had a chance to actually exchange a couple of emails with him. He was probably one of the 00:13:19.680 |
most humble scientists that I've ever met and that had a big impact on me. We were trying, 00:13:25.360 |
we're actually trying to convince him to annotate a paper on Fermat's library and I sent him an email 00:13:33.200 |
asking him if he could annotate a paper and his response was something like, I have very limited 00:13:40.080 |
knowledge. I just know a couple of things about certain fields. I'm not sure if I'm qualified to 00:13:45.360 |
do that. That was his first response. And this was someone that should have won a Nobel Prize 00:13:51.120 |
and worked on a bunch of different fields, did some really, really great work. And then just 00:13:57.440 |
the interactions that I had with him, every time I asked him a couple of questions about his papers 00:14:02.640 |
and he always responded saying, I'm not here to answer your questions. I just want to open more 00:14:09.280 |
questions. And so that had a big impact on me. It was like just an example of an extremely humble 00:14:17.200 |
yet accomplished scientist. And Feynman was also a big, big inspiration in the sense that he was 00:14:25.680 |
able to be, again, extremely talented and scientist, but at the same time, socially, 00:14:32.080 |
he was able to, he was also really smart from a social perspective and he was able to interact 00:14:38.560 |
with people. He was also a really good teacher and was also to, did awesome work in terms of 00:14:45.520 |
explaining physics to the masses and motivating and getting people interested in physics. 00:14:52.400 |
And that for me was also a big inspiration. - Yeah, I like the childlike curiosity of some 00:14:57.760 |
of those folks, like you mentioned, Feynman. I have Daniel Kahneman, I got a chance to meet 00:15:01.920 |
and interact with. Some of these truly special scientists, what makes them special is that even 00:15:08.320 |
in older age, there's still that fire of childlike curiosity that burns. And some of that is like 00:15:18.000 |
not taking yourself so seriously that you think you've figured it all out, but almost like thinking 00:15:22.880 |
that you don't know much of it. And that's like step one in having a great conversation or 00:15:31.680 |
collaboration or exploring a scientific question. It's cool how the very thing that probably earned 00:15:38.240 |
people the Nobel prize or work that's seminal in some way is the very thing that still burns even 00:15:46.320 |
after they've won the prize. It's cool to see. And they're rare humans, it seems. - And to that 00:15:53.120 |
point, I remember like the last email that I sent to Freeman Dyson was like in his last birthday, 00:15:57.840 |
he was really into number theory and primes. So what I did is I took like a photo of him, 00:16:04.160 |
picture, and then I turned that into like a giant prime number. So I converted the picture into a 00:16:10.880 |
bunch of one and eights, and then I moved some numbers around until it was a prime. 00:16:14.800 |
And then I sent him that. - Oh, so the visual, like it still 00:16:19.440 |
looked like the picture, it was made up of a prime. That's tricky to do. That's hard to do. 00:16:23.760 |
- It looks harder than it actually is. So the way you do it is like you convert the darker regions 00:16:29.920 |
into eights and the lighter regions in ones. And then there's- - And then you just keep flipping 00:16:38.080 |
primality tests that are cheaper from a computational standpoint. But what it tells you is 00:16:44.000 |
it excludes numbers that are not prime. Then you end up with a set of numbers that you don't know 00:16:48.480 |
if they are prime or not. And then you run the full primality test on that. So you just have 00:16:53.200 |
to keep iterating on that. And it's funny because when he got the picture, he was like, "How did 00:16:58.160 |
you do that?" He was super curious too. And then we got into the details. And again, it was already 00:17:03.840 |
90, I think 92 or something. And that curiosity was still there. So you can really see that in 00:17:12.160 |
some of these scientists. - So could we talk about Fermat's library? 00:17:16.000 |
- Yeah, absolutely. - What is it? What's the main goal? 00:17:21.280 |
What's the dream? - It is a platform for annotating papers 00:17:25.280 |
in its essence. And so academic papers can be one of the densest forms of content out there and 00:17:32.000 |
generally pretty hard to understand at times. And the idea is that you can make them more accessible 00:17:38.800 |
and easier to understand by adding these rich annotations to the side. And so we can just 00:17:43.680 |
imagine a PDF view on your browser, and then you have annotations on each side. And then when you 00:17:48.000 |
click on them, a sidebar expands and then you have annotations that support LaTeX and Markdown. 00:17:54.160 |
And so the idea is that you can say, explain a tougher part of a paper where there's a step 00:17:59.680 |
that is not completely obvious, or you can add more context to it. And then over time, 00:18:05.520 |
papers can become easier and easier to understand and can evolve in a way. But it really came from 00:18:12.640 |
myself, Luis and two other friends. We've had this long running habit of kind of running a 00:18:19.600 |
journal club amongst us. We come from different backgrounds. I studied CS, we studied physics. And 00:18:24.720 |
so we'd read papers and present them to each other. And then we tried to bring some of that online. 00:18:32.320 |
And that's when we decided to build Fermat's library. And then over time, it kind of 00:18:39.200 |
grew into something with a broader goal. And really what we're trying to do is trying to help 00:18:49.360 |
move science in the right direction. That's really the ultimate goal and where we want to take it now. 00:18:55.920 |
So there's a lot to be said. So first of all, for people who haven't seen it, 00:18:59.600 |
the interface is exceptionally well done. Execution is really important here. 00:19:06.560 |
The other thing is just to mention for a large number of people, apparently, which is new to me, 00:19:12.960 |
don't know what LaTeX is. So it's spelled like latex. So be careful googling it if you haven't 00:19:19.600 |
before. It's a, sorry, I don't even know the correct terminology. 00:19:25.280 |
It's a type setting language where you're basically writing a program that then generates 00:19:32.240 |
something that looks from a typography perspective, beautiful. 00:19:37.840 |
And so a lot of academics use it to write papers. I think there's like a bunch of communities that 00:19:44.640 |
use it to write papers. I would say it's mathematics, physics, computer science. 00:19:52.400 |
Because I'm collaborating currently on a paper with two neuroscientists from Stanford. 00:19:58.240 |
So I'm using Microsoft Word and Mendeley and like all of those kinds of things. 00:20:08.560 |
And I'm being very Zen like about the whole process, but it's fascinating. 00:20:13.200 |
It's a little heartbreaking actually, because it actually, it's funny to say, 00:20:20.160 |
but we'll talk about open science, actually the bigger mission behind it for Mars Libraries, like 00:20:25.120 |
really opening up the world of science to everybody. 00:20:28.480 |
Is these silly two facts of like one community uses LaTeX and another uses Word, 00:20:40.080 |
It's like boring and practical in a sense, but this makes it very difficult to collaborate. 00:20:45.440 |
Just on that, I think that if there are some people that should have received like a Nobel 00:20:50.720 |
Prize, but will never get it. And I think one of those is like Donald Knuth because of tech 00:20:55.920 |
and LaTeX then, because it had a huge impact in terms of like just making it easier for 00:21:02.560 |
researchers to put their content out there, like making it uniform as much as possible. 00:21:17.120 |
I mean, he had a very young age, got the Turing Award for his work in algorithms and so on. 00:21:21.440 |
So like an incredibly, I think it's in, it might be even the sixties, but I think it's the seventies. 00:21:27.760 |
So when he was really young and then he went on to do like incredible work with his book and yeah, 00:21:36.320 |
And going back just on the reason why we ended up, because I think this is interesting. 00:21:41.360 |
The reason why we ended up using the name Fermat's Library, this was because of Fermat's 00:21:46.560 |
Last Theorem. And Fermat's Last Theorem is actually a funny story. So Pierre de Fermat, 00:21:51.440 |
he was like a lawyer and he wrote like on a book that he had a solution to Fermat's Last Theorem, 00:22:03.520 |
And so Fermat's Last Theorem basically states that there is no solution. 00:22:08.720 |
If you have integers a, b and c, there's no solution to a to the power of n plus b to the 00:22:14.480 |
power of n equals to c to the power of n if n is bigger than two. So there's no solutions. 00:22:21.920 |
And he said that, and that problem remained open for almost 300 years, I believe. And a lot of the 00:22:30.560 |
most famous mathematicians tried to tackle that problem. No one was able to figure that out until 00:22:35.680 |
Andrew Wiles, I think was in the nineties, was able to publish the solution, which was, I believe, 00:22:41.920 |
almost 300 pages long. And so it's kind of an anecdote that, you know, there's a lot of knowledge 00:22:49.200 |
and insights that can be trapped in the margins, and there's a lot of potential energy that you 00:22:55.520 |
can release if you actually spend some time trying to digest that. And that was the origin story for 00:23:03.920 |
the name. - Yes, you can share the contents of the margins with the world. 00:23:11.360 |
or a communication that then leads to a solution. - And if you think about papers, like papers are, 00:23:16.640 |
as João was saying, probably one of the densest pieces of text that any human can read. And you 00:23:22.080 |
have these researchers, like some of the brightest minds in these fields, working on like new 00:23:27.680 |
discoveries and publishing these work on journals that are imposing them restrictions in terms of 00:23:32.800 |
the number of pages that they can have to explain a new scientific breakthrough. So at the end of 00:23:38.400 |
the day, papers are not optimized for clarity and for a proper explanation of that content, 00:23:46.320 |
because there are so many restrictions. So there's, as I mentioned, there's a lot of potential energy 00:23:51.200 |
that can be freed if you actually try to digest a lot of the contents of papers. 00:23:57.120 |
- Can you explain some of the other things? So margins, librarian, journal club? 00:24:02.240 |
- So journal club is what a lot of people know us for, where we, every week, we release an 00:24:07.600 |
annotated paper and in all sorts of different fields, but physics, CS, math. Margins is kind of 00:24:13.760 |
the same software that we use to run the journal club and to host the annotations, but we've made 00:24:18.880 |
that available for free to anybody that wants to use it. And so folks use it at universities and 00:24:25.760 |
for running journal clubs. And so we've just made that freely available. And then librarian is a 00:24:33.440 |
browser extension that we developed that is sort of an overlay on top of archive. So it's about 00:24:38.560 |
bringing some of the same functionality around comments, plus adding some extra niceties to 00:24:46.480 |
archive, like being able to very easily extract the references of a paper that you're looking at, 00:24:51.200 |
or being able to extract the BibDeck in order to cite that paper yourself. So it's an overlay on 00:24:56.640 |
top of archive. - The idea is that you can have that commenting interface without having to leave 00:25:01.280 |
archive. - It's kind of incredible. I didn't know about it. And once I learned of it, it's like, 00:25:07.680 |
holy shit, why isn't it more popular given how popular archive is? Like everybody should be 00:25:14.240 |
using it. Archive sucks in terms of its interface. Or let me rephrase that, it's limited. Yeah. 00:25:21.600 |
- Archive is a pretty incredible project, right? And it is, in a way, it's, you know, 00:25:28.880 |
the growth has been completely linear over time. If you look at like number of papers published on 00:25:33.760 |
archive, like, you know, it's pretty much a straight line for the past 20 years. Especially 00:25:38.480 |
if you're coming from a startup background and then you were trying to do archive, 00:25:43.840 |
you'd probably try to like all sorts of growth acts and like try to then maybe like have paid 00:25:49.600 |
features and things like that. And that would kind of maybe ruin it. And so there's a subtle 00:25:55.360 |
balance there. And I don't know what aspects you can change about it. 00:25:59.280 |
- Yeah. For some tools in science, it just takes time for them to grow. Archive is just turned 30, 00:26:05.920 |
I believe. And for people that don't know, archive is this kind of online repository where people put 00:26:11.440 |
preprints, which are versions of the papers before they actually make it to journals. 00:26:20.320 |
- For people who don't know. And it's actually a really vibrant place to publish your papers and 00:26:25.440 |
in the aforementioned communities of mathematics, physics and computer science. 00:26:30.800 |
- It started with mathematics and physics. And then over the last 30 years, it evolved. And now 00:26:36.400 |
actually computer science now, it's a more popular category than physics and math on archive. 00:26:42.080 |
- And there's also, which I don't know very much about like a biology, medical version of that. 00:26:50.560 |
- It's interesting because if you look at like these platforms for preprints, 00:26:56.320 |
they actually play a super important role. Because if you look at a category like math, 00:27:02.800 |
for some papers in math, it might take close to three years after you click upload paper on the 00:27:11.440 |
journal website and the paper gets published on the website of the journal. So this is literally 00:27:16.720 |
the longest upload period on the internet. And during those three years, that content is just 00:27:26.400 |
locked. And so that's why it's so important for people to have websites like ARXIV so that you 00:27:32.560 |
can share that before it goes to the journal with the rest of the world. That was actually on ARXIV 00:27:37.440 |
that Perelman published the three papers that led to the proof of the Poincaré conjecture. 00:27:44.240 |
And then you have other fields like machine learning, for instance, where the field is 00:27:49.440 |
evolving at such a high rate that people don't even wait before the papers go to journals before 00:27:54.960 |
they start working on top of those papers. So they publish them on ARXIV, then other people 00:27:59.200 |
see them, they start working on that. And ARXIV did a really good job at like building that core 00:28:04.800 |
platform to host papers. But I think there's a really, really big opportunity in building more 00:28:10.240 |
features on top of that platform, apart from just hosting papers. So collaboration, annotations, 00:28:15.520 |
like having other things apart from papers like code and other things. Because, for instance, 00:28:23.440 |
in the field like machine learning, there's a really big... As I mentioned, people start working 00:28:28.000 |
on top of preprints and they are assuming that that preprint is correct. But you really need a 00:28:34.960 |
way, for instance, to maybe... It's not peer review, but distinguish what is good work from 00:28:41.280 |
bad work on ARXIV. How do you do that? So like a commenting interface like Librarian, it's useful 00:28:46.960 |
for that so that you can distinguish that in a field that is growing so fast as machine learning. 00:28:53.440 |
And then you have platforms that focus, for instance, on just biology. BioARXIV is a good 00:28:59.040 |
example. BioARXIV is also super interesting because there's actually an interesting experiment 00:29:07.520 |
that was run in the '60s. So in the '60s, the NIH supported this experiment called the Information 00:29:16.400 |
Exchange Group, which at the time was a way for researchers to share biology preprints via mail 00:29:24.320 |
or using libraries. And that project in the 1960s got canceled six years after it started. And it 00:29:30.800 |
was due to intense pressure from the journals to kill that project because they were fearing 00:29:36.080 |
competition from the preprints for the journal industry. Creek was one of the famous scientists 00:29:46.640 |
that opposed to the Information Exchange Group. And it's interesting because right now, if you 00:29:53.040 |
analyze the number of biology papers that appear first as preprints, it's only 2% of the papers. 00:29:59.600 |
And this was almost 50 years after that first experiment. So you can see that pressure from 00:30:06.400 |
the journals to cancel that initial version of a preprint repo had a tremendous impact on the 00:30:14.160 |
number of papers that are showing up in biology as preprints. So it delayed a lot that revolution. 00:30:21.680 |
But now platforms like BioRxiv are doing that work. But there's still a lot of room for growth 00:30:27.040 |
there. And I think it's super important because those are the papers that are open, 00:30:30.000 |
that everyone can read. - Okay, so but if we just look at the entire process of science as a big 00:30:35.600 |
system, can we just talk about how it can be revolutionized? So you have an idea, depending 00:30:43.280 |
on the field, you wanna make that idea concrete, you wanna run a few experiments in computer science, 00:30:49.040 |
there might be some code, there'd be a data set for some of the more sort of biology, psychology, 00:30:58.000 |
you might be collecting the data set, that's called a study, right? So that's part of that, 00:31:06.720 |
that's part of the methodology. And so you are putting all of that into a paper form. 00:31:12.640 |
And then you have some results. And then you submit that to a place for review through the 00:31:20.400 |
peer review process. And there's a process where, how would you summarize the peer review process, 00:31:25.680 |
but it's really just like a handful of people look over your paper and comment and based on that, 00:31:31.040 |
decide whether your paper is good or not. So there's a whole broken nature to it. At the same 00:31:37.840 |
time, I love the peer review process when I buy stuff on Amazon, for like the commenting system, 00:31:46.800 |
whatever that is. So, okay, so there's a bunch of possibilities for revolutions there. And then 00:31:51.440 |
there's the other side, which is the collaborative aspect of the science, which is people annotating 00:31:56.720 |
people commenting sort of the low effort collaboration, which is a comment. Sometimes 00:32:02.880 |
as you've talked about, a comment can change everything, but, or a higher effort collaboration, 00:32:08.400 |
like more like maybe annotations or even like contributing to the paper. You can think of like 00:32:14.000 |
a collaborative updating of the paper over time. So there's all these possibilities for 00:32:21.920 |
doing things better than they've been done. Can we talk about some ideas in this space, 00:32:29.200 |
some ideas that you're working on, some ideas that you're not yet working on, but should be 00:32:34.400 |
revolutionized? Because it does seem that archive and like open review, for example, 00:32:41.120 |
are like the Craigslist of science. Like, yeah, okay. I'm very grateful that we have it, but it 00:32:49.840 |
just feels like it's like 10 to 20 years. Like it doesn't feel like that's a feature. The simplicity 00:32:56.400 |
of it is a feature. It feels like it's a bug. But then again, the pushback there is Wikipedia has 00:33:05.200 |
the same kind of simplicity to it. And it seems to work exceptionally well in the crowdsourcing 00:33:13.040 |
aspect of it. So, sorry, there's a bunch of stuff going on on the table. Let's just pick random 00:33:18.400 |
things that we can talk about. - Wikipedia, for me, it's the cosmological constant of the internet. 00:33:24.640 |
I think we are lucky to live in the parallel universe where Wikipedia exists. Because if 00:33:29.680 |
someone had pitched me Wikipedia, like a publicly edited encyclopedia, like a couple of years ago, 00:33:36.320 |
like it would be, I don't know how many people would have said that that would have survived. 00:33:41.280 |
- I mean, it makes almost no sense. It's like having a Google doc that everybody on the internet 00:33:45.840 |
can edit. And like, that will be like the most reliable source for knowledge. 00:33:50.240 |
- I don't know how many, but hundreds of thousands of topics. - Yeah. It's insane. 00:33:56.880 |
- It's insane. And then you have users, like there's a single user that edited one third 00:34:02.960 |
of the articles on Wikipedia. So we have these really, really big power users. They are 00:34:07.520 |
a substantial part of like what makes Wikipedia successful. And so, like, no one would have ever 00:34:16.640 |
imagined that that could happen. And so that's one thing. I completely agree with what you just said. 00:34:23.680 |
- Sorry to interrupt briefly. Maybe let's inject that into the discussion of everything else. 00:34:28.880 |
I also believe, I've seen that with Stack Overflow, that one individual or a small collection 00:34:35.120 |
of individuals contribute or revolutionize most of the community. Like if you create a really 00:34:42.000 |
powerful system for archive or like open review and made it really easy and compelling and exciting 00:34:50.960 |
for one person who is in like a 10X contributor to do their thing, that's going to change everything. 00:34:57.840 |
It seems like that was the mechanism that changed everything for Wikipedia. And that's the mechanism 00:35:02.240 |
that changed everything for Stack Overflow is gamifying or making it exciting or just making it 00:35:07.760 |
fun or pleasant or fulfilling in some way for those people who are insane enough to like answer 00:35:15.280 |
thousands of questions or write thousands of factoids and like research them and check them, 00:35:21.920 |
all those kinds of things, or read thousands of papers. - Yeah. No, Stack Overflow is another 00:35:27.120 |
great example of that. And it's just, and those are both two incredibly productive communities 00:35:34.080 |
that generate a ton of value and capture almost none of it. And in a way it's almost like counter, 00:35:43.520 |
it's very counterintuitive that people, that these communities would exist and thrive. 00:35:51.120 |
And it's really hard to, there aren't that many communities like that. 00:35:57.840 |
- So how do we do that for science? Do you have ideas there? Like what are the biggest 00:36:02.800 |
problems that you see? Are you working on some of them? - Just on that, there are a couple of 00:36:06.960 |
really interesting experiments that people are running. An example would be like the polymath 00:36:11.680 |
projects. So this is a kind of a social experiment that was created by Tim Gowers, 00:36:17.600 |
Fields medalist. And his idea was to try to prove that is it possible to do mathematics in a 00:36:24.560 |
massively collaborative way on the internet? So he decided to pick a couple of problems and test 00:36:32.240 |
that. And they found out that it actually, it is possible for specific types of problems, 00:36:38.080 |
namely problems that you're able to break down in little pieces and go step by step. You might need, 00:36:44.320 |
as with open source, you might need people that are just kind of reorganizing the house every 00:36:50.960 |
once in a while. And then people throw a bunch of ideas and then you make some progress, then you 00:36:56.560 |
reorganize, you reframe the problem and you go step by step. But they were actually able to prove 00:37:01.920 |
that it is possible to collaborate online and do progress in terms of mathematics. And so I'm 00:37:11.040 |
confident that there are other avenues that could be explored here. - Can we talk about peer review, 00:37:15.760 |
for example? - Absolutely. I think like in terms of the peer review, I think it's important to look 00:37:22.400 |
at the bigger picture here of like, of what the scientific publishing ecosystem looks like. 00:37:30.320 |
Because for me, there are a lot of things that are wrong about that entire process. So if you 00:37:36.640 |
look at, for instance, at what publishing means in like a traditional journal, you have journals that 00:37:43.520 |
pay authors for their articles, and then they might pay like reviewers to review those articles. 00:37:52.400 |
And finally, they pay people to, or distributors to distribute the content. In the scientific 00:37:59.680 |
publishing world, you have scientists that are usually backed by government grants, they are 00:38:04.800 |
giving away their work for free in the form of papers. And then you have other scientists that 00:38:10.160 |
are reviewing their work. This process is known as the peer review process, again for free. And 00:38:16.640 |
then finally, we have government backed universities and libraries that are buying back all that work 00:38:27.040 |
so that other scientists can read. So this is, for me, it's bizarre. You have the government that is 00:38:32.240 |
funding the research, is paying the salaries of the scientists, is paying the salaries of the 00:38:36.560 |
reviewers, and it's buying back all that product of their work again. And I think the problem with 00:38:43.600 |
this system, and it's why it's so difficult to break this suboptimal equilibrium, is because of 00:38:51.680 |
the way academia works right now and the way you can progress in your academic life. And so, 00:38:58.720 |
in a lot of fields, the competition in academia is really insane. So you have hundreds of PhD 00:39:06.320 |
students that are trying to get to a professor position, and it's hyper competitive. And the 00:39:14.320 |
only way for you to get there is if you publish papers, ideally in journals, with a high impact 00:39:23.360 |
factor. In computer science, it's often conferences that are also very prestigious, or actually more 00:39:28.240 |
prestigious than journals now. Okay, interesting. So that's the one discipline where, I mean, 00:39:33.120 |
that has to do with the thing we've discussed in terms of how quickly the field turns around. 00:39:38.960 |
But like, NeurIPS, CVPR, those conferences are more prestigious, or at the very least, 00:39:45.520 |
as prestigious as the journals. But yeah, but doesn't matter. The process is what it is. 00:39:49.840 |
So for people that don't know, the impact factor of a journal is basically the average number of 00:39:56.080 |
citations that a paper would get if it gets published on that journal. But so, you can 00:40:02.400 |
really think that the problem with the impact factor is that it's a way to turn papers into 00:40:10.480 |
accounting units. And let me unpack this, because the impact factor is almost like a nobility title. 00:40:18.880 |
So because papers are born with impact, even before anyone reads them. So the researchers, 00:40:24.560 |
they don't have the incentive to care about if this paper is going to ever have a long-term impact 00:40:31.200 |
on the world. What they care, their goal, their end goal is the paper to get published. 00:40:35.680 |
So that they get that value up front. So for me, that is one of the problems of that. And that 00:40:42.480 |
really creates a tyranny of metrics. Because at the end of the day, if you are a dean, what you 00:40:48.240 |
want to hire is like people, researchers that publish papers on journals with high impact 00:40:53.600 |
factors, because that will increase the ranking of your university and will allow you to charge 00:40:58.400 |
more for tuition, so on and so forth. And that, especially when you are in super competitive areas, 00:41:06.720 |
that people will try to gamify that system and misconduct starts showing up. 00:41:12.720 |
There's a really interesting book on this topic called Gaming the Metrics. It's a book by a 00:41:19.920 |
researcher called Mario Biagioli. It goes a lot into like how the impact factor and metrics 00:41:27.440 |
affect science negatively. And it's interesting to think, especially in terms of citations, 00:41:32.400 |
if you look at the early work of like looking at citations, there was a lot of work that was 00:41:37.280 |
done by a guy called Eugene Garfield. And this guy, the early work in terms of citation, they 00:41:42.560 |
wanted to use citations as from a descriptive point of view. So what they wanted to create was 00:41:49.600 |
a map. And that map would create a visual representation of influence. So citations 00:41:56.080 |
would be links between papers. And ideally what they would show, they would represent is that you 00:42:02.560 |
read someone else's paper and it had an impact on your research. They weren't supposed to be counted. 00:42:07.920 |
I think this inspired like Larry and Sergey's work, right? For Google. 00:42:12.720 |
Exactly. I think they even mentioned that. But what happens is like, as you start counting citations, 00:42:17.680 |
you create a market. And the same way, like, and this was the work of Eugene Garfield was a big 00:42:24.240 |
inspiration for Larry and Sergey for the page rank algorithm that led to the creation of Google. 00:42:30.960 |
And they even recognized that. And if you think about it, it's like the same way there's a gigantic 00:42:36.400 |
market for search engine optimization, SEO, where people try to optimize the page rank and how a 00:42:45.360 |
web page will rank on Google. The same will happen for papers. People will try to optimize 00:42:51.680 |
the impact factors and the citations that they get. And that creates a really big problem. 00:42:57.440 |
And it's super interesting to actually analyze them. If you look at the distribution of the 00:43:02.960 |
impact factors of journals, you have like nature with nature, I believe it's like in the low 40s. 00:43:09.760 |
And then you have, I believe science is high 30s. And then you have a really good set of good 00:43:16.480 |
journals that will fall between 10 and 30. And then you have a gigantic tale of journals that 00:43:23.040 |
have impact factor below two. And you can really see two economies here. You see the universities 00:43:31.520 |
that are maybe less prestigious, less known, that where the faculty are pressured to just 00:43:37.920 |
publish papers, regardless of the journal, what I want to do is increase the ranking of my 00:43:42.960 |
university. And so they end up publishing as many papers as they can in journals with low impact 00:43:49.200 |
factor. And unfortunately, this represents a lot of the global South. And then you have the luxury 00:43:57.520 |
good economy. So for instance, and there are also problems here in the luxury good economy. 00:44:03.360 |
So if you look at the journal like nature, so with impact factor of like in the low 40s, 00:44:09.600 |
there's no way that you're going to be able to sustain that level of impact factor by just 00:44:15.440 |
grabbing the attention of scientists. What I mean by that is like, for the journals, 00:44:22.080 |
the articles that get published in nature, they need to be New York Times great. So they need to 00:44:29.280 |
make it to the big media, they need to be captured by the big media. Because that's the only way for 00:44:35.120 |
you to capture enough attention to sustain that level of citations. And that of course, creates 00:44:41.360 |
problems because people then will try to, again, gamify the system and have like titles or abstracts 00:44:48.320 |
or that are bigger, make claims that are bigger than what is actually can be sustained by the data 00:44:57.120 |
or the content of the paper. And you will have clickbait titles or clickbait abstracts. And 00:45:02.720 |
again, this is all a consequence of metrics and science of metrics. And this is a very dangerous 00:45:10.320 |
cycle that I think it's very hard to break, but it's happening in academia in a lot of fields 00:45:16.240 |
right now. - Is it fundamentally the existence of metrics or the metrics just need to be significantly 00:45:21.840 |
improved? Because like I said, the metrics used for Amazon for purchasing, I don't know, computer 00:45:30.720 |
parts is pretty damn good in terms of selecting which are the good ones, which are not. In that 00:45:35.920 |
same way, if we had Amazon type of review system in the space of ideas, in the space of science, 00:45:44.000 |
it feels like those metrics would be a little bit better. Sort of when it's significantly more open 00:45:52.800 |
to the crowdsource nature of the internet, of the scientific internet, meaning as opposed to, 00:46:00.000 |
like my biggest problem with peer review has always been that it's like five, six, seven people, 00:46:06.880 |
usually even less. And it's often nobody's incentivized to do a good job in the whole 00:46:13.360 |
process. Meaning it's anonymous in a way that doesn't incentivize, like doesn't gamify or 00:46:22.480 |
incentivize great work. And also it doesn't necessarily have to be anonymous. Like there 00:46:30.080 |
has to be, the entire system doesn't encourage actual sort of rigorous review. For example, like 00:46:39.680 |
open review does kind of incentivize that kind of process of collaborative review, 00:46:47.280 |
but it's also imperfect. It just feels like the thing that Amazon has, which is like thousands 00:46:52.960 |
of people contributing their reviews to a product. It feels like that could be applied to science 00:47:00.640 |
where the same kind of thing you're doing with Fermat's library, but doing at a scale that's much 00:47:08.000 |
larger. It feels like that should be possible given the number of grad students, given the number of 00:47:16.240 |
general public that's, like for example, I personally, as a person who got an education 00:47:22.560 |
in mathematics and computer science, like I can be a quote unquote like reviewer on a lot bigger 00:47:32.480 |
set of things than is my exact expertise. If I'm one of thousands of reviewers, if I'm the only 00:47:41.600 |
reviewer, one of five, then I better be like an expert in the thing. But if I, and I've learned 00:47:47.680 |
this with COVID, which is like, you can just use your basic skills as a data analyst as a, 00:47:54.320 |
and to contribute to the review process on a particular little aspect of a paper and be able 00:47:59.200 |
to comment, be able to sort of draw in some references that challenge the ideas presented 00:48:05.680 |
or to enrich the ideas that are presented, or, you know, it just feels like crowdsourcing 00:48:12.240 |
the review process would be able to allow you to have metrics in terms of how good a paper is 00:48:20.480 |
that are much better representative of its actual impact in the world, of its actual value to the 00:48:26.080 |
world, as opposed to some kind of arbitrary gamified version of its impact. - I agree with 00:48:35.440 |
that. I think we, there's definitely the possibility, at least for more resilient, 00:48:39.520 |
a more resilient system than what we have today. And it's, I think that's kind of what you're 00:48:43.280 |
describing, Alex. And I mean, to an extent we kind of have like a little bit of a Heisenberg 00:48:50.160 |
uncertainty principle. When you pick a metric, as soon as you do it, then maybe it works as a good 00:48:54.480 |
heuristic for a short amount of time, but soon enough people would start gamifying and, but then 00:49:00.480 |
you can definitely have metrics that are more resilient to gamification and they'll work as 00:49:05.200 |
a better heuristic to try to push you in the best direction. - But I guess the underlying problem 00:49:12.000 |
you're saying is there's a shortage of positions in academia. - That's a big problem for me. - Yeah. 00:49:18.080 |
And that, and so they're going to be constantly gamifying the metrics. - It's a bit of a zero 00:49:22.800 |
sum game. - It's a very competitive, it's a very competitive field. And that's what usually happens 00:49:28.080 |
in very competitive fields. - Yeah. - But I think some of like the peer review problems, like scale 00:49:34.880 |
helps, I think. And it's interesting to look at like what you're mentioning, breaking it down, 00:49:38.880 |
maybe in like smaller parts and having more people jumping in. But this is definitely a problem. 00:49:47.040 |
And the peer review problem, as I mentioned, is correlated with the problem of like academic 00:49:52.240 |
career progression and it's all intertwined. And that's why I think it's so hard to break it. 00:49:57.920 |
There are like a couple of really interesting things that are being done right now. There are a 00:50:02.800 |
couple of, for instance, journals that are overlay journals on top of platforms like archive and 00:50:09.440 |
bio-archive that want to remove like the more traditional journals from the equation. 00:50:14.480 |
So essentially a journal is just a collection of links to papers. And what they are trying to do 00:50:21.360 |
is like removing that middleman and trying to make the review process a little bit more transparent 00:50:28.480 |
and not charging universities. Like there are a couple of more famous ones. There's one discreet 00:50:36.000 |
analysis in mathematics. There's one called the Quantum Journal, which we're actually working with 00:50:41.280 |
them. We have a partnership with them for the papers that get published in Quantum Journal. 00:50:45.840 |
They also get the annotations on formats and they're doing pretty well. They've been able to 00:50:50.480 |
grow substantially. The problem there is getting to critical mass. So it's again, convincing the 00:50:55.200 |
researchers and especially the young researchers that need that impact factor, need those 00:51:01.120 |
publications to have citations to not publish on the traditional journal and go on an open journal 00:51:07.520 |
and publish their work there. I think there are a couple of really high profile scientists, 00:51:12.960 |
people like Tim Gowers, that are trying to incentivize like famous scientists that already 00:51:18.240 |
have tenure and that don't need that to publish that to increase the reputation of those journals 00:51:24.000 |
so that other maybe younger scientists can start publishing on those as well. And so they can try 00:51:29.440 |
to break that vicious cycle of the more traditional journals. - I mean, another possible way to break 00:51:36.720 |
this cycle is to like raise public awareness and just by force, like ban paid journals. Like what 00:51:45.520 |
exactly are they contributing to the world? Like basically making it illegal to forget the fact 00:51:54.560 |
that it's mostly federally funded. So that's a super ugly picture too. But like, why should 00:52:00.800 |
knowledge be so expensive? Like where everyone is working for the public good and then there's 00:52:08.880 |
these gatekeepers that, you know, most people can't read most papers without having to pay money. 00:52:16.000 |
And that doesn't make any sense. Like that should be illegal. - I mean, that's what you're saying 00:52:23.680 |
is exactly right. I mean, for instance, right, I went to school here in the US, we studied in Europe 00:52:28.880 |
and you would sit, like you'd ask me all the time to download papers and send it to him because he 00:52:33.600 |
just couldn't get it and like papers that he needed for his research. And so- - But he's a student, 00:52:38.240 |
like he's- - Yeah, he's a grad student. - He was a grad student, but that, you know, I'm even 00:52:42.560 |
referring to just regular people. - Oh yeah, okay, that too. - And I think during 2020, because of 00:52:50.880 |
COVID, a lot of journals put down the walls for certain kind of coronavirus related papers. 00:52:56.320 |
But like that just gave me an indication that like, this should be done for everything. It's absurd. 00:53:04.080 |
Like people should be outraged that there's these gates because, so the moment you dissolve the 00:53:11.280 |
journals, then there'll be an opportunity for startups to build stuff on top of archive. It'd 00:53:18.720 |
be an opportunity for like Fermat's library to step up, to scale up to something much even larger. 00:53:25.360 |
I mean, that was the original dream of Google, which I've always admired, which is make the 00:53:31.040 |
world's information accessible. Actually, it's interesting that Google hasn't, maybe you guys 00:53:36.000 |
can correct me, but they have put together Google Scholar, which is incredible. And they've 00:53:41.920 |
did the scanning of books, but they haven't really tried to make science accessible in the following 00:53:50.000 |
way. Like besides doing Google Scholar, they haven't like delved into the papers, right? 00:53:57.200 |
- Which is especially curious given what Luis was saying, right? That it's kind of in their 00:54:01.600 |
genesis, there's this research that was very connected with how papers reference each other 00:54:06.960 |
and like building a network out of that. - Interesting enough, like Google, I think 00:54:11.520 |
there was not intended, Google Plus was like the Google social network that caught cancel, 00:54:16.800 |
was used by a lot of researchers. - Yes, it was. 00:54:18.800 |
- With what I think was just a side, kind of a side effect, but then a lot of people ended up 00:54:24.000 |
migrating to Twitter, but it was not on purpose. But yeah, I agree with you, like they haven't 00:54:27.840 |
gone past Google Scholar and I don't know why. - Well, that said, Google Scholar is incredible. 00:54:33.440 |
For people who are not familiar, it's one of the best aggregation of all the scientific work that's 00:54:40.000 |
out there and especially the network that connects all of them, what sites, what, and also trying to 00:54:45.680 |
aggregate all of the versions of the papers that are available there and trying to merge them in a 00:54:50.480 |
way that one particular work, even though it's available in a bunch of places, counts as, 00:54:55.200 |
you know, like a central hub of what that work is across the multiple versions. But that almost 00:55:01.280 |
seems like a fun pet project of a couple of engineers within Google, as opposed to a serious 00:55:08.000 |
effort to make the world science accessible. - But going back to just the journals, when 00:55:13.920 |
you're talking about that, Lex, I believe that in that front, I think we might be past the event 00:55:20.720 |
horizon. So I think the business model for the journals doesn't make sense. They're a middle 00:55:27.840 |
layer that is not adding a lot of value and you see a lot of motions, whereas like in Europe, 00:55:33.040 |
a lot of the papers that are funded by the European Union, they will have to be 00:55:41.280 |
open to the public. And I think there's a lot of... - Bill Gates too, like what the Gates Foundation 00:55:46.880 |
funds, like the demand that it's accessible to everybody. - So I think it's a question of time 00:55:54.560 |
before that wall kind of falls. And that is going to open a lot of possibilities. Because, you know, 00:56:00.880 |
imagine if you had like the layer of, that gigantic layer of papers all available online, 00:56:08.160 |
that unlocks a lot of potential as a platform for people to build things on top of that. 00:56:13.680 |
- But I think to what you're saying, it is weird. Like you can literally go and listen to any song 00:56:20.400 |
that was ever made on your phone, right? You open Spotify and you might not even pay for it. You 00:56:25.840 |
might be on the free version and you can listen to any song that was ever made pretty much. 00:56:29.760 |
But there's like, you don't have access to a huge percentage of academic papers, which is just like 00:56:38.240 |
this fundamental knowledge that we're all funding, but you as an individual don't have access to it. 00:56:43.280 |
And somehow, you know, like the problem for music got solved, but for papers, it's still like... 00:56:49.440 |
- It's just not yet. It could be ad supported, all those kinds of things. And then hopefully 00:56:53.840 |
that would change the way we do science. That's the most exciting thing for me, 00:56:57.600 |
is especially once I started like making videos and this silly podcast thing, I started to realize 00:57:06.400 |
that if you want to do science, one of the most effective ways is to do a couple, 00:57:12.160 |
the paper with a set of YouTube videos, like explaining it. 00:57:17.040 |
- Yeah. That also seems like there's a lot of room for disruption there. What is the paper 2.0 00:57:23.920 |
going to look like? I think like LaTeX and the PDF seems like, if you look at the first paper 00:57:31.680 |
that got published in Nature, and if you look at the paper that got published in Nature today, 00:57:36.320 |
look at the two side by side, they are fundamentally the same. And even though like 00:57:41.280 |
the paper that gets published today, you know, you get even code, like right now people put like 00:57:47.840 |
code on a PDF. And there are so many things that are related to papers today. You have data, 00:57:56.240 |
you have code, you might need videos to better explain the concepts. So I think for me, it's 00:58:03.840 |
natural that there's going to be also an evolution there, that papers are not going to be just the 00:58:08.000 |
static PDFs or LaTeX. There's going to be a next interface. - So in academia, a lot of things that 00:58:16.000 |
you're judged by is often quantity, not quality. I wonder if there's a opportunity to have like, 00:58:23.360 |
I tend to judge people by the best work they've ever done as opposed to... 00:58:28.960 |
I wonder if there's a possibility for that to encourage sort of focusing on the quality 00:58:34.000 |
and not necessarily in paper form, but maybe a subset of a paper, subset of idea, 00:58:39.200 |
almost even a blog post or an experiment. Like why does it have to be published in a journal 00:58:44.000 |
to be legitimate? - And it's interesting that you mentioned that. I also think like, yeah, 00:58:52.240 |
why is that the only format? Why can't a blog post or... We were even experimenting with these 00:59:00.080 |
a few months ago. Can you actually like publish something or like a new scientific breakthrough 00:59:08.160 |
or something that you've discovered in the form of like a set of tweets, 00:59:13.280 |
a Twitter thread? Why can't that be possible? And we were experimenting with that idea. 00:59:21.840 |
We even, yeah, we ran a couple of, like some people submitted a couple of those, 00:59:27.440 |
like I think the limit was three or four tweets. Maybe it's a new way to look at a proof or 00:59:32.880 |
something, but I think it just serves to show that there should be other ways to publish like 00:59:38.160 |
scientific discoveries that don't fit the paper format. - Well, but so even with the Twitter thread, 00:59:44.880 |
it would be nice to have some mechanism of formalizing it and making it 00:59:50.400 |
into an NFT. - Maybe. - Like a concrete thing that you can reference with a link that's unique. 00:59:58.960 |
Because I mean, everything we've been saying, all of that, while being true, it's also true that 01:00:09.120 |
the constraints and the formalism of a paper works well. It like forces you, constraints forces you 01:00:17.920 |
to narrow down your thing and literally put it on paper, but you know. - I agree. - Make concrete. 01:00:27.200 |
And that's why, I mean, it's not broken. It's just could be better. And that's the main idea. 01:00:34.160 |
I think there's something about writing, whether it's a blog post or Twitter thread or a paper, 01:00:41.040 |
that's really nice to concretize a particular little idea that can then be referenced by 01:00:49.520 |
other ideas, then it can be built on top of with other ideas. So let me ask, you've read quite a 01:00:58.400 |
few papers, you've annotated quite a few papers. Let's talk about the process itself. How do you 01:01:06.080 |
advise people read papers? Or maybe you want to broaden it beyond just papers, but just read 01:01:12.640 |
concrete pieces of information to understand the insights that lay within. - I would say for papers 01:01:19.200 |
specifically, I would bring back kind of what Luis was talking about, is that it's important to keep 01:01:24.160 |
in mind that papers are not optimized for ease of understanding. And so, right, there's all sorts of 01:01:31.920 |
restrictions in size and format and language that they can use. And so it's important to keep that 01:01:38.640 |
in mind. And so that if you're struggling to read a paper, that might not mean that the underlying 01:01:45.280 |
material is actually that hard. And so that's definitely something that, especially for us, 01:01:51.760 |
that we read papers and most of the times we'll read papers that are completely outside of our 01:01:57.760 |
comfort zone, I guess. And so it'd be completely new areas to us. So I always try to keep that in 01:02:05.040 |
mind. - So there's usually a certain kind of structure, like abstract introduction, there's 01:02:09.200 |
methodology, depending on the community and so on. Is there something about the process of like 01:02:17.440 |
how to read it, whether you want to skim it to try to find the parts that are easy to understand 01:02:22.080 |
and not, reading it multiple times? Is there any kind of hacks that you can comment on? 01:02:28.720 |
- I remember like Feynman had this kind of hack when he was reading papers where he would basically, 01:02:35.520 |
I think I believe he would read the conclusion of the paper and we would try to just see if he would 01:02:45.360 |
be able to figure out how to get to the conclusion in like a couple of minutes by himself. 01:02:51.040 |
And he would read a lot of papers that way. And I think Fermi also did that almost. And Fermi was 01:02:58.640 |
known for doing a lot of back of the envelope calculations. So he was a master at doing that. 01:03:03.520 |
In terms of like, especially when reading a paper, I think a lot of times people might 01:03:10.640 |
feel discouraged about the first time you read it, it's very hard to grasp or you don't understand a 01:03:18.480 |
huge fraction of the paper. And I think it's having read a lot of papers in my life, I think 01:03:24.000 |
I'm in peace with like the fact that you might spend hours where you're just reading a paper 01:03:29.360 |
and jumping from paper to paper, reading citations. And like your level of understanding of sometimes 01:03:36.960 |
of the paper is very close to 0%. And all of a sudden, everything kind of makes sense in your 01:03:44.400 |
mind. And then you have this quantum jump where all of a sudden you understand the big picture 01:03:51.360 |
of the paper. And this is an exercise that I have to do when reading papers, and especially like 01:03:58.240 |
more complex papers, like, okay, you don't understand because you're just going through 01:04:01.840 |
the process and just keep going. And it might feel super chaotic, especially if you're jumping 01:04:07.280 |
from reference to reference. You might end up with like 20 tabs open and you're reading a ton of 01:04:12.800 |
other papers, but it's just trusting that process, that at the end, like you'll find light. And I 01:04:19.200 |
think for me, that's a good framework when reading a paper. It's hard because you might end up 01:04:25.840 |
spending a lot of time and it looks like you're lost, but that's the process to actually understand 01:04:34.000 |
what they're talking about in the paper. - Yeah, I think that process, I've found a lot of value 01:04:39.840 |
in the process, especially for things outside my field, reading a lot of related work sections 01:04:46.560 |
and kind of going down that path of getting a big context of the field, because what's, 01:04:52.560 |
especially when they're well-written, there's opinions injected into the related work. Like 01:04:58.080 |
what work is important, what is not. And if you read multiple related work sections that cite or 01:05:03.600 |
don't cite each other, like the papers, you get a sense of where the field, where the tensions of 01:05:09.520 |
the field are, where the field is striving. And that helps you put into context, like whether the 01:05:16.400 |
work is radical, whether it's overselling itself, whether it's underselling itself, all those 01:05:22.480 |
things. And added on top of that, I find that often the related work section is the most kind 01:05:31.520 |
of accessible and readable part of a paper, because it's kind of, it's brief to the point, 01:05:38.240 |
it's trying to like summarizing, it's almost like a Wikipedia style article. The introduction is 01:05:43.280 |
supposed to be a compelling story or whatever, but it's often like overselling, there's like an 01:05:48.800 |
agenda introduction. The related work usually has the least amount of agenda, except for the few 01:05:54.800 |
like elements where you're trying to talk shit about previous work, where you're trying to sell 01:05:59.680 |
that you're doing much better. But other than that, when you're just painting where the field 01:06:05.520 |
came from or where the field stands, that's really valuable. And also, again, just to agree 01:06:11.120 |
with Fiona in the conclusion, but it's like, I get a lot of value from the breadth first search, 01:06:16.640 |
kind of read the conclusion, then read the related work, and then go through the references in the 01:06:23.040 |
related work, read the conclusion, read the related work, and just go down the tree until 01:06:29.280 |
you like hit dead ends or run out of coffee. And then through that process, you go back up the tree 01:06:35.280 |
and now you can see the results in their proper context. Unless of course the paper is truly 01:06:40.960 |
revolutionary, which even that process will help you understand that is in fact truly revolutionary. 01:06:47.920 |
You've also, you talked about just following your Twitter thread in a depth first search. 01:06:55.920 |
You talked about that you read the book on Grisha Perlman, Grigori Perlman, and then you had a 01:07:03.360 |
really nice Twitter thread on it and you were taking notes throughout. So at a high level, 01:07:09.840 |
is there suggestions you can give on how to take good notes? Whether it's, we're talking about 01:07:14.880 |
annotations or just for yourself to try to put on paper ideas as you progress through the work in 01:07:22.000 |
order to then like understand the work better? For me, I always try not to underestimate how 01:07:28.320 |
much you can forget within six months after you've read something. I thought you were going to say 01:07:33.040 |
five minutes, but yeah, six months is good. Yeah, or even shorter. And so that's something 01:07:38.400 |
that I always try to keep in mind. And it's often, I mean, every once in a while I'll read back a 01:07:45.200 |
paper that I annotated on Fermat and I'll read through my own annotations and I've completely 01:07:52.960 |
forgotten what I had written. But it also, it's interesting because in a way, after you just 01:08:00.800 |
understood something, you're kind of the best possible teacher that can teach your future self. 01:08:06.720 |
After you've forgotten it, you're kind of your own best possible teacher at that moment. 01:08:14.640 |
And so it can be great to try to capture that. It's brilliant. It just made me kind of 01:08:21.840 |
realize it's really nice to put yourself in the position of teaching an older version of yourself 01:08:30.240 |
that returns to this paper, almost like thinking it literally. That's under explored, but it's super 01:08:36.000 |
powerful because you were the person that you can, if you look at the scale from one, not knowing 01:08:41.760 |
anything about the topic, and 10, you are the one that progressed from one to 10 and you know which 01:08:46.720 |
steps you struggled with. So you're really the best person to help yourself make that transition 01:08:52.240 |
from one to 10. And a lot of the times, I really believe that the framework that we have to expose 01:09:01.600 |
ourselves to be talking to us when we were an expert, when we were taking that class and we 01:09:06.880 |
knew everything about quantum mechanics. And then six months later, you don't remember half of the 01:09:11.360 |
things. How could we make it easier to have those conversations between you and your past expert 01:09:19.680 |
self? I think there might be, it's an under explored idea. I think notes on paper are probably 01:09:27.120 |
not the best way. I'm not sure if it's a combination of video, audio, where it's like you 01:09:32.960 |
have a guided framework that you follow to extract information from yourself so that you can later 01:09:38.480 |
kind of revisit to make it easier to remember. But that's, I think it's an interesting idea worth 01:09:47.040 |
exploring that I haven't seen a lot of people kind of trying to distill that problem. 01:09:53.200 |
Yeah, creating the kind of tools. I find if I record, it sounds weird, but I'll take notes. But 01:10:00.400 |
if I record audio, like little clips of thoughts, like rants, that's really effective at capturing 01:10:10.160 |
something that notes can't. Because when I replay them, for some reason, it loads my brain back 01:10:17.120 |
into where I was when I was reading that in a way that notes don't. Like when I read notes, 01:10:23.520 |
I'll often be like, "Whoa, what? What was I thinking there?" But when I listened to the audio, 01:10:30.640 |
it brings you right back to that place. And maybe with video, with visual, that might be even more 01:10:36.160 |
powerful. I think so. Yeah. And I think just the process of verbalizing it, that alone kind of 01:10:44.080 |
makes you have to structure your thought and put it in a way that somebody else could come and 01:10:49.680 |
understand it. And just the process of that is useful to organize your thoughts. And yeah, 01:10:57.120 |
just that alone. Does the Fermat's Library Journal Club have a video component or no? 01:11:02.480 |
Not natively. Sometimes we'll include videos, but it's always embedded. 01:11:07.840 |
Do people build videos on top of it to explain the paper? Because you're doing all the hard work 01:11:12.720 |
of understanding deeply the paper. We haven't seen that happening too much, but we were actually 01:11:20.880 |
playing around with the idea of creating some sort of podcast version where we try to distill 01:11:26.160 |
the paper on an audio format that not maybe you can have access to. Might be trickier, but there 01:11:32.720 |
are definitely people that could be interested in the paper and that topic, but are not willing 01:11:36.640 |
to read it. But they might listen to a 30-minute episode on that paper. You could reach more people 01:11:42.480 |
and you might even bring the authors to the conversation, but it's tricky, especially for 01:11:47.200 |
more technical papers. We've thought about doing that, but we haven't converged. I'm sure if you 01:11:54.080 |
have any... Well, I'm going to take that as a small project to take one of the Fermat's, almost 01:12:00.560 |
half advertisement and half as a challenge for myself, to take one of the annotated papers and 01:12:05.760 |
use it as a basis for creating a quick video. I've seen, hopefully I'm saying the name correctly, but 01:12:14.720 |
Machine Learning Street Talk. I think that's the name of the show that I recommend highly. 01:12:21.840 |
That's the right name. But they do exactly that, which is multiple hour breakdown of a paper with 01:12:27.760 |
video component. Sometimes with authors, people love it. It's very effective. 01:12:33.120 |
There's also, I've seen, I haven't seen the entire, in its entirety, but I've seen the founder of 01:12:40.000 |
comma.ai, George. I've seen him just taking a paper and then distilling the paper and coding it, 01:12:48.960 |
coding it sometimes doing 10 hours. And he was able to get a lot of people interested in that 01:12:55.600 |
and viewing him. - So I'm a huge fan of that. 01:12:58.480 |
George is a personality. I think a lot of people listen to this podcast for the same reason. 01:13:05.520 |
It's not necessarily the contents. They like to listen to a silly Russian who has a childlike brain 01:13:13.760 |
and mumbles and all those like struggle with ideas. And George is a madman. People just enjoy, 01:13:20.240 |
how is he going to struggle in implementing this particular paper? How is he going to 01:13:24.560 |
struggle with this idea? It's fun to watch and that actually pulls you in. The personality 01:13:28.800 |
is important there. - True. I agree with you, 01:13:32.160 |
but there also, it's visible. There's an extraordinary ability that is there. He's 01:13:39.200 |
talented and you need to have, there's a craft and this guy definitely has talent and he's doing 01:13:44.960 |
something that is not easy. And I think that also draws the attention of people. And the other day, 01:13:50.640 |
we were actually, we ran into this YouTube channel of this guy that was restoring art. 01:13:55.840 |
And it was basically just a video of him. The production is really well done. And it's just 01:14:05.760 |
him taking really old pieces of art and then paintings and then restoring them. But he's 01:14:12.480 |
really good at that. And he describes that process and that draws the attention of people, 01:14:18.080 |
regardless of your craft, be it like annotating a paper, restoring it. 01:14:22.960 |
- Craftsmanship, excellence. Yeah. George is incredibly good at programming. 01:14:27.200 |
You know, those competitive programmers, like Top Coater and all those kinds of stuff. He has 01:14:33.760 |
the same kind of element where the brain just jumps around really quickly. And that's, yeah, 01:14:39.040 |
just like with art restoration. - And it's also motivating. 01:14:41.280 |
- Yeah, it's motivating. But you're right, in watching people who are good at what they do, 01:14:48.080 |
it's motivating, even if the thing you're trying to do is not what they're doing. 01:14:51.600 |
It's just like contagious when they're really good at it. And the same kind of analysis with 01:14:56.160 |
the paper, I think, so not just like the final result, but the process of struggling with it. 01:15:02.400 |
That's really interesting. - Yeah. I think Twitch proved that 01:15:05.840 |
like, you know, that there's really a market for that, for watching people do things that they're 01:15:11.760 |
really good at and you'll just watch it. You will enjoy that, that might even spike your interest 01:15:18.800 |
in that specific topic. And yeah, people will enjoy watching sometimes hours on end of great 01:15:26.080 |
craftsmen. - Do you mind if we talk about some of the 01:15:29.200 |
papers, do any papers come to mind that have been annotated on Fermat's library? 01:15:33.680 |
- The papers that we annotated can be about completely random topics, but that's part of 01:15:38.960 |
what we enjoy as well. It forces you to explore these topics that otherwise maybe you'd never 01:15:43.360 |
run into. And so the ones that come to mind are, to me, are fairly random, but one that I really 01:15:51.680 |
enjoyed learning more about is a paper written by a mathematician, actually, Tom Apostol, 01:16:00.080 |
and about a tunnel in a Greek island off the coast of Turkey. So it's very random. 01:16:10.240 |
So this, okay, so what's interesting about this tunnel? So this tunnel was built in the 6th 01:16:18.000 |
century BC and it was built in the island of Samos, which is, as I said, off the coast of Turkey. 01:16:27.360 |
And they had the city on one side and the other mountain, and then they had a bunch of springs 01:16:34.160 |
on the other side and they wanted to bring water into the city. Building an aqueduct would be 01:16:40.320 |
pretty hard because of the way the mountain was shaped. And it would also, if they were under a 01:16:45.440 |
siege, they could just easily destroy that aqueduct and then the water wouldn't have any 01:16:51.520 |
water supply, the city wouldn't have any water supply. And so they decided to build a tunnel 01:16:56.480 |
and they decided to try to do it quickly. And so they started digging from both ends at the same 01:17:07.280 |
time through the mountain. And so when you start thinking about this, it's a fairly difficult 01:17:14.400 |
problem. And this is like 6th century BC. So you had very limited access to the mathematical tools 01:17:22.640 |
that you had at the time were very limited. And so what this paper is about is about the story 01:17:27.680 |
of how they built it and about the fact that for about 2000 years, the accepted explanation of how 01:17:35.760 |
they built it was actually wrong. And so this tunnel has been famous for a while. There are 01:17:40.240 |
a number of historians that talked about it since ancient Egypt. And the method that they described 01:17:46.800 |
for building it was just wrong. And so these researchers went there and were able to figure 01:17:57.040 |
that out. And so basically, kind of the way that they thought they had built it was basically, 01:18:02.720 |
if you can imagine looking at the mountain from the top and you have the mountain, then you have 01:18:07.920 |
both entrances. And so what they thought, and this is what the ancient historians described, 01:18:14.640 |
is that they effectively tried to draw a right angle triangle with the two entrances at each end 01:18:24.480 |
of the hypotenuse. And the way they did is like they would go around the mountain and kind of 01:18:29.440 |
walking in a grid fashion. And then you can figure out the two sides of the triangle. And then after 01:18:36.560 |
you have that triangle, you can effectively draw two smaller triangles at each entrance that are 01:18:43.680 |
proportional to that big triangle. And then you kind of have arrows pointing in each way. 01:18:49.360 |
And then you know at least that you have a line going through the mountain that connects 01:18:56.400 |
both entrances. The issue with that is like, once you go to this mountain and you start thinking of 01:19:03.680 |
doing this, you realize that, especially given that the tools that they had at the time, that 01:19:08.480 |
your error margin would be too small. You wouldn't be able to do it. Just the fact of trying to build 01:19:17.440 |
this triangle in that fashion, the error would accumulate and you would end up missing. You'd 01:19:22.080 |
start building these tunnels and they would miss each other. - So the task ultimately is to figure 01:19:26.000 |
out like really perfectly, as close as possible, the direction you should be digging. First of all, 01:19:31.280 |
that it's possible to have a straight line through and then what that direction would be. 01:19:37.040 |
And then you are trying to infer that by constructing a right triangle. I'm not exactly 01:19:44.320 |
sure about how to do that rigorously, like by tracing the mountain, by walking along the mountain. 01:19:50.560 |
You said grids? - Yeah, you kind of walk as if you were in a grid and so you just walk in right 01:19:56.720 |
angles. - But then you have to walk really precisely then. You have to use tools to measure 01:20:02.880 |
this and the terrain is probably a mess. So this makes more sense in 2D and 3D gets even weirder. 01:20:09.760 |
So, okay, gotcha. - But so this method was described by an ancient Egyptian historian, 01:20:16.640 |
I think, Hero of Alexandria. And then for about 2000 years, that's how we thought that they'd 01:20:24.160 |
built this tunnel. And then these researchers went there and found out that actually they must have 01:20:32.720 |
to use other methods. And then in this paper, they describe these other methods. And of course, 01:20:39.680 |
they can't know for sure, but they presented a bunch of plausible alternatives. The one that 01:20:46.000 |
for me is the most plausible is that what they probably must have done is to use something that 01:20:51.120 |
is similar to an iron sight on a rifle, the way you can line up your rifle with a target off in 01:20:59.280 |
the distance by having an iron sight. And they must have done something similar to that effectively 01:21:09.280 |
with three sticks. And that way they were able to line up sticks along the side of the mountain 01:21:17.600 |
that were all on the same height. And so that then you could get to the other side, 01:21:22.400 |
and then you could draw that line. So this for me is the most plausible way that they might have 01:21:29.200 |
done that. But then they described this in detail and other possible approaches in this paper. 01:21:36.240 |
- So this is a mathematician doing this? - Yeah, this is a mathematician that did this. 01:21:41.040 |
- Which I suppose is the right mindset instead of skills required to solve an ancient problem. 01:21:51.040 |
there are a lot of things. - Because they didn't have computers or 01:21:54.960 |
drones or LIDAR back then or whatever technology you would use modern day for civil engineering. 01:21:59.680 |
- Yeah. And another fascinating thing is that effectively after the downfall of the Roman 01:22:07.120 |
civilization, people didn't build tunnels for about a thousand years. We go a thousand years 01:22:12.160 |
without tunnels. And then only in late middle ages that we start doing them again. 01:22:17.600 |
But here is a tunnel like sixth century BC, like incredibly limited mathematics. And they build it 01:22:23.920 |
in this way. And it was a mystery for a long time exactly how they did it. And then these 01:22:30.880 |
mathematicians went there and basically with no archeology kind of background, were able to figure 01:22:37.440 |
it out. - How do annotations for this paper look like? What's a successful annotation for paper 01:22:43.920 |
like this? - Yeah. So sometimes you're for this paper, 01:22:47.360 |
sometimes adding some more context on a specific part. Like sometimes they mentioned, for instance, 01:22:56.880 |
these instruments that were common in ancient Greece and ancient Rome for building things. 01:23:04.640 |
And so in some of those annotations, I described these instruments in more detail and how they 01:23:10.800 |
worked. 'Cause sometimes it can be hard to visualize these. Then this paper, I forget exactly 01:23:20.560 |
when this was published. I believe maybe the seventies. But then there was further research 01:23:27.840 |
into this tunnel and other interesting aspects about it. I add those to that paper as well. 01:23:34.080 |
There's historical context that I also go into there. For instance, the fact that as I said, 01:23:40.640 |
that effectively after the downfall of the Roman empire, no tunnels were built. Like this is 01:23:44.960 |
something that I added to the paper as well. Yeah. So those are- - So when other people look at the 01:23:50.960 |
paper, how did they usually consume the annotations? So it's like, is there a commenting feature? 01:23:58.000 |
I mean, like this is a really enriching experience the way you read a paper. 01:24:02.240 |
What aspects do people usually talk about that they value from this? 01:24:07.760 |
Yeah. So anybody can just go on there and either add a new annotation or other comments to an 01:24:15.680 |
existing annotation. And so you can start a kind of a thread within an existing annotation. 01:24:21.280 |
And that's something that happens relative frequency. And then because I was the original 01:24:26.800 |
author of the initial annotation, I get pinged. And so oftentimes I'll go back and add on to that 01:24:34.080 |
thread. - How'd you pick the paper? I mean, first of all, this whole process is really exciting. 01:24:39.360 |
I'm gonna, especially after this conversation, I'm gonna make sure I participate much more 01:24:44.320 |
actively on papers that I know a lot about and on paper I know nothing about. I should- - You get 01:24:49.280 |
to annotate the paper. - I would love to. I also, I mean, I realized that there's a, 01:24:56.880 |
like, it's an opportunity for people like me to publicly annotate a paper. Like- - Or do an AMA 01:25:06.560 |
around the paper. - Yeah, exactly. But yeah, but like be in the conversation about a paper. It's 01:25:13.760 |
like a place to have a conversation about an idea. The other way to do it that's much more ad hoc is 01:25:19.680 |
on Twitter, right? But this is more like formal and you could actually probably integrate the two. 01:25:24.880 |
They have a conversation about the conversation. So the Twitter is the conversation about a 01:25:29.360 |
conversation and the main conversation is in the space of annotations. - There's an interesting 01:25:34.400 |
effect that we see sometimes with the annotations on our papers is that a lot of people, especially 01:25:39.280 |
if the annotations are really well done, people sometimes are afraid of adding more annotations 01:25:46.960 |
because they see that as a kind of a finished work. And so they don't want to pollute that. 01:25:51.680 |
And especially if it's like a silly question. I don't think that's good. I think we should as 01:25:59.280 |
much as possible try to lower the barrier for someone to jump in and ask questions. I think 01:26:04.720 |
most of the times it adds value, but it's some feedback that we got from users and readers. 01:26:10.480 |
I'm not exactly sure how to kind of fight that, but... - Well, I think if I serve as an inspiration 01:26:21.120 |
in any way is by asking a lot of dumb questions and saying a bunch of dumb shit all the time. 01:26:27.840 |
And hopefully that inspires the rest of the other folks to do the same, because that's the only way 01:26:33.600 |
to knowledge, I think, is to be willing to ask the dumb questions. - And there are papers that are 01:26:38.720 |
like... We have a lot of papers on Fermat's where it's just one page, or really short papers. And 01:26:44.800 |
we have like the shortest paper ever published in a math journal, with just a couple of words. 01:26:50.720 |
One of my favorite papers on the platform is actually a paper written by Enrico Fermi. And 01:26:57.600 |
the title of the paper is, I think, is "My Observations at Trinity." So basically Fermi 01:27:02.640 |
was part of the Manhattan Project. So he was in New Mexico when they exploded the first atomic bomb. 01:27:10.240 |
And so he was a couple of miles away from the explosion, and he was probably one of the first 01:27:15.840 |
persons to calculate the energy of the explosion. And so the way he did that was he took a piece of 01:27:22.560 |
paper and he tore down a piece of paper in little pieces. And when the bomb exploded, 01:27:28.560 |
the Trinity bomb was the name of the bomb, he waited for the blast to arrive at where he was. 01:27:33.840 |
And then he threw those pieces of paper in the air, and he calculated the energy based on the 01:27:39.520 |
displacement of the paper, the pieces of paper. And then he wrote a report, which was classified 01:27:45.040 |
until a couple of years ago. One page report, calculating the energy of the explosion. 01:27:50.720 |
- Oh, that's so badass. - And we actually went there and 01:27:53.760 |
kind of unpacked and I think he just mentions basically the energy, and we actually went, 01:27:59.600 |
and one of the annotations is like explaining how he did that. 01:28:02.080 |
- I wonder how accurate he was. - It was maybe, I think, like 20 or 25% off. 01:28:08.240 |
Then there was another person that actually calculated the energy based on images after 01:28:14.640 |
the explosion and the rate at which the mushroom of the explosion expanded. And it's more accurate 01:28:22.240 |
to calculate the energy based on that. And I think it was like 20, 20% off. But it's really 01:28:28.000 |
interesting because Fermi was known for all these, being a master at these back of the envelope 01:28:33.280 |
calculations, all those, like the Fermi problems are well known for that. And it's super interesting 01:28:39.600 |
to see that just one page report, it was also actually classified. And it's interesting because 01:28:45.360 |
a couple of months ago, when the Beirut explosion happened, there was a video circulating of these 01:28:52.000 |
a bride that was doing a photo shoot when the explosion in Beirut happened. And so you can see 01:28:57.200 |
a video of her with the wedding dress, and then the explosion happens and the blast arrives at 01:29:01.440 |
where she was. She was a couple of miles away from the blast. And you can see like the displacement 01:29:07.840 |
of the dress as well. And I actually looked, and that video went viral on Twitter. And I actually 01:29:12.320 |
looked at that video and used the same techniques that Fermi used to calculate the energy of the 01:29:18.240 |
explosion based on the displacement of the dress. And you could actually see where she was at the 01:29:24.240 |
distance from the explosion, because there was a store behind her and you could look the name of 01:29:28.080 |
the store. And so I calculated that. - The distance and then you can... 01:29:32.000 |
- Based on the distance where she was from the explosion and also on the displacement of the 01:29:37.680 |
dress, like because you can, when the blast happens, like you can see the dress going back 01:29:42.160 |
and then going back to the original position. And like, by just looking at how much the dress moved, 01:29:49.200 |
you can estimate the energy of the explosion. - I assume you published this. 01:29:53.760 |
- On Twitter, it was just a Twitter thread. But actually like a lot of people share that and it 01:29:59.920 |
was picked up by a couple of news outlets. - I was hoping it would be like a formal title 01:30:05.920 |
and it would be an archive. - No, no, no, no. 01:30:07.760 |
- It may be submitted. - Just the Twitter thread. But it was 01:30:10.560 |
interesting because it was exactly the same method that Fermi used. 01:30:14.560 |
- Is there something else that jumps to mind? Like, is there something, I know like in terms 01:30:20.480 |
of papers, like I know the Bitcoin paper is super popular. Is there something interesting to be said 01:30:25.600 |
about any of the white papers in the cryptocurrency space? - Yeah, the Bitcoin paper was the first paper 01:30:32.720 |
that we put on for months. - Why that choice as the first paper? 01:30:38.400 |
- This was a while ago and it was one of the papers that I read and then kind of explained it 01:30:46.000 |
to Luiz and there were two other friends that do this journal club with us. And I did some research 01:30:52.400 |
in cryptography as an undergrad. And so it was a topic that I was interested in. But even for me, 01:30:59.040 |
that I had that background, but reading the Bitcoin paper, it took me a few reads to really 01:31:06.400 |
kind of wrap my head around it. It uses very spartan, precise language in a way. It's like, 01:31:13.120 |
you feel like you can't take any word out of it without something falling apart. And it's all 01:31:19.200 |
there. I think it's a beautiful paper and it's very well-written, of course, but we wanted to 01:31:27.920 |
try to make it accessible so that anybody that maybe is an undergrad in computer science could 01:31:32.960 |
go on there and know that you have all the information in that page that you're going to 01:31:40.000 |
need to understand the mechanics of Bitcoin. And so I explain the basic public key cryptography 01:31:48.720 |
that you need to know in order to understand it. I explain, okay, what are the properties of a hash 01:31:53.760 |
function and how they are useful in this context. Explain what a Merkle tree is. So a bunch of those 01:32:00.880 |
basic concepts that maybe if you're reading it for the first time and you're an undergrad and 01:32:04.320 |
you don't know those terms, you're going to be discouraged because maybe, okay, now I have to 01:32:08.720 |
go and Google around until I understand these before I can make progress in the paper. And this 01:32:15.120 |
way it's all there. So there's a magic to... Also to the fact that over time, more people went on 01:32:21.920 |
there and added further annotations. So the idea that the paper gets easier and more accessible 01:32:28.480 |
over time, but you're still looking at the original content the way the author intended it to be. 01:32:35.440 |
But there's just more context and the toughest bits have more in-depth explanations. 01:32:41.920 |
- Okay, I think there's just so many interesting papers there. I remember reading the paper that 01:32:50.400 |
was written by Freeman Dyson on the first time that he explained, he came up with the concept 01:32:57.280 |
of the Dyson sphere and he put that out. Again, it's a one-page paper. And what he explained was 01:33:04.560 |
that eventually if a civilization develops and grows, there's going to be a point where when 01:33:12.480 |
the resources on the planet are not enough for the energy requirements of that civilization. 01:33:18.560 |
So if you want to go, the next step is you need to go to the next star and extract energy from 01:33:24.000 |
that star. And the way to do it is you need to build some sort of cap around the star that 01:33:30.000 |
extracts the energy. So he theorized this idea of the Dyson sphere and he went on to kind of analyze 01:33:37.520 |
how he would build that, the stability of that sphere. Like if something happens, if there's 01:33:42.320 |
like a small oscillation with that fear collapse into the star or no, what would happen? And even 01:33:48.640 |
went on to kind of say that a good way for us to look for signs of intelligent life out there 01:33:55.600 |
is to look for signals of these Dyson spheres. And because according to the law of second law 01:34:01.920 |
of thermodynamics, like there's going to be a lot of infrared radiation that is going to be emitted 01:34:06.480 |
as a consequence of extracting energy from the star. And we should be able to see those signals 01:34:12.480 |
of like infrared if we look at the sky. But all these, like from the introduction of the concept, 01:34:18.000 |
like how to build the Dyson sphere, the problems of like having a Dyson sphere, how to detect how 01:34:23.440 |
that could be used as a signal for intelligence. - Wait, really? That's all in the paper? 01:34:26.880 |
- Yeah, all in one, like one page paper. And it's like, for me, it's beautiful. It's like- 01:34:31.040 |
- Where was this published? - I don't remember. 01:34:33.280 |
- It's fascinating that papers like that could be, I mean, the guts it takes to put that all 01:34:40.320 |
together in a paper form. You know, that kind of challenges our previous discussion of paper. I 01:34:46.320 |
mean, papers can be beautiful. You can play with the format, right? 01:34:49.440 |
- But there's a lot to unpack there. That's like the starting point, but it's beautiful that you're 01:34:56.720 |
able to put that in one page and then people can build on top of that. 01:35:01.360 |
- But the key ideas are there. - Yeah, exactly. 01:35:03.760 |
- What about, have you looked at any of the big seminal papers throughout the history of science? 01:35:09.680 |
Like you look at simple, like Einstein papers. Have any of those been annotated? 01:35:15.600 |
- Yeah, yeah, no. We have some more seminal papers that people will have heard about. You know, 01:35:22.080 |
we have the DNA double helix paper on there. We have the Higgs boson paper. Yeah, there's 01:35:32.560 |
papers that we know that it's, they're not gonna be finding out about them because of us, but it's 01:35:39.920 |
papers that we think should be more widely read and that folks would benefit from having some 01:35:46.000 |
annotations there. And so we also have a number of those. 01:35:48.720 |
- Yeah, a lot of like discovery papers for fundamental particles and all that. We have 01:35:54.240 |
a lot of those on Fermat's library. I would like to, we haven't annotated that one, but I'd like 01:36:00.000 |
to on the Riemann hypotheses. That's a really interesting paper as well. But we haven't 01:36:06.400 |
annotated that one, but there's a lot of like more historical landmark papers on the platform. 01:36:12.400 |
- Have you done Poincaré conjecture with Perlman? 01:36:18.480 |
- That's too much for me, but it's interesting that, you know, and going back to our discussion, 01:36:25.040 |
like the Poincaré paper was like published on archive and it was not on a journal, 01:36:30.800 |
- Yeah, what do you make of that? I mean, he's such a fascinating human being. 01:36:34.720 |
- I mentioned to you offline that I'm going to Russia. He's somebody I'm really- 01:36:40.080 |
- Yeah, well, so I definitely will interview him. And I believe I will, I believe I can, 01:36:48.480 |
I just don't know how to, I know where he lives. So here, okay. My hope is, my conjecture is that 01:36:57.520 |
if I just show up to the house and look desperate enough, that, or threatening enough, 01:37:03.040 |
or some combination of both, that like the only way to get rid of me is to just get the thing done. 01:37:08.960 |
- It's actually interesting that you mentioned that because I, after I, so a couple of weeks 01:37:13.840 |
ago I was searching for like stuff about Perlman online and ended up on this Twitter account of 01:37:20.240 |
this guy that claims to be Perlman's assistant. And he's like, he has been posting a bunch of 01:37:26.800 |
pictures like next to Perlman. You can see like Perlman in a library and he's like next to him, 01:37:31.600 |
like taking a selfie or like Perlman walking on the street and like, maybe you could reach out to 01:37:36.960 |
this assistant then I'll send you this Twitter account. 01:37:43.360 |
- No, but going back to like Perlman is super interesting because the fact that he published 01:37:48.160 |
the proofs on archive is what was also like a way for him to, because he really didn't like 01:37:55.120 |
the scientific publishing industry and the fact that you had to pay to get access to articles. 01:38:02.080 |
And that was a form of like protest. And that's why he published those papers there. I mean, 01:38:07.760 |
I think Perlman is just a fascinating like character. And for me, it's this kind of ideal of, 01:38:13.360 |
platonic ideal of what a mathematician should be. It's someone that is, it's just cares about, 01:38:20.640 |
deeply cares about mathematics. It cares about fair attribution of disregards money. And like 01:38:30.080 |
the fact that he published like on archive is a good example of that. 01:38:33.840 |
- What about the Fields Medal that he turned down the Fields Medal? What's, what's, 01:38:39.360 |
- Yeah, I mean, if you look at like the reasons why he rejected the Fields Medal. So after, 01:38:44.160 |
so Perlman did a postdoc in the US and when he came back to Russia. 01:38:54.800 |
- Especially given lectures in American universities. 01:38:59.600 |
- Well, certainly not listen, but I haven't been able to get anybody. Cause I know a lot of people 01:39:03.840 |
have been to those lectures. I'm not able to get a sense of like, yeah, but how strong is the 01:39:09.920 |
accent? What are we talking about here? Is this going to have to be in Russian? Is it going to 01:39:13.120 |
have to be in English? It's fascinating. But he writes the papers in English. 01:39:16.080 |
- True. But there's so many, like such a fascinating character. And there are a couple 01:39:21.920 |
of examples like him, like at, I think 28 or 29, he proved like a really famous conjecture called 01:39:27.920 |
the Salk Conjecture, I believe it was like in a very short four page proof of that. It was a 01:39:32.400 |
really big breakthrough. Then he went to Princeton to give a lecture on that. And after the lecture, 01:39:38.240 |
the chair of the math department at Princeton, a guy called Peter Sarnak, went up to Perlman, 01:39:44.640 |
was trying to recruit him, trying to offer him a position at Princeton. And at some point he asked 01:39:51.520 |
for Perlman's resume. And Perlman responded saying, just gave a lecture on like this 01:39:57.920 |
really tough problem. Why do you need my resume? Like, I'm not going to send you, like I just 01:40:02.880 |
proved my value. But going back to the Fields Medal, like when Perlman went to back to Russia, 01:40:11.760 |
he arrived at a time where the salary of postdocs were so much off in regards to inflation that they 01:40:20.400 |
were not making any money. Like people didn't even bother to pick up the checks at the end of the 01:40:25.920 |
month because they were just like ridiculous. But thankfully he had some money that he had 01:40:30.560 |
gained while he was doing his postdocs. So he just concentrated on like the Poincare conjecture 01:40:37.600 |
problem, which he, when he took that, he took it after it was reframed by this mathematician 01:40:44.640 |
called Richard Hamilton, which posed the problem in a way that it turned into this super like math 01:40:50.720 |
Olympiad problem with like perfect boundaries, well-defined, and that was perfect for Perlman 01:40:56.160 |
to attack. And so he spent like seven years working on that. And then in 2002, he started 01:41:01.520 |
publishing those papers on archive. And people started jumping on that, reading those papers. 01:41:07.840 |
And there was like a lot of excitement around that. A couple of years later, there were two 01:41:12.960 |
researchers, I believe it was, they were from Harvard that took Perlman's work. They sanded 01:41:19.680 |
some of the edges and they republished that saying that, you know, based on Perlman's work, 01:41:26.080 |
they were able to figure out the Poincare conjecture. And then there was at the time at 01:41:32.560 |
the international conference of mathematics in 2006, I believe that's when they were going to 01:41:40.080 |
give out the Fields Medal. There was a lot of debate of like, oh, who's like, who should get 01:41:45.920 |
the credit for solving this big problem. And for Perlman, it felt really sad that people were even 01:41:54.240 |
considering that he was not the person that solved that. And the claims that those like researchers, 01:42:01.680 |
when they published after Perlman, they were false claims that they were the ones, they just 01:42:05.440 |
sanded a couple of edges, like Perlman did all the really hard work. And so just the fact that 01:42:12.080 |
they doubted that Perlman had done that, like was enough for him to say, I'm not interested in this 01:42:17.680 |
prize. And that was one of the reasons why he rejected the Fields Medal. Then he also rejected 01:42:22.960 |
the Clay prize. So the Poincare conjecture was one of the millennium prizes. There was a million 01:42:28.800 |
dollar prize associated with that problem. And that has to add to do with the fact that for them 01:42:34.080 |
to attribute that prize, I think it had to be published on a journal. The proof. And again, 01:42:39.360 |
Perlman's principles of like interfered here. And he also just didn't care about the money. He was 01:42:46.080 |
like, Clay, I think was a businessman and he's like, doesn't have to do anything with mathematics. 01:42:51.760 |
I don't care about these. Like, that's one of the reasons why he rejected it. 01:42:55.920 |
- Yeah, it's hard to convert into words, but at MIT, I'm distinctly aware of the distinction 01:43:05.440 |
between when I enter a room, there's a certain kind of music to the way people talk when we're 01:43:10.960 |
talking about ideas versus what that music sounds like when we're talking, when it's like bickering 01:43:21.200 |
in the space of like, whether it's politics or funding or egos, it's a different sound to it. 01:43:30.080 |
And I'm distinctly aware of the two. And I kind of, sort of, to me personally, happiness was just 01:43:38.960 |
like swimming around the one that like is the political stuff or the money stuff and all that, 01:43:46.080 |
or egos. And I think that's probably what Perlman is as well. Like the moment he senses there's any, 01:43:53.440 |
as with the Fields Medal, like the moment you start to have any kind of drama around credit 01:43:59.440 |
assignment, all those kinds of things, it's almost not that it's important who gets the credit. It's 01:44:04.560 |
like the drama in itself gets in the way of the exploration of the ideas or the fundamental thing 01:44:10.000 |
that makes science so damn beautiful. - And you can really see that this is also 01:44:14.240 |
a product of that Russian school of like doing science. And you can see that people were, 01:44:21.760 |
during the Cold War, a lot of mathematicians, they were not making any money. They were doing math 01:44:27.760 |
for the sake of math, like for the intellectual pleasure of like solving a difficult problem. 01:44:33.600 |
And even if it was a flawed system and there were a lot of problems with that, they were able to 01:44:41.200 |
actually achieve these. And there were a lot of, and Perlman for me is the perfect product of that. 01:44:46.560 |
He just cared about like working on tough problems. He didn't care about anything else. It was just 01:44:51.920 |
math, pure math. - Yeah, there's a, like for the broader 01:44:57.120 |
audience, I think another example of that is like professional sports versus Olympics. Especially 01:45:03.680 |
in Russia, I've seen that clear distinction where, because the state manages so much of the Olympic 01:45:11.840 |
process in Russia, as people know, with the steroids, yes, yes, yes. But outside of steroids 01:45:17.280 |
thing is like the athlete can focus on the pure artistry of the sport. Like not worry about the 01:45:27.440 |
money, not just in the way they talk about it, the way they think about it, the way they define 01:45:32.400 |
excellence versus like in the, perhaps a bit of a capitalist system in the United States with 01:45:39.040 |
American football, with baseball, basketball, so much of the discussion is about money. 01:45:47.040 |
Now, of course, at the end of the day, it's about excellence and artistry and all of that. 01:45:52.400 |
But when the culture is so richly grounded in discussions of money and 01:46:00.240 |
sort of this capitalistic like merch and businesses and all those kinds of things, 01:46:06.400 |
it changes the nature of the activity. And it's in a way that's hard again to describe in words, 01:46:12.160 |
but when it's purely about the activity itself, it's almost like you quiet down all the noise 01:46:20.400 |
enough to hear the signal, enough to hear the beauty. Like whenever you're talking about the 01:46:25.200 |
money, that's when the marketing people come and the business people, the non-creators come and 01:46:30.640 |
they fill the room and they create drama and they know how to create the drama and the noise, 01:46:35.200 |
as opposed to the people who are truly excellent at what they do, the person in their arena. 01:46:41.120 |
Like when you remove all the money and you just let that thing shine, that's when true excellence 01:46:50.000 |
can come out. And that was of the few things that worked with the communist system, 01:46:55.200 |
the Soviet Union, to me at least, as somebody who loves sport and loves mathematics and 01:47:01.280 |
science, that worked well, removing the money from the picture. 01:47:06.320 |
Not that I'm saying poverty is good for science. There's some level in which not worrying about 01:47:16.880 |
money is good for science. It's a weird, I'm not exactly sure what to make of that because capitalism 01:47:22.560 |
works really damn well, but it's tricky how to find that balance. - One Fields Medalist that is 01:47:32.080 |
interesting to look at, and I think you mentioned it earlier, but it's Cédric Villani, which is, 01:47:36.240 |
might be the only Fields Medalist that is also a politician now. But so it's this brilliant French 01:47:44.400 |
mathematician that won the Fields Medal. And after that, he decided that one of the ways that he 01:47:52.560 |
could have the biggest leverage kind of in pushing science in the direction that he thinks 01:47:59.680 |
science should go would be to try to go into politics. And so that's what he did. And he has 01:48:07.680 |
ran, I'm not sure if he has won any election, but- - I think he's running for mayor of Paris. 01:48:13.440 |
- Paris or something like that. But it's this brilliant mathematician that before winning the 01:48:18.880 |
Fields Medal had only been just a brilliant mathematician. But after that, he decided to 01:48:24.720 |
go into politics to try to have an impact and try to change some of the things that he would complain 01:48:30.320 |
about before. So there's that component as well. - Yeah, and I've always thought mathematics and 01:48:37.840 |
science should be like, like James Bond would, in my eyes, I think be sexier if he did math. 01:48:46.000 |
Like we should, as a society, put excellence in mathematics at the same level as being able to 01:48:52.480 |
kill a man with your bare hands. Like those are both useful features, like that's admirable. It's 01:48:57.600 |
like, oh, like that makes you like, that makes the person interesting. Like being extremely well read 01:49:03.440 |
about history or philosophy, being good in mathematics, being able to kill a man with 01:49:08.160 |
bare hands. Those are all the same in my book. So I think all are useful for action stars. 01:49:13.120 |
And I think the society will benefit for giving more value to that. Like one of the things that 01:49:18.640 |
bothers me about American culture is the, I don't know the right words to use, but like the nerdiness 01:49:27.360 |
associated with science. Like, I don't think nerd is a good word in American culture because 01:49:37.440 |
it's seen as like weakness. There's like images that come with that. And it's fine, you could be 01:49:44.640 |
all kinds of shapes and colors and personalities, but like, to me, having sophisticated knowledge 01:49:52.960 |
in science, being good at math, doesn't mean you're weak. In fact, it could be the very opposite. 01:50:00.560 |
And so it's an interesting thing because it was very much differently viewed in the Soviet Union. 01:50:07.600 |
So I know for sure as an existence proof that it doesn't have to be that way. But it... 01:50:14.080 |
- I also feel like we lack a lot of role models in terms, if you ask people to mention one 01:50:22.400 |
mathematician that they know that is alive today, I think a lot of people would struggle 01:50:26.400 |
to answer that question. - And I also think, I love Neil deGrasse Tyson. 01:50:32.960 |
But there is, having more role models is good. Like different kinds of personalities. He has 01:50:43.520 |
kind of fun and it's very, it's like Bill Nye, the science guy, I don't know if you guys know him. 01:50:49.680 |
- Net spectrum. - That, yeah. But there's not, like 01:50:54.240 |
Feynman is no longer there. Those kinds of personalities. 01:51:00.160 |
Like a seriousness that's like not playful. - Not apologetical. 01:51:06.800 |
- Yeah, exactly. Not apologetic about being knowledgeable. And in fact, the kind of energy 01:51:15.760 |
where you feel self-conscious about not having thought about some of these questions. 01:51:23.280 |
Just like when I see James Bond, I feel bad about that I don't, have never killed a man. Like I need 01:51:30.160 |
to make sure I fix that. That's the way I feel. So that same way, I want to feel like that way 01:51:34.720 |
with Carl Sagan talks, I feel like I need to have that same kind of seriousness about science. Like 01:51:40.000 |
if I don't know something, I want to know it well. What about Terence Tao? He's kind of 01:51:45.680 |
a superstar. What are your thoughts about him? - True. He's probably one of the most famous 01:51:50.160 |
mathematicians alive today. And probably one of, I mean, regardless of like, is of course, 01:51:54.560 |
he won a Fields Medal, is a really smart and talented mathematician. It's also like a big 01:52:04.000 |
inspiration for us, at least for some of the work that we do with Fermat's library. So Terence Tao 01:52:12.240 |
is known for having a big blog and he's pretty open about like his research. And he also, 01:52:18.960 |
he tries to make his work as public as possible through his blog posts. In fact, there's a really 01:52:27.280 |
interesting problem that got solved a couple of years ago. So Tao was working on a problem, 01:52:34.480 |
on an Erdos problem actually. So Paul Erdos was this mathematician from Hungary, and he was known 01:52:42.000 |
for like the Erdos, for a lot of things, but one of the things that he was also known was for the 01:52:47.280 |
Erdos problem. So he was always like creating these problems and usually associating prizes 01:52:52.960 |
with those problems. And a lot of those problems are still open. And there will be, some of them 01:52:58.000 |
will be open for like maybe a couple of hundred years. And I think that's actually an interesting 01:53:02.800 |
hack for him to collaborate with future mathematicians. His name will keep coming up 01:53:09.680 |
for future generations. But so Tao was working on one of these problems called the Erdos discrepancy 01:53:15.440 |
and he published a blog post about that problem and he reached like a dead end. And then all of 01:53:23.920 |
a sudden there was this guy from Germany that wrote like a comment on his blog post saying, 01:53:29.680 |
"Okay, so this problem is like a Sudoku like flavor and some of the machinery that we're using 01:53:36.240 |
to solve Sudoku could be used here." And that was actually the key to solve the Erdos discrepancy 01:53:42.160 |
problem. So there was a comment on his blog. And I think that for me is an example of like how to do, 01:53:47.680 |
again, going back to collaborative science online and the power that it has. But Tao is also like 01:53:55.920 |
pretty public about like some of the struggles of being a mathematician. And even he wrote about 01:54:04.400 |
some of the unintended consequences of having extraordinary ability in a field. And he used 01:54:10.960 |
himself as an example. When he was growing up, he was extremely talented in mathematics from a young 01:54:16.560 |
age. Like Tao was a person, he won a medal in like one of the IMOs at the age I think was a gold 01:54:22.960 |
medal at the age of 10 or something like that. And so he mentioned that when he was growing up, 01:54:28.240 |
like, and especially in college, when he was in a class that he enjoyed, it just came very natural 01:54:34.960 |
for him and he didn't have to work hard to just ace the class. And when he found that the class 01:54:40.160 |
was boring, like it didn't work and he barely passed. I think in college, he almost failed two 01:54:47.600 |
classes. And he was talking about that and how he brought those studying habits or like in existence 01:54:54.880 |
of studying habits when he went to Princeton for his PhD. And in Princeton, when he started kind of 01:55:01.200 |
delving into more complex problems in classes, he struggled a lot because he didn't have that, 01:55:08.320 |
those habits, like he wasn't taking notes and he wasn't studying hard when he faced problems. 01:55:14.720 |
And he almost failed out of his PhD, he almost failed his PhD exam. And it talks about like 01:55:22.960 |
having this conversation with his advisor and the advisor pointing out like, you're not, 01:55:27.040 |
this is not working, you might have to get out of the program. And like how that was a kind of a 01:55:32.640 |
turning point for him. And like it was super important in his career. So I think Tao is also 01:55:39.440 |
like this figure that apart from being just an exceptional mathematician, he's also pretty open 01:55:44.320 |
about what it takes to be a mathematician and some of the struggles of these type of careers. 01:55:50.000 |
And I think that's super important. - In many ways, he's a contributor to open science and open 01:55:55.920 |
humanity. He's being an open human by communicating. Scott Aronson is another in computer 01:56:02.800 |
science world who's a very different style, very different style, but there's something about a 01:56:07.680 |
blog that is authentic and real and just gives us a window into the mind and soul of these brilliant 01:56:16.080 |
folks. So it's definitely a gift. Let me ask you about Fermat's library on Twitter, which, 01:56:22.240 |
I mean, I don't know how to describe it. People should definitely just follow Fermat's library 01:56:27.520 |
on Twitter. I keep following and unfollowing Fermat's library because it's so, it gives, 01:56:36.960 |
when I follow it leads me on down rabbit holes often that are very fruitful, but anyway, so 01:56:48.800 |
the posts you do with the, on Twitter are just these beautiful, are things that reveal some 01:56:55.760 |
beautiful aspect of mathematics. Is there something you could say about the approach there? 01:57:04.480 |
And maybe broadly what you find beautiful about mathematics and then more specifically how 01:57:13.600 |
you convert that into a rigorous process of revealing that in tweet form. 01:57:19.200 |
- That's a good point. I think there's something about math that a lot of the mathematical content 01:57:24.560 |
and papers are like little proofs, has in a way sort of an infinite half-life. What I mean by 01:57:33.760 |
that is that if you look at like Euclid's elements, it's as valid today as it was when it was created 01:57:40.080 |
like 2000 years ago. And that's not true for a lot of other scientific fields. And so in regards to 01:57:48.080 |
Twitter, I think there's also a very, it's a very under-explored platform from a learning perspective. 01:57:56.400 |
I think if you look at content on Twitter, it's very easy to consume. It's very easy to read. 01:58:03.920 |
And especially when you're trying to explain something, we humans get a dopamine hit if we 01:58:12.080 |
learn something new. And that's a very, very powerful feeling. And that's why people go to 01:58:18.800 |
classes when you have a really good professor, it's looking for those dopamine hits. And 01:58:24.640 |
that's something that we try to explore when we're producing content on Twitter. Imagine if we could, 01:58:32.560 |
if you would, on a line to a restaurant, you could go to your phone to learn something new instead of 01:58:37.600 |
going to a social network. And I think it's very hard to sometimes to 01:58:46.000 |
kind of provide that feeling because you need to sometimes digest content and put it in a way that 01:58:54.560 |
it fits 280 characters. And it requires a lot of sometimes time to do that, even though it's easy 01:59:02.400 |
to consume, it's hard to make. But once you are able to provide that Eureka moment to people, 01:59:08.320 |
that's very powerful to get that dopamine hit and you create this feedback cycle and people come 01:59:14.400 |
back for more. And in Twitter, compared to an online course or a book, you have a 0% dropout. 01:59:20.960 |
So people will read the content. So it's part of the creators, the person that is creating the 01:59:28.480 |
content, if you're able to actually get that feedback cycle, it's super, super powerful. 01:59:33.200 |
Yeah. But some of this stuff is like, how the heck do you find that? And I don't know why it's 01:59:39.200 |
so appealing. This is from a, what is it? A couple of days ago. I'll just read out the number. 2, 01:59:49.200 |
3, 4, 5, 6, 7, 8, 9 is the largest prime number with consecutive increasing digits. 01:59:54.720 |
I mean, that is so cool. That's like some weird like glimpse into some deep universal truth, 02:00:04.640 |
even though it's just the number. I mean, that's like so arbitrary. Like why is it so pleasant 02:00:10.880 |
that that's a thing? But it is in some way, it's almost like it is a little glimpse at some 02:00:16.240 |
much bigger like. - And I think like, especially if we're 02:00:20.880 |
talking about science, there's something unique about you go, and with a lot of the tweets, 02:00:26.000 |
you go sometimes from a state of not knowing something to knowing something. And that is very 02:00:30.800 |
particular to science, math, physics, and that again, is extra extremely addictive. And that's 02:00:36.800 |
how I feel about that. And that's why I think people engage so much with our tweets and go 02:00:44.320 |
into rabbit holes. And then they, you start with prime numbers and all of a sudden you are spending 02:00:49.440 |
hours reading number theory things and you go into Wikipedia and you lose a lot of time there. 02:00:55.760 |
- Well, the variety is really interesting too. There's human things, there's physics things, 02:01:01.840 |
there's like numeric things, like I just mentioned, but there's also more rigorous mathematical 02:01:08.400 |
things. There's stuff that's tied to the history of math and the proofs and there's visual, 02:01:13.760 |
there's animations, there are looping animations that are incredible, that reveal something. 02:01:18.800 |
There's Andrew Wiles on being smart. This is just me now, like ignoring you guys and just going 02:01:26.960 |
through your Twitter. - No, yeah, we're a bit like 02:01:28.480 |
math drug dealers. We're just trying to get you hooked. We're trying to give you that hit 02:01:32.960 |
and trying to get you hooked. - Yes, some people are brighter than 02:01:36.400 |
others, but I really believe that most people can really get to quite a good level of mathematics 02:01:42.080 |
if they're prepared to deal with these psychological issues of how to handle the 02:01:47.760 |
- Yeah, there's some truth to that. - That's truth. I feel that's like really, 02:01:51.600 |
it's some truth in terms of research and also about startups. You're stuck a lot of the time 02:01:57.840 |
before you get to a breakthrough and it's difficult to endure that process of like being stuck 02:02:03.440 |
because you're not trained to be in that position. I feel, yeah, that's- 02:02:09.280 |
- Yeah, most people are broken by the stuckness or like they're distracted. I've been very 02:02:16.960 |
cognizant of the fact that more and more social media becomes a thing, like distractions become 02:02:25.520 |
a thing that that moment of being stuck is your mind wants to go do stuff that's unrelated to 02:02:32.960 |
being stuck and you should be stuck. I'm referring to small stucknesses, like you're like trying to 02:02:40.160 |
design something and it's a dead end, basically little dead ends, dead ends of programming, 02:02:44.800 |
dead ends in trying to think through something. And then your mind wants to like, this is the 02:02:51.680 |
problem with this like work-life balance culture is like take a break, like as if taking a break 02:02:59.760 |
will solve everything. Sometimes it solves quite a bit, but like sometimes you need to sit in a 02:03:03.920 |
stuckness and suffer a little bit and then take a break, but you definitely need to be, 02:03:09.040 |
and like most people quit from that psychological battle of being stuck. 02:03:14.640 |
So success is people who persevere through that. - Yeah, and in the creative process, 02:03:22.960 |
that's also true. I was, the other day I was, I think it was reading about, what is his name? 02:03:28.400 |
Ed Sheeran, like the musician was talking a little bit about the creative process and he was using 02:03:33.360 |
this analogy of a faucet, like where you, when you turn on a faucet, it's as like the dirty water 02:03:38.800 |
coming out in the beginning and you just have to keep trusting that at some point your clean, 02:03:44.960 |
clean, clear water will come out, but you have to endure that process. Like in the beginning, 02:03:49.040 |
it's going to be dirty water and just embrace that. - Yeah, actually this, the entirety of my 02:03:56.880 |
YouTube channel and this podcast have been following that philosophy of dirty water. 02:04:01.120 |
Like I've been, you know, I do believe that, like you have to get all the crap out of your system 02:04:07.200 |
first. And sometimes it's all, sometimes it's all crappy work. I tend to be very self-critical, 02:04:14.240 |
but I do think that quantity leads to quality for some people. It does for my, the way my mind 02:04:19.760 |
works is like, just keep putting stuff out there, keep creating and the quality will come as opposed 02:04:27.840 |
to sitting there waiting, not doing anything until the thing seems perfect. Cause the perfect may 02:04:34.240 |
never come. - But just, just on like on, on, on our Twitter profile, I really, and sometimes when 02:04:41.200 |
you look on, on some of those tweets, they might seem like pretty kind of, you know, why is this 02:04:47.120 |
interesting? It's like, so raw, like it's just a number, but I really believe that especially with 02:04:53.200 |
math or physics, it is possible to get everyone to love math or physics, even if you think you hate 02:04:59.680 |
it. It's, it's not a function of the student or the person that is on the other side. I think it's 02:05:04.320 |
just purely a function of like how you explain a hidden beauty that they hadn't realized before. 02:05:10.400 |
It's not easy, but I think it's like, a lot of the times it's on like on the creator's side 02:05:16.560 |
to, to be able to like show that beauty to the other person. - I think some of that is native to, 02:05:22.720 |
to humans. We just have that curiosity and you look at small toddlers and babies and like 02:05:28.160 |
them trying to figure things out. And there's just something that is born with us that we, we, 02:05:32.640 |
we want for that understanding. We want to figure out the world around us. And, and so, 02:05:36.880 |
yeah, it shouldn't be like, whether or not people are going, are going to, to enjoy it. Like 02:05:43.920 |
I, I, I also really believe that everybody has that capacity to fall in love with, with math and 02:05:49.440 |
physics. - You mentioned startup. What do you think it takes to build a successful startup? 02:05:55.360 |
- Yeah, that it's what, what Louise was saying that you need to, to be able to endure being stuck. 02:06:03.840 |
And, and I think the best way to put it is that startups don't have a linear reward function, 02:06:12.160 |
right? You oftentimes don't get rewarded for effort. And, and, and, and most of our lives, 02:06:18.000 |
we go through these processes that do give you those small rewards for effort, right? In school, 02:06:25.360 |
you study hard, generally you'll get a good grade and then you good, you get like good grades ever, 02:06:30.560 |
or you get grades every semester. And so you're, you're slowly getting rewarded and pushed in the 02:06:36.160 |
right direction for, for startups and startups are, are not the only thing that is like this, 02:06:41.600 |
but for startups, it's, you know, you can put in a ton of effort into something that, and then get 02:06:47.280 |
no reward for it, right? It's, it's like, like Sisyphus boulder, where you were pushing that 02:06:51.920 |
boulder up the mountain and, and, and you get to the top and then it just rolls all the way back 02:06:57.760 |
down. And, and so that's something that I think a lot of people are not equipped to deal with and 02:07:03.120 |
can be incredibly demoralizing, especially if that happens more than, than a few times. 02:07:09.200 |
And so, but I think it's absolutely essential to, to power through it because by the nature 02:07:16.160 |
of startups, it's oftentimes, you know, you're dealing with, with, with non-obvious ideas and 02:07:22.240 |
things that there might be contrarian. And so you're gonna, you're gonna run into, into that a 02:07:28.320 |
lot. You're going to do things that are not going to work out and you need to be prepared to deal 02:07:33.120 |
with that. But, but if we're not coming out of college, you're, you're just not equipped. 02:07:37.600 |
I'm not sure if there's a way to train people to deal with those non-linear reward functions, 02:07:44.080 |
but it's definitely, I think one of the most difficult things to, you know, about doing a 02:07:49.360 |
startup. And also happens in research sometimes, you know, we're talking about the default state 02:07:53.760 |
is being stuck. You just, you know, you don't like, you try things, you get zero results, 02:07:58.240 |
you close doors, you constantly closing doors until you, you know, find something. And yeah, 02:08:04.640 |
that is a big thing. - What about sort of this point when you're stuck, there's a kind of decision, 02:08:10.160 |
whether you, if you have a vision to persist through, through with this direction that you've 02:08:15.760 |
been going along or what a lot of startups do or businesses is pivot. How do you decide whether 02:08:22.720 |
like to give up on a particular flavor of the way you've imagined the design and to like adjust it 02:08:31.520 |
or completely like alter it? - I think that's a core question for startups that I've asked 02:08:39.200 |
myself exactly. And like, I've never been able to come up with a great framework to make those 02:08:44.640 |
decisions. I think that's really at the core of, yeah, out of a lot of the toughest questions that 02:08:52.320 |
people that started a company have to deal with. - I think maybe the best framework that I 02:08:59.280 |
was able to figure out is like when you run out of ideas, you just, you know, you're exploring 02:09:05.920 |
something is not working, you try it in a different angle, you know, we try a different business model. 02:09:11.760 |
When you run out of ideas, like you don't have any more cards, just switch. And yeah, 02:09:18.560 |
it's not perfect because you also, it's, you have a lot of stories of startups was like, 02:09:23.520 |
people kept pushing and then, you know, that paid off. And then you have philosophies, 02:09:30.240 |
there's like fail fast and pivot fast. So it's, you know, it's hard to, you know, 02:09:36.080 |
balance these two worlds and understand what is the best framework. - And I mean, 02:09:41.120 |
if you look at Fermat's library, you're maybe you can correct me, but it feels like you're 02:09:46.800 |
operating in a space where there's a lot of things that are broken or could be significantly improved. 02:09:54.960 |
So it feels like there's a lot of possibilities for pivoting or like, how do you revolutionize 02:10:01.200 |
science? How do you revolutionize the aggregation, the annotation, the commenting, the community 02:10:09.360 |
around information of knowledge, structured knowledge? I mean, that's kind of what like 02:10:13.840 |
Stack Overflow and Stack Exchange has struggled with to come up with a solution. And they've come 02:10:19.520 |
up, I think, with an interesting set of solutions that are also, I think, flawed in some ways, 02:10:24.160 |
but they're much, much better than the alternatives. But there's a lot of other 02:10:28.560 |
possibilities. If we just look at papers, as we talked about, there's so many possible revolutions 02:10:33.120 |
and there are a lot of money to be potentially made as revolutions, plus coupled with that, 02:10:38.480 |
the benefit to humanity. And so like you're sitting there, like, I don't know how many 02:10:43.760 |
people are legitimately from a business perspective, playing with these ideas. It feels like 02:10:48.880 |
there's a lot of ideas here. - True, it varies. 02:10:51.360 |
- Are you right now grinding in a particular direction? Like, is there a five-year vision 02:10:56.960 |
that you're thinking in your mind? - For us, it's more like a 20-year vision 02:11:02.000 |
in the sense that we've consciously tried to make the decision of... So we run Formats as 02:11:10.800 |
it's a side project. And it's a separate in the sense like it's not what we're working on full 02:11:15.840 |
time. But our thesis there is that we actually think that that's a good thing, at least for 02:11:25.280 |
this stage of Formats library. And also because some of these projects, if you're coming from 02:11:32.960 |
a startup framework, you probably try to fit every single idea into something that can change 02:11:41.920 |
the world within three to five years. And there's just some problems that take longer than that. 02:11:47.280 |
Right? And so, we were talking about archive and I'm very doubtful that you could grow 02:11:52.560 |
like archive into what it is today, like within two or three years, no matter how much money you 02:11:57.840 |
throw at it, there's just some things that can take longer. But you need to be able to power 02:12:02.560 |
through the time that it takes. But if you look at it as, "Okay, this is a company, this is a 02:12:09.520 |
startup, we have to grow fast, we have to raise money," then sometimes you might forego those 02:12:15.040 |
ideas because of that, because they don't very well fit into the typical startup framework. 02:12:23.680 |
And so, for us, Formats, it's something that we're okay with having it grow slowly and maybe taking 02:12:30.720 |
many years. And that's why we think it's not a bad thing that it is a side project, because it 02:12:36.800 |
makes it much more acceptable in a way to be able to be okay with that. 02:12:44.000 |
- That said, I think what happens is if you keep pushing new little features, 02:12:49.040 |
new little ideas, I'd feel like there's like certain ideas will just become viral. 02:12:53.760 |
And then you just won't be able to help yourself, but it'll revolutionize things. It feels like 02:12:59.920 |
there needs to be, not needs to be, but there's opportunity for viral ideas to change science. 02:13:08.880 |
- And maybe we don't know what those are yet. It might be a very small kind of thing. 02:13:13.280 |
- Maybe you don't even know if, should this be a for-profit company doing this? 02:13:19.040 |
- Yeah. There are a lot of questions, like really fundamental questions about this space 02:13:26.800 |
- I mean, you take Wikipedia and you try to run it as a startup, and by now it would have a paywall, 02:13:31.120 |
you'd be paying 9.99 a month to read more than 20 articles. 02:13:34.320 |
- I mean, that's one view. The other, the ad-driven model. So, they rejected the ad-driven model. 02:13:42.640 |
I don't know if we could, I mean, this is a difficult question. If Archive was supported 02:13:47.600 |
by ads, I don't know if that's bad for Archive. If Fermat's Library was supported by ads, 02:13:54.160 |
I don't know. It's not trivial to me. Unlike, I think a lot of people, 02:13:59.760 |
I'm not against advertisements. I think ads, when done well, are really good. I think the problem 02:14:06.320 |
with Facebook and all the social networks are the way, the lack of transparency around the way they 02:14:11.280 |
use data and the lack of control the users have over their data. Not the fact that data is being 02:14:17.120 |
collected and used to sell advertisements. It's the lack of transparency, lack of control. If you 02:14:22.880 |
do a good job of that, I feel like it's a really nice way to make stuff free. 02:14:26.560 |
- Yeah. It's like Stack Overflow, right? I mean, I think they've done a good job with that, 02:14:32.720 |
even though, as we said, they're capturing very little of the value that they're putting out 02:14:37.120 |
there, right? But it makes it a sustainable company and they're providing a lot of, it's a 02:14:44.160 |
fantastic and very productive community. - Let me ask a ridiculous tangent of a 02:14:49.920 |
question. Luis, you wrote a paper on Game of Thrones, Battle of Winterfell, just as a side 02:14:56.560 |
little, I'm sorry, I noticed, I'm sure you've done a lot of ridiculous stuff like this. I just 02:15:00.800 |
noticed that particular one. By ridiculous, I mean ridiculously awesome. Can you describe 02:15:06.560 |
the approach in this work, which I believe is a legitimate publication? 02:15:10.240 |
- So going back to the original, like when we were talking about the backstory of papers and 02:15:16.480 |
the importance of that. So this is actually, when the last season of the show was airing, 02:15:22.880 |
this was during a company lunch. In the last season, there's a really big battle against the 02:15:32.800 |
forces of evil and the forces of good. And this is called the Battle of Winterfell. And in this 02:15:41.840 |
battle, there are like these two armies, and there's a very particular thing that they have 02:15:46.400 |
to take into account is that in the army of dead, like if someone dies in the army of the living, 02:15:52.320 |
like that person is gonna be reborn as a soldier in the army of the dead. And so that was an 02:16:00.880 |
important thing to take into account. - And the initial conditions, as you 02:16:04.000 |
specify, it's about 100,000 on each side. - Exactly. So I was able to, like based on 02:16:08.800 |
some images, like on previous episodes to figure out what was the size of the armies. And so what 02:16:13.040 |
I want, what we wanted to, what we were theorizing was like, how many soldiers does like a soldier on 02:16:19.840 |
the army of the living has to kill in order for them to be able to destroy the army of the dead 02:16:26.880 |
without like losing, because every time one of the good soldiers dies, it's gonna turn into like 02:16:32.880 |
the other side. And so we were theorizing that and I wrote a couple of differential equations, 02:16:39.760 |
and I was able to figure out that based on the size of the armies, I think was the ratio had to 02:16:44.400 |
be like 1.7. So it had to kill like 1.7 soldiers of like the army of the dead in order for them to 02:16:51.040 |
win the battle. - Yeah, that's science. It's the most powerful. And this is also somehow a pitch 02:17:00.000 |
for like a hiring pitch in a sense like this is the kind of important science you do at lunch. 02:17:07.360 |
- Exactly. Well, it turned out to be, you know, as for people that have watched these shows, 02:17:12.640 |
it's like they know that every time you try to predict something that is gonna happen, 02:17:16.640 |
it's gonna, you're gonna fail miserably. And that's what happened. So it was not at all important 02:17:21.440 |
for the show, but we ended up like putting that out and there was a lot of people that shared 02:17:26.960 |
that. I think it was some like elements of the show, the cast of the show that actually retweeted 02:17:32.400 |
that and shared that. So it was fun. - I would love if this kind of calculation 02:17:36.960 |
happened like during the making of the show or, you know, I love it like in, 02:17:44.640 |
for example, I now know Alex Garland, the director of Ex Machina, and I love it. And he doesn't seem 02:17:51.200 |
to be some, not many people seem to do this, but I love it when directors and people who wrote the 02:17:58.560 |
story really think through the technical details. Like whether it's knowing like how things, even 02:18:05.120 |
if it's science fiction, if you were to try to do this, how would you do this? Like Stephen Wolfram 02:18:10.960 |
and his son were collaborating with the movie Arrival in designing the alien language of how 02:18:18.320 |
you communicate with the aliens. Like how would you really have a math-based language that could 02:18:25.680 |
span the alien and being and the human being? So I love it when they have that kind of rigor. 02:18:33.280 |
- The Martian was also big on that. Like the book and the movie was all about like, 02:18:36.960 |
can we actually, is this plausible? Can this happen? It was all about that. 02:18:42.240 |
- And that can really bring you in. Like sometimes the small details, I mean, 02:18:47.360 |
the guy that wrote the Martian book is another book that is also filled with those like things 02:18:53.440 |
that when you realize that, okay, these are grounded in science can just really bring you in. 02:18:58.960 |
- Yeah. - Like he has a book about a colony on- 02:19:02.800 |
- A colony on the moon. And he goes about like all the details that would be required about 02:19:07.600 |
setting up a colony in the moon and like things that he wouldn't think about. Like the fact that 02:19:12.800 |
they would, it's hard to bring like air to the moon so they wouldn't like, how do you make that 02:19:20.400 |
breathable, that environment breathable? You need to bring oxygen, but like you probably wouldn't 02:19:26.480 |
bring nitrogen. So what you do is like, instead of having an atmosphere that is a hundred percent 02:19:33.280 |
oxygen, you like decrease the pressure so that you have the same ratio of oxygen on earth, but like 02:19:39.040 |
lowering the pressure here. And so like things like water boils at the lower temperature. So 02:19:45.120 |
people would have coffee and the coffee would be colder. Like there was a problem in this environment 02:19:50.880 |
in the moon. So like, and these are like small things in the book, but I studied physics. So 02:19:56.880 |
like when I read these, that throws me into like tangents and I start researching that. And it's 02:20:04.080 |
like, I really like to read books and watch movies when they go to that level of detail about science. 02:20:11.840 |
- Yeah. I think Interstellar was one where they also consulted heavily with a number of people. 02:20:15.840 |
- Yeah, yeah, yeah. - I think even resulted in- 02:20:17.440 |
- In a couple of papers. - A couple of papers about like the black 02:20:19.840 |
hole visualizations and yeah. But there's even more examples of interesting science around like 02:20:27.840 |
these fantasy. We were reading at some point, like these guys that were trying to figure out if 02:20:34.080 |
the Tolkien's middle earth, if it was round, if it was like a sphere, if it was like a flat earth. 02:20:43.040 |
- Based on the map. - Based on the map and some of the 02:20:46.160 |
references in the books. And so- - Yeah, we actually, I think we tweeted 02:20:53.040 |
- Based on the distance between the cities, you can actually prove that that could be like a map 02:20:58.480 |
of a sphere or like a spheroid and you can actually calculate the radius of that planet. 02:21:03.760 |
- That's fascinating. I mean, yeah, that's fascinating. But there's something about like 02:21:12.080 |
calculating the number, like exactly the calculation you did for the battle of Winterfell 02:21:20.320 |
is something fascinating about that because that's not like being, that's very mathematical versus 02:21:27.520 |
grounded in physics. And that's really interesting. I mean, that's like injecting mathematics into 02:21:34.720 |
fantasy. There's something magical about that. - I see what you're saying. And that for me, 02:21:40.320 |
that's why I think it's also when you look at things like Fermat's Last Theorem, like problems 02:21:47.200 |
that are very kind of self-contained and simple to state. I think like that's the same with that 02:21:52.640 |
paper. It's very easy to understand the boundaries of the problem. And that for me, that's why math 02:22:01.360 |
is so appealing. And those like problems are also so appealing to the general public. It's not that 02:22:06.400 |
they look simple or that people think that they're easy to like solve, but I feel that a lot of the 02:22:12.400 |
times they are almost intellectually democratic because everyone understands the starting point. 02:22:18.240 |
You know, you look at Fermat's Last Theorem, everyone understands like, this is the universe 02:22:24.000 |
of the problem. And the same, maybe with that paper, everyone understands, okay, these are 02:22:27.440 |
the starting conditions. And yeah, the fact that it becomes intellectually democratic, and I think 02:22:34.640 |
that's a huge motivation for people. And that's why so many people gravitate towards these like 02:22:40.160 |
Riemann hypotheses or Fermat's Last Theorem or that simple paper, which is like just one page. 02:22:44.560 |
It was very simple. - And I just talked to somebody, I don't know if you know who he is, 02:22:48.880 |
Jocko Willink, who is this person who among many things loves military tactics. So he would 02:22:58.080 |
probably either publish a follow-on paper, maybe you guys should collaborate, but he would see the 02:23:04.000 |
fundamental, the basic assumptions that he started that paper with is flawed because, you know, 02:23:08.960 |
there's like dragons too, right? There's like, you have to integrate tactics because it's not 02:23:15.280 |
a homogeneous system. - I don't take into account the dragons and like- - And he would say tactics 02:23:20.880 |
fundamentally change the dynamics of the system. And so like- - That's what happened. (laughs) 02:23:26.280 |
- So yeah, so at least from a scientific perspective, he was right, but he never 02:23:31.840 |
published, so there you go. Let me ask the most important question. You guys are from 02:23:36.400 |
Portugal, both? - Yeah. - So who is the greatest soccer player, footballer of all time? 02:23:44.320 |
- Yeah, I think we're a little bit biased on this topic, but I mean- - Maradona? 02:23:50.160 |
- I have a huge, I have a tremendous respect for what- - Here we go. 02:23:58.960 |
- We can convince you. - I mean, I have tremendous respect for what 02:24:03.440 |
Ronaldo has achieved in his career. And I think soccer is one of those sports where I think you 02:24:08.960 |
can get to maybe be one of the best players in the world if you just have like natural talent. 02:24:15.520 |
And even if you don't put a lot of hard work and discipline into soccer, you can be one of 02:24:20.880 |
the best players in the world. And I think Ronaldo is kind of like, of course, he's naturally talented, 02:24:27.200 |
say the football from Portugal. - Exactly, from Portugal, and not the 02:24:32.320 |
Brazilian in this case. And Ronaldo came from nothing. He's known from being probably one of 02:24:38.960 |
the hardest working athletes in the game. And I see that sometimes a lot of these discussions about 02:24:44.320 |
the best player, a lot of people tend to gravitate towards like, this person is naturally talented 02:24:51.760 |
and the other person has to work hard. And so, as if it was bad, if he had to work hard to be good 02:24:59.360 |
at something. And I think so many people fall into that trap. And the reason why so many people fall 02:25:07.520 |
into that trap is because if you're saying that someone is good and achieved a lot of success by 02:25:14.480 |
working hard, as opposed to achieving success because he has some sort of God-given natural 02:25:20.000 |
talent that you can't explain why the person was born with that. What does it tell you about you? 02:25:25.040 |
It tells you that maybe if you work hard on a lot of fields, you could accomplish a lot of great 02:25:31.280 |
things. And I think that's hard to digest for a lot of people. - So in that way, Ronaldo is 02:25:37.600 |
- So you find hard work inspiring, but he's way too good looking. That's the- 02:25:45.120 |
- No, I like the part of the hard work and like of him being like one of the hardest working 02:25:49.760 |
athletes in soccer. - So he is to you the greatest of all time? 02:25:53.920 |
Is he up to, is he will be number, okay. - I agree. 02:25:59.360 |
- Well, I definitely disagree. I mean, I like him very much. He works hard. I admire, 02:26:04.880 |
I admire, you know, what, like he's incredible goal scorer, right? But I, 02:26:17.440 |
so first of all, Leo Messi, and there was some confusion because I've kept saying Maradona is 02:26:22.000 |
my favorite player, but I think Leo has surpassed them. So it's Messi, then Maradona, then 02:26:30.800 |
Pelé for me. But the reason is, there's certain aesthetic definitions of beauty that I admire, 02:26:40.400 |
whether it came by hard work or through God given talent or through anything. It doesn't really 02:26:46.720 |
amount to me. There's certain aesthetic, like genius when I see it to me, and especially it 02:26:53.840 |
doesn't have to be consistent. It is in the case of Messi, in the case of Ronaldo, but just even 02:26:59.360 |
moments of genius, which is where Maradona really shines. - Even if that doesn't translate into like 02:27:06.640 |
results and goals being scored. - Right, right. And that's the challenge. 02:27:10.400 |
I'm like, I did that because that's where people that tell me that Leo Messi's never, 02:27:17.520 |
even on strong teams have led his, the national team, people aspire to the world cup, right? 02:27:24.080 |
That's really important. And to me, no, it's the moment, like winning to me was never important. 02:27:29.680 |
What's more important is the moments of genius. And, but you're talking to the human story and 02:27:40.000 |
yeah, Cristiano Ronaldo definitely has a beautiful human story. 02:27:42.640 |
- Yeah. And I think you can't, for me, it's hard to decouple those two. I don't just look at the 02:27:49.920 |
list of achievements, but I like how he got there and how he keeps pushing the boundaries. It's like 02:27:54.960 |
almost 40 and how that sets up an example. Like maybe 10 years ago, I wouldn't have ever imagined 02:28:00.960 |
that like one of the top players in the world could be a top player at like 37 or. 02:28:05.280 |
- But so, and there's an interesting, the human story is really important, but like, 02:28:09.520 |
if you look at Ronaldo, he's like, he's somebody like kids could aspire to be. But at the same 02:28:16.240 |
time, I also like Maradona who like is a tragic figure in many ways. It's like the, you know, 02:28:23.280 |
the drugs, the temper, all of those things, that's beautiful too. Like I don't necessarily think to 02:28:29.920 |
me, the flaws are beautiful too. And athletes, I don't think you need to be perfect from a 02:28:39.120 |
personality perspective. Those flaws are also beautiful. So, but yeah, there is something about 02:28:44.720 |
hard work and there's also something about the being an underdog and being able to carry a team. 02:28:52.640 |
That's an argument from Maradona. I don't know if you can make that argument for Messi and Ronaldo 02:28:59.040 |
either, 'cause they've all played on superstar teams for most of their lives. So I don't know 02:29:05.760 |
how, you know, it's difficult to know how they would do when they had to work, like did what 02:29:15.200 |
Maradona had to do to carry a team on his shoulders. And Pelé did as well, depending on the 02:29:22.000 |
context. - Maybe you could argue that with the Portuguese national team, but we have a good team. 02:29:26.720 |
Yeah, but maybe with what Maradona did with, you know, Naples and a couple other teams, 02:29:33.200 |
it's incredible. - It speaks to the beauty of the game that, you know, we're talking about 02:29:37.040 |
all these different players that have, or especially, you know, if you're comparing 02:29:41.040 |
Messi and Ronaldo, they have such different, you know, styles of play and also even 02:29:45.280 |
their bodies are so different. But these two very different players can be at the top of the game. 02:29:54.160 |
And that's not, there are not a lot of other sports where you have that, you know, like you 02:30:00.000 |
have kind of a mental image of a basketball player and like the top basketball players kind of fit 02:30:06.400 |
that mental image and they look a certain way. But for soccer, there's, it's not so much like 02:30:15.120 |
that. And I think that's beautiful, but that really adds something to the sport. - Well, 02:30:21.920 |
do you play soccer yourself? Have you played that in your life? What do you find beautiful 02:30:26.160 |
about the game? - Yeah, I mean, it's one of the, I'd say it's the biggest sport in Portugal. And 02:30:30.960 |
so growing up, we played a lot. - Did you see the paper from DeepMind? I didn't look at it, 02:30:35.520 |
where they're like doing some analysis on soccer strategy. - Yeah, interesting. - I saved that 02:30:42.160 |
paper. I haven't read it yet. It's actually, when I was in college, I actually did some 02:30:49.680 |
research on applying machine learning and statistics in sports. In our case, we're doing 02:30:57.440 |
it for basketball. But what they're effectively trying to do was, have you ever watched Moneyball? 02:31:05.440 |
So they're trying to do something similar, right? Taking that, in this case, basketball, 02:31:10.640 |
taking a statistical approach to basketball. The interesting thing there is that baseball is much 02:31:17.840 |
more about having these discrete events that happen kind of in similar conditions. And so it's easier 02:31:22.960 |
to take a statistical approach to it. Whereas basketball, it's a much more dynamic game. 02:31:28.080 |
It's harder to measure. It's hard to replicate these conditions. And so you have to think about 02:31:36.800 |
it in a slightly different way. And so we were doing work on that and working like with the 02:31:41.600 |
Celtics to analyze the data that they had. Like they had these cameras in the arena, they were 02:31:46.720 |
tracking the players. And so they had a ton of data, but they didn't really know what to do with 02:31:51.120 |
it. And so we were doing work on that. And soccer is maybe even a step further. It's a game where 02:31:58.960 |
you don't have as many... In basketball, you have a lot of field goals. And so you can measure 02:32:03.360 |
success. Soccer, it's more of a poisson process almost, where it's like you have a goal or two 02:32:11.120 |
in a game. In terms of metrics, I wonder if there's a way... And I've actually have thought 02:32:15.120 |
about this in the past, never coming up with any good solution. If there's a way to definitively 02:32:19.840 |
say whether it's messy or not, they're the greatest of all time. Honestly, measure, 02:32:24.160 |
like convert the game of soccer into metrics, like you said, baseball. But those moments of genius, 02:32:30.880 |
if it's just about goals or passes that led to goals, that feels like it doesn't capture 02:32:38.800 |
the genius of the play. You have more metrics, for instance, in chess, and you can try to 02:32:48.080 |
understand how hard of a move that was. There's Bobby Fischer has this move that I think it's 02:32:56.080 |
called the move of the century, where you have to go so deep into the tree to understand that 02:33:01.680 |
that was the right move and you can quantify how hard it was. So it'd be interesting to try to 02:33:06.560 |
think of those types of metrics, but say, yeah, for soccer. - Computer vision unlocks some of that 02:33:10.720 |
for us. That's one possibility. - I have a cool idea, a computer vision product, Lex, that you 02:33:15.840 |
could build for soccer. - Let's go. I'm taking notes. - If you could detect the ball and imagine 02:33:22.640 |
that... It seems totally doable right now, but if you could detect when the ball enters one of the 02:33:29.120 |
goals and just had a crowd cheering for you when you're playing soccer with your friends, every 02:33:35.680 |
time you score a goal, or you had the Champions League song going on, and having that, you go play 02:33:42.080 |
soccer with your friends, you just turn that on and there's a computer vision program analyzing 02:33:46.320 |
the ball. - Detects the ball. - Detects the ball every time there's a goal. If you miss, there's 02:33:49.680 |
the fans are reacting to that. - And then... - Should be pretty simple by now. I think there's 02:33:55.280 |
an opportunity there. Just throwing that. - I'm going to go all out. By the way, I did... 02:33:59.920 |
I've never released... I was thinking of just putting it on GitHub, but I did write exactly 02:34:04.640 |
that, which is the trackers for the players, for the bodies of the player. This is the hard part, 02:34:10.240 |
actually. The detection of player bodies and the ball is not hard. What's hard is very robust 02:34:19.040 |
tracking through time of each of those. So I wrote a tracker that's pretty damn good. - Is that 02:34:26.560 |
open source? You open source? - No, I've never released it. - Interesting. - Because I thought 02:34:31.920 |
I need to... This is the perfection thing, because I knew it was going to be like... 02:34:37.520 |
It's going to pull me in and it wasn't really that done. And so I've never actually been part 02:34:44.080 |
of a GitHub project where it's like really active development. - Interesting. - And I didn't want 02:34:48.080 |
to make it... I knew there's a non-zero probability that it will become my life for like a half a 02:34:52.720 |
year. That's just how much I love soccer and all those kinds of things. And ultimately, it will be 02:34:58.800 |
all for just the joy of analyzing the game, which I'm all for. - I remember you also, in one of the 02:35:05.760 |
episodes, you mentioned that you did also a lot of eye tracking analysis on Joe Rogan. - That was 02:35:11.040 |
the research side of my life. - Interesting. And you have that library, right? You kind of 02:35:15.280 |
downloaded all the episodes. - Allegedly. And of course I didn't, if you're a lawyer and listening 02:35:20.720 |
to this. - I was listening to the episode where you mentioned that and I was actually... There 02:35:25.200 |
was something that I might ask you for access to that, to allegedly that library. But I was 02:35:32.080 |
doing some, not regarding like eye tracking, but I was playing around with analyzing the distribution 02:35:38.880 |
of silences on one of the Joe Rogan episodes. So I did that for the Elon conversation, where it's 02:35:46.560 |
like, you just take all the silences, after Joe asked the question and Elon responded, and you 02:35:52.240 |
plot that distribution and see how that looks like. - Yeah, I think there's a huge opportunity, 02:35:59.200 |
especially with long form podcasts, to do that kind of analysis, bigger than Joe. - Exactly. It 02:36:05.600 |
has to be a fairly unedited podcast so that you don't cut the silences. - So one of the benefits 02:36:10.400 |
I have doing this podcast is, what we're recording today is there's individual audio that's being 02:36:17.840 |
recorded. So I have the raw information, when it's published, it's all combined together, 02:36:22.960 |
and individual video feeds. So even when you're listening, which I usually do, I only show one 02:36:28.000 |
video stream, I can track your blinks and so on. But ultimately, the hope is you don't need that 02:36:37.520 |
raw data, because if you don't need the raw data for whatever analysis you're doing, you can then 02:36:41.920 |
do a huge number of podcasts. It's quickly growing now, the number, especially comedians, 02:36:47.760 |
there's quite a few comedians with long form podcasts, and they have a lot of facial 02:36:54.400 |
expressions, they have a lot of fun and all those kinds of things, and it's prone for analysis. - 02:37:00.000 |
There's so many interesting things. That idea actually sparked because I was watching 02:37:04.800 |
a Q&A by Steve Jobs, and I think it was at MIT. And then, people did a talk there, and then 02:37:13.520 |
the Q&A started, and people started asking questions. I was working while listening to it, 02:37:18.400 |
and someone asked a question, and he goes on a 20-second silence before answering the question. 02:37:23.760 |
I had to check if the video hadn't paused or something. And I was thinking about if that 02:37:31.840 |
is a feature of a person, how long on average you take to respond to a question, and if it's like- 02:37:37.680 |
- Oh, that's fascinating. - Has to do with how thoughtful you are, 02:37:40.800 |
and if that changes over time. - But it also could be, this is a really 02:37:44.720 |
fascinating metric, 'cause it also could be, it's certainly a feature of a person, but it's also a 02:37:51.040 |
- Like, if you normalize to the person, you can probably infer a bunch of stuff about the 02:37:55.680 |
question. So, it's a nice flag, like it's a really strong signal, the length of that silence, 02:38:00.640 |
relative to the usual silence they have. So, one, the silence is a measure of how thoughtful they 02:38:06.480 |
are, and two, the particular silence is a measure how- - Thoughtful the question was. 02:38:11.040 |
- Thoughtful the question was. It's really interesting. I mean, yeah. 02:38:13.600 |
- Yeah. I just analyzed Elon's episode, but I think there's room for exploration there. 02:38:21.040 |
- I feel like the average for comedians would be, I mean, the time would be so small, 02:38:25.840 |
'cause you're trained to, I would think you're reacting to hecklers, you're reacting to all 02:38:30.160 |
sorts of things, you have to be so quick. - Maybe, maybe. Yeah. But some of the greatest 02:38:34.640 |
comedians are very good at sitting in the silence. I mean, there's Louis CK, they play with that, 02:38:41.040 |
'cause you have a rhythm. Like Dave Chappelle, a comedian who did a Joe show recently, 02:38:49.760 |
he has a, especially when he's just having a conversation, he does long pauses. It's kind 02:38:56.400 |
of cool. It's one of the ways to have people hang in your word, is to play with the pauses. 02:39:04.960 |
To play with the silences and the emphasis and mid-sentence. There's a bunch of different things 02:39:11.200 |
that it'd be interesting to really, really analyze, but still soccer to me is, that one's 02:39:17.520 |
fascinating. I just want a conclusive, definitive statement about, 'cause there's so many 02:39:23.520 |
soccer highlights of both Messi and Ronaldo. I just feel like the raw data is there. 02:39:31.040 |
- In jest, and decide. - Definitive statement. 02:39:35.440 |
'Cause you don't have that with Pelé and Mardona. - Yeah, true. 02:39:38.800 |
- But here's a huge amount of high-def data, the annoying, the difficult thing, 02:39:43.840 |
and this is really hard for tracking. And this is actually where I kind of gave up. 02:39:47.920 |
I didn't really give much effort, but I gave up to the way that highlights or usually football 02:39:57.360 |
match filmed is they switch the camera. So they'll do a different switch perspective. So you have to, 02:40:03.360 |
it's a really interesting computer vision problem. When the perspective is switched, 02:40:07.280 |
you still have a lot of overlap about the players, but the perspective is sufficiently different that 02:40:12.320 |
you have to like recompute everything. So there's two ways to solve this. One is doing it the full 02:40:19.600 |
way where you're constantly doing the slam problem. You're doing a 3D reconstruction the whole time, 02:40:24.480 |
and projecting into that 3D world. But you could also, there could be some hacks. 02:40:28.960 |
Then I wonder like some trick where you can hop, like when the perspective shifts, 02:40:35.360 |
do a high probability tracking hops from one object to another. But I thought, 02:40:42.480 |
especially in exciting moments when you're passing players, like you're doing a single ball dribble 02:40:51.840 |
across players and you switch perspective, which is when they often do when you're making a run on 02:40:56.000 |
goal. If you switch your perspective, it feels like that's going to be really tricky to get right 02:41:01.040 |
automatically. But in that case, for instance, I feel like if somebody released that data set, 02:41:06.640 |
where it's like, you just have all like this data set, a massive data set of all these games from 02:41:13.440 |
say Ronaldo and Messi, like, and you just add that in like whatever, CSV format and some publicly 02:41:19.680 |
available data set like that. I feel like people would just, there would be so many cool things 02:41:25.120 |
that you could do with it. And you just set it free and then like the world would like do its 02:41:29.360 |
thing. And then like interesting things would come out of it. - By the way, I have this data set. So 02:41:34.480 |
the two things I've did of this scale is soccer. So it's body pose and ball tracking for soccer. 02:41:42.320 |
And then it's the pupil tracking and blink tracking for, it was Joe Rogan and a few other 02:41:49.120 |
podcasts that I did. So those are the two data sets I have. - Did you analyze any of your podcasts? 02:41:54.400 |
- No, I think I really started doing this podcast after doing that work and it's difficult to, 02:42:04.080 |
maybe I'd be afraid of what I find. I'm already annoyed with my own voice and video, 02:42:12.800 |
like editing it. But perhaps that's the honest thing to do. 'Cause one useful thing about doing 02:42:18.880 |
computer vision about myself is like, I know what I was thinking at the time. So you can start to 02:42:24.480 |
connect the particular, the behavioral peculiarities of like the way you blink, the way 02:42:32.000 |
you squint, the way you close your eyes, like talking about details. It's like, for example, 02:42:39.520 |
I just closed my eyes. Is that a blink or no? Like figuring that out in terms of timing, in terms of 02:42:46.240 |
the blink dynamics is tricky. It's very doable. I think there's universal laws about what is a blink 02:42:53.920 |
and what is a closed eye and all those things, plus makeup and eyelashes. I actually have 02:42:59.920 |
annoyingly long eyelashes. So I remember when I was doing a lot of this work, I would cut off my 02:43:05.200 |
eyelashes, which when like, especially it was funny, like female colleagues were like, what the 02:43:11.040 |
fuck are you doing? Like, no, keep the eyelashes. But it, cause it got in the way, made the computer 02:43:16.240 |
vision a lot more difficult, but. - Super interesting topics. 02:43:20.240 |
- Yeah. But speaking about the one, still on the topic of the data sets for sports, there's one, 02:43:26.400 |
one paper and I actually edited it on format and it's, it was published in 90s, 90s, I believe, 02:43:35.440 |
90s or 80s, I forget. But the researcher was effectively looking at the hot end phenomena 02:43:43.920 |
in basketball, right? So whether like the fact that you just made a field goal, 02:43:48.960 |
if, you know, if on your next attempt, if you're more likely to make it or not. 02:43:54.400 |
And it was super interesting cause I mean, he pulled like, I think a hundred undergrads and 02:44:01.760 |
I think from Stanford and Cornell and asking people like, do you, do you think that's, that 02:44:06.320 |
do you have a higher likelihood of making your free throw if you just, just made one? And I think 02:44:10.880 |
it was like 68, 68% said yes, they believe that. And then he looked at the data and this was back 02:44:20.240 |
in, as I said, like few decades ago. And so I think he had the data set of about, he looked at 02:44:25.600 |
it specifically for free throws and he had a data set of about 5,000 free throws. And 02:44:31.200 |
and effectively what he found was that specifically in the case of free throws, 02:44:37.280 |
he didn't, for the aggregate data, he didn't find that he couldn't really spot that correlation, 02:44:44.800 |
that hot end correlation. So if you made the first one, you weren't more likely to, to make 02:44:50.480 |
the second one. What he did find was that they were just better at the second one, 02:44:54.400 |
because you just got like maybe a tiny practice and you just attempted once and then, and then 02:44:59.920 |
you're going to be better at the next one. And then I, I, then I went and there's a data set 02:45:04.320 |
on Kaggle that has like 600,000 free throws. And I re-ran the, the same computations and, 02:45:11.440 |
and, and confirmed, like, you can see a very clear pattern that they're just better 02:45:16.080 |
at their second free throw. - That's interesting. Cause I think there's 02:45:20.080 |
similar, that kind of analysis is so awesome. Cause I think with tennis, they have like, 02:45:25.200 |
like a fault, like when you serve, they have analysis of like, are you most likely to miss 02:45:30.000 |
the second serve if you missed the first, obviously. I think that's the case. So that 02:45:35.200 |
integrates, that's so cool when psychology is converted into metrics in that way. And in sports, 02:45:41.440 |
it's especially cool because it's such a constrained system that you can really study 02:45:46.640 |
human psychology because it's repeated, it's constrained. So many things are controlled, 02:45:51.520 |
which is something you rarely have in, in the wild psychological experiments. So it's cool. 02:45:57.200 |
Plus everyone loves it. Like sports is really cool to analyze. 02:46:01.600 |
- People actually care about the results. - Yeah. I still think, well, like I, 02:46:08.800 |
I know we'll definitely publish this work on Messi versus Ronaldo and 02:46:12.560 |
objective, fully objective. - I'd love to peer review. 02:46:19.200 |
Yeah, this is very true. This is not past peer review. Let me ask sort of an advice question 02:46:26.560 |
to the young folks. You've explored a lot of fascinating ideas in your life. You built a 02:46:32.880 |
startup, worked on physics, worked in computer science. What advice would you give to young 02:46:38.640 |
people today in high school, maybe early college about life, about career, about science and 02:46:46.000 |
mathematics? - I remember like, I read, like, I remember 02:46:51.520 |
reading that Poincaré was once asked by a French journal about his advice for young people and 02:47:00.320 |
what was his teaching philosophy. And he said that like, one of the most important things that 02:47:05.200 |
parents should teach their kids is how to be enthusiastic in regards to like the mysteries 02:47:11.360 |
of the world. And that he said, like, striking that balance was actually one of the most important 02:47:16.480 |
things between like in education, you know, you want to have your kids be enthusiastic about the 02:47:21.200 |
mysteries of the world, but you also don't want to traumatize them. Like if you really force them 02:47:24.640 |
into something. And I think like, especially if you're young, I think you should be curious. 02:47:32.640 |
And I think you should explore that curiosity to the fullest, to the point where you even become 02:47:40.320 |
almost as an expert on that topic. And you might start with something that it's small, 02:47:46.400 |
like you might start with, you know, you're interested in numbers and how to factor numbers 02:47:50.720 |
into primes. And then all of a sudden you go and you're like lost in number theory and you 02:47:56.160 |
discover cryptography. And then all of a sudden you're buying Bitcoin. And I think you should do 02:48:01.520 |
this. You should really try to fulfill this curiosity and you should live in a society that 02:48:06.800 |
allows you to fulfill this curiosity, which is also important. And I think you should do this 02:48:12.000 |
not to get to some sort of status or fame or money, but I think this is the way, this iterative 02:48:17.360 |
process, I think this is the way to find happiness. And I think this is also allows you to find the 02:48:23.760 |
meaning for your life. I think it's all about like being curious and being able to fulfill 02:48:29.440 |
that curiosity and that path to fulfilling that, your curiosity. - Yeah, the star small, 02:48:36.560 |
the fire build is kind of interesting way to think about it. - And you never know where you're 02:48:40.320 |
going to end up. It's like, for us, Formaz is just a really good example. We started like by 02:48:46.800 |
doing this as an internal like thing that we did in the company. And then we started putting out 02:48:52.080 |
there and now a lot of people follow it and know about it. And so... - And you still don't know 02:48:56.960 |
where Formaz library is going to end up actually. - True, exactly. So yeah, I think that would be my 02:49:03.440 |
piece of advice with very limited experience, of course, but yeah, I agree. - I agree. I mean, 02:49:10.800 |
is there something from particular, Joao, from the computer science versus physics perspective, 02:49:15.920 |
do you regret not doing physics? Do you regret not doing computer science? Which one is the 02:49:22.320 |
wiser, the better human being? This is Messi versus Ronaldo. Those are very, I don't know 02:49:29.920 |
if you would agree, but they're kind of different disciplines. - True. - Yeah. - Very much so. 02:49:35.280 |
I actually, I had that question in my mind. I took physics classes as an undergrad or like 02:49:45.280 |
besides what I had to take. And it's definitely something that I considered at some point. 02:49:53.440 |
And I do feel like later in life, that might be something that I'm not sure if regret is the right 02:50:04.720 |
word, but it's kind of something that I can imagine in an alternative universe, what would 02:50:09.840 |
have happened if I had gone into physics. I try to think that like, well, it depends on what your 02:50:19.280 |
path ends up being, but that it's not super important, right? Like exactly what you decide to 02:50:25.360 |
major on. I think Tim Urban, the blogger had a good visualization of this where it's like, 02:50:34.160 |
he has a picture where you have all sorts of paths that you could pursue in your life. And then 02:50:40.480 |
maybe you're in the middle of it. And so there's maybe some paths that are not accessible to you, 02:50:44.480 |
but like the tree that is still in front of you, it gives you a lot of optionality. And so- 02:50:48.960 |
- There's two lessons to learn from that. Like we have a huge number of options now, 02:50:53.440 |
and probably you're just one to reflect, like to try to derive wisdom from the one little path 02:51:02.960 |
you've taken so far may be flawed because there's all these other paths you could have taken. 02:51:07.040 |
So it's like, so one, it's inspiring that you can take any path now. And two, 02:51:13.120 |
it's like the path you've taken so far is just one of many possible ones. But it does seem that 02:51:18.880 |
physics and computer science both open a lot of doors and a lot of different doors. 02:51:26.480 |
- It is. I feel like in this case, and especially in our case, because I could see the difference. 02:51:32.080 |
I studied, I went to college in Europe and João went to college here in the US. So I could see 02:51:38.080 |
the difference in like, the European system is more rigid in the sense that when you decide 02:51:43.760 |
to study physics, you don't have a lot, especially in the early years, you don't have a lot of, 02:51:47.200 |
you can't choose to take like a class from like computer science course or something like that. 02:51:52.880 |
You don't have a lot of freedom to explore in that sense in university, as opposed to here 02:51:56.640 |
in the US where you have more freedom. And I think that's important. I think that's what 02:52:02.880 |
constitutes a good kind of educational system is one that gravitates towards the interests of a 02:52:09.120 |
student as you progress. But I think in order for you to do that, you need to explore different 02:52:14.480 |
areas. And I felt like if I had a chance to take say more computer science class when I was in 02:52:19.680 |
college, I would have probably have taken those classes. But yeah, but I ended up like focusing 02:52:24.800 |
maybe too much in physics. And I think here, at least my perception is that you can explore more 02:52:32.560 |
fields. - But there is a kind of, it's funny, but physics can be difficult. So I don't see too many 02:52:39.440 |
computer science people then exploring into physics. It's like the one, not the one, but 02:52:46.400 |
one of the beneficial things of physics, it feels like it, what was it, Rutherford that said like, 02:52:54.160 |
like basically that physics is the hard thing and everything is easy. So like there's a certain 02:53:00.640 |
sense once you've figured out some basic like physics, that it's not that you need the tools 02:53:05.600 |
of physics to understand the other disciplines. It's that you're empowered by having done difficult 02:53:11.280 |
shit. I mean, the ultimate, I think is probably mathematics there. - Yeah, true. - So maybe just 02:53:17.600 |
doing difficult things and proving to yourself that you can do difficult things, whatever those 02:53:22.080 |
are. - That's net positive, I believe. - Net positive. - Yeah. And I think like, before I 02:53:27.120 |
started a company, I worked in the financial sector for a bit. And I think having a physics 02:53:34.000 |
background, I felt I was not afraid of learning finance things. And I think when you come from 02:53:40.000 |
those backgrounds, you are generally not afraid of stepping into other fields and learning about 02:53:44.960 |
those because I feel they've learned a lot of difficult things and that's an added benefit, 02:53:52.320 |
I believe. - This was an incredible conversation, Luis, João. We started with, who do we start 02:54:00.480 |
with? Feynman, ended up with Messi and Ronaldo. So this is like the perfect conversation. It's 02:54:05.360 |
really an honor that you guys would waste all this time with me today. It was really fun. Thanks 02:54:09.600 |
for talking. - Thank you so much for having us. Yeah, thank you so much. - Thanks for listening 02:54:13.840 |
to this conversation with Luis and João Batala. And thank you to Skiff, SimplySafe, Indeed, NetSuite 02:54:20.880 |
and Four Sigmatic. Check them out in the description to support this podcast. And now 02:54:26.560 |
let me leave you with some words from Richard Feynman. "Nobody ever figures out what life is 02:54:31.280 |
all about and it doesn't matter. Explore the world. Nearly everything is really interesting 02:54:37.600 |
if you go into it deeply enough." Thank you for listening. I hope to see you next time.