GPT 4 - hype vs reality
00:00:00.000 |
You may have heard rumours that ChatGPT 4 is going to be released imminently, say in January, 00:00:06.580 |
and that it's going to dwarf ChatGPT 3's ability, just like you can see in this graph. 00:00:12.480 |
Well, I'm going to let Sam Altman correct the record here on both fronts. 00:00:17.140 |
And then at the end of the video, I'm going to discuss what I think ChatGPT 4 will be capable of. 00:00:22.140 |
Here's Sam Altman on the timing of the release of ChatGPT 4. 00:00:27.100 |
Can you comment on whether GPT 4 is coming out in the first quarter, first half of the year? 00:00:32.440 |
It'll come out at some point when we are like confident that we can do it safely and responsibly. 00:00:38.640 |
I think in general, we are going to release technology much more slowly than people would like. 00:00:45.840 |
We're going to sit on it for much longer than people would like. 00:00:48.300 |
Then, referring to ChatGPT 4, he talked about whether it will be an exponential increase in terms of its abilities. 00:00:58.040 |
Given the magnitude of the economic impact we expect here, more gradual is better. 00:01:03.700 |
And so putting out a very weak and imperfect system like ChatGPT and then making it a little better this year, 00:01:09.400 |
a little better later this year, a little better next year, that seems much better than the alternative. 00:01:14.040 |
Then Sam directly addressed the hype train that had been generated by graphics such as this one. 00:01:21.000 |
I saw a visual, and I don't know if it was accurate, but it showed GPT 3.5 versus, I guess, what GPT 4 is expected. 00:01:33.940 |
The GPT 4 rumor mill is like a ridiculous thing. 00:01:41.740 |
I don't know why people don't have better things to speculate on. 00:01:44.440 |
I get a little bit of it, like it's sort of fun, but that it's been going for like six months at this volume. 00:01:50.200 |
People are begging to be disappointed, and they will be. 00:01:52.940 |
Like it's, you know, people are going to like, the hype is just like, we don't have an actual AGI. 00:01:58.060 |
And I think that's sort of what is expected of us. 00:02:00.000 |
And, you know, yeah, we're going to disappoint those people. 00:02:02.440 |
In a moment, you're going to see Sam respond to a question about ChatGPT putting Google out of business. 00:02:09.260 |
And he made a fantastic point, which I'm going to go into more detail on after he gives his remarks. 00:02:16.200 |
Yeah, I think whenever someone like talks about a technology being the end of some other giant company, it's usually wrong. 00:02:21.500 |
Like I think people forget they get to make a counter move here and they're like pretty smart, pretty competent. 00:02:27.160 |
The counter move that Sam is referring to may well be PaLM, which is a 540-billion-parameter transformer model. 00:02:37.200 |
And you can see from the graphic the improvements that have been made by increasing the number of parameters. 00:02:43.900 |
But I've done some research on PaLM. See this graph, and notice the line crossing the performance of the average human. 00:02:52.400 |
It admits, when you go into detail about the tasks that PaLM can achieve, 00:02:57.100 |
that we're still talking about solving 60% of problems that 9-to-12-year-olds can solve. 00:03:05.960 |
So the average 12-year-old can solve 60% of them and PaLM can solve 58% of them. 00:03:11.440 |
But that isn't exactly AGI, not yet, at least. 00:03:14.640 |
So if ChatGPT 4 is anywhere in that kind of ballpark, we're going to notice the difference. 00:03:20.340 |
We're going to see the improvement just as that graph increased quite dramatically by increasing the number of parameters. 00:03:26.600 |
Maybe it will be able to explain jokes, for example, understand books a bit better. 00:03:31.300 |
But we're definitely not talking about AGI, not yet. 00:03:35.400 |
I am certain that the initial buzz around ChatGPT 4 is going to be amazing. 00:03:40.060 |
But I'm going to let Sam address the difference between the initial buzz, 00:03:44.500 |
the impressive things it can do, and the robustness of the entire model. 00:03:49.500 |
One of the sort of strange things about these technologies is they are impressive, but not robust. 00:03:55.100 |
And so you use them in a first demo, you kind of have this like very impressive, like, wow, this is like incredible and ready to go. 00:04:02.860 |
You use them 100 times, you see the weaknesses. 00:04:05.460 |
And so I think people can get sort of a false impression of how good they are. 00:04:15.100 |
The critics who point these problems out and say, well, this is why it's like, you know, all like, like, you know, fake news or whatever, are equally wrong. 00:04:22.740 |
Thank you for watching and do subscribe for more such content.