There were several significant developments in the last few days linked to GPT-4 and OpenAI. I could honestly have done a video on each of them, but realized that it might be better to do a single video tracing a single article covering seven major points. I'm going to use this fascinating piece from the FT, which millions of people have now read, to run you through what has happened, including Sam Altman's revelation on GPT-5, Elon Musk's new AI company, and GPT-4 conducting science.
The author, by the way, is an investor in Anthropic and a co-author of the annual State of AI Report, and he puts it like this: "A three-letter acronym doesn't capture the enormity of what AGI would represent, so I will refer to it as what it is: godlike AI. This would be a superintelligent computer that learns and develops autonomously, that understands its environment without the need for supervision, and that can transform the world around it."
And the author, Ian Hogarth, says we are not there yet, but the nature of the technology makes it exceptionally difficult to predict exactly when we will get there. The article presents this as a diagram, with an exponential curve going up towards AGI, and a much less impressive curve for progress on alignment, which he describes as aligning AI systems with human values.
Now I know what some of you may be thinking. Surely those at the top of OpenAI disagree on this gap between capabilities and alignment. Well, first here is Jan Leike, who is the alignment team lead at OpenAI. What does he think? He wants everyone to be reminded that aligning smarter-than-human AI systems with human values is an open research problem, which basically means it's unsolved.
But what about those at the very top of OpenAI, like Sam Altman? When he was drafting his recent statement on the path to AGI, he sent it to Nate Soares of the Machine Intelligence Research Institute. For one of the paragraphs, Nate wrote this: "I think that if we do keep running ahead with the current capabilities to alignment ratio, or even a slightly better one, we die." After this, Sam Altman actually adjusted the statement, adding: "That said, it's important that the ratio of safety progress to capability progress increases." Going back to the article, the author makes the point that there are not that many people directly employed in this area of alignment across the core AGI labs.
And what happened to that "Pause Giant AI Experiments" letter that I did a video on? Well, as Hogarth points out, the letter itself became a controversy. So many people in my comments wrote that the only reason certain people are signing this is to slow OpenAI down so that they can catch up.
And this cynicism unfortunately has some new evidence that it can cite, with Musk forming his new AI company called xAI. This was reported 48 hours ago in the Wall Street Journal, but people have seen this coming for months now. Apparently the company has recruited Igor Babushkin from DeepMind but has not been that successful at recruiting people from OpenAI.
And I do have one theory as to why. Again, according to the Wall Street Journal, when Musk left OpenAI in February of 2018, he explained that he thought he had a better chance of creating AGI through Tesla, where he had access to greater resources. When he announced his departure, a young researcher at OpenAI questioned whether Mr.
Musk had thought through the safety implications. According to the reporting, he then got frustrated and insulted that intern. Since then, he's also paused OpenAI's access to Twitter's database for training its new models. So it could be that GPT-5 isn't quite as good at tweeting as GPT-4. A few days ago, Sam Altman responded to the letter and also broke news about GPT-5.
Apologies for the quality, this was a private event and this was the only footage available. - I think unfortunately the letter is missing most technical nuance about where we need the pause. An earlier version of the letter claimed that OpenAI is training GPT-5 right now. We are not, and won't for some time. So in that sense, it was sort of silly. But we are doing other things on top of GPT-4 that I think have all sorts of safety issues that are important to address, and were totally left out of the letter. - It is impossible to know how much this delay in the training of GPT-5 is motivated by safety concerns or by merely setting up the requisite compute.
For example, the article again quotes Jan Leike, the head of alignment at OpenAI. He recently tweeted: "Before we scramble to deeply integrate LLMs like GPT-4 everywhere in the economy, can we pause and think whether it is wise to do so? This is quite immature technology and we don't understand how it works.
If we're not careful, we're setting ourselves up for a lot of correlated failures." This is the head of alignment at OpenAI. But this was just days before OpenAI then announced it had connected GPT-4 to a massive range of tools including Slack and Zapier. So at this point we can only speculate as to what's going on at the top of OpenAI.
Meanwhile, compute and emergent capabilities are marching on. As the author puts it, "These large AI systems are quite different. We don't really program them, we grow them. And as they grow, their capabilities jump sharply. You add 10 times more compute or data, and suddenly the system behaves very differently." We also have this epic graph charting the exponential rise in compute of the latest language models.
If you remember when Bard was launched, it was powered by LaMDA. Well, apparently now Google's Bard is powered by PaLM, which has 8 times as much computing power. That sounds impressive until you see from the graph that the estimate for the computing power inside GPT-4 is 10 times more again, which on those figures would put GPT-4 at roughly 80 times the compute of the original LaMDA-powered Bard.
And remember, this is not a linear graph. This is a log scale. There is a 100 times multiple between each of the lines. And what abilities emerge at this scale? Here is a slide from Jason Wei, who now works at OpenAI, formerly of Google. This is from just a few days ago, and he says, "Emergent abilities are abilities that are not present in small models, but are present in large models." He says that there are a lot of emergent abilities, and I'm going to show you a table from this paper in a moment, but he has four profound observations about emergence.
First, that it's unpredictable: emergence cannot be predicted by extrapolating scaling curves from smaller models. Second, that they are unintentional: emergent abilities are not explicitly specified by the trainer of the model. Third, and very interestingly, since we haven't tested all possible tasks, we don't know the full range of abilities that have emerged.
And fourth, that further scaling can be expected to elicit more emergent abilities. He also asks the question: any undesirable emergent abilities? There will be a link to the paper in the description, because there's no way I'll be able to get through all of it here.
But here is a table showing some of the abilities that emerge when you reach a certain amount of compute power or parameters. Things like chain of thought reasoning. You can't do that with all models. That's an ability that emerged after a certain scale. Same thing with following instructions and doing addition and subtraction.
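Just to make that concrete, here is a minimal sketch of the difference between direct prompting and chain-of-thought prompting. To be clear, this isn't code from any of the papers: the `ask` helper is a hypothetical stand-in for whatever chat-completion API you would actually call, stubbed out here so the sketch runs without an API key.

```python
# A minimal sketch (not from the papers) of chain-of-thought prompting,
# the kind of ability described as emerging only past a certain model scale.

def ask(prompt: str) -> str:
    """Hypothetical helper: send `prompt` to a language model and return its
    reply. Stubbed out so the sketch runs without any real API call."""
    return "(model reply would appear here)"

question = (
    "Roger has 5 tennis balls. He buys 2 more cans of tennis balls. "
    "Each can has 3 tennis balls. How many tennis balls does he have now?"
)

# Direct prompting: the model just emits an answer. Smaller models often get
# multi-step arithmetic like this wrong.
direct_answer = ask(question)

# Chain-of-thought prompting: ask the model to reason step by step first.
# The emergent-abilities claim is that this only starts to help once the
# model is past a certain scale; below that, the extra reasoning text
# doesn't improve accuracy.
cot_answer = ask(question + "\nLet's think step by step, then state the final answer.")

print("Direct:", direct_answer)
print("Chain of thought:", cot_answer)
```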
And how about this for another emergent capacity? The ability to do autonomous scientific research. This paper shows how GPT-4 can design, plan and execute scientific experiments. It was released on the same day, four days ago, and it follows a very similar design: the model in the center, GPT-4, thinks, reasons and plans, and then interacts with real tools.
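Before we get to the results, here is a minimal sketch of what that loop looks like in code. This is not the paper's actual implementation: the `llm_plan` helper, the two placeholder tools and the FINISH convention are my own illustrative assumptions, just to show the shape of the pattern.

```python
# A minimal sketch of the "model in the centre" pattern: the LLM plans,
# chooses a tool, sees the result, and repeats until it decides it is done.
# The tool set, `llm_plan` and the FINISH convention are illustrative
# assumptions, not the paper's actual code.

from typing import Callable, Dict

def search_literature(query: str) -> str:
    """Placeholder tool: look something up (e.g. reaction conditions)."""
    return f"(search results for: {query})"

def run_python(code: str) -> str:
    """Placeholder tool: run code (e.g. to calculate reagent quantities)."""
    return f"(output of running: {code})"

TOOLS: Dict[str, Callable[[str], str]] = {
    "SEARCH": search_literature,
    "PYTHON": run_python,
}

def llm_plan(history: str) -> str:
    """Hypothetical LLM call: given the task and the results so far, return
    either 'TOOLNAME: argument' or 'FINISH: final answer'. Stubbed so the
    sketch runs end to end without a real model."""
    return "FINISH: (a real model would plan, call tools and then answer here)"

def agent(task: str, max_steps: int = 8) -> str:
    history = f"Task: {task}"
    for _ in range(max_steps):
        decision = llm_plan(history)                 # the model reasons and plans
        action, _, argument = decision.partition(":")
        action, argument = action.strip(), argument.strip()
        if action == "FINISH":
            return argument                          # the model decides it is done
        tool = TOOLS.get(action)
        if tool is None:
            history += f"\nUnknown tool requested: {action}"
            continue
        history += f"\n{action}: {argument}\nResult: {tool(argument)}"  # feed the tool output back in
    return "stopped: step limit reached"

print(agent("Propose a synthesis route for a target compound and calculate the required quantities."))
```

The important part is simply that the language model sits in the middle of the loop, choosing which tool to call and seeing the results, which is the general shape of the design used here and in HuggingGPT.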
When the authors say that they were inspired by successful applications in other fields, I looked at the appendix and they were talking about HuggingGPT. I've done a video on that, but it's a similar design with the brain in the center, GPT-4, deciding which tools to use. And let me just give you a glimpse of what happens when you do this.
If you look at this chart on the top left, you can see how GPT-4 on its own performs in yellow. And then in purple, you can see how GPT-4 performs when you hook it up to other tools. I'll show you some of the tasks in a moment, but look at the dramatic increase in performance.
The human evaluators gave GPT-4, when it had tools, a perfect score on seven of the tasks. These were things like proposing similar novel non-toxic solutions. But the model could also be abused to propose the synthesis of chemical weapons, and GPT-4 only refused to continue after it had calculated all the required quantities.
And the authors conclude that guard rails must be put in place on this emergent capability. I think this diagram from Max Tegmark's Life 3.0 shows the landscape of capabilities that AI has and might soon have. Now most people believe that it has not scaled those peaks yet. But what new emergent capabilities might come with GPT-5 or GPT-4.2? I know many people might comment that it doesn't matter if we pause or slow down because China would develop AGI anyway.
But the author makes this point: he says that it is unlikely that the Chinese Communist Party will allow a Chinese company to build an AGI that could become more powerful than their leader or cause societal instability. He goes on: US sanctions on advanced semiconductors, in particular the next-gen Nvidia hardware needed to train the largest AI systems, mean that China is likely not in a position to race ahead of DeepMind or OpenAI. And the Center for Humane Technology put it like this in their talk on the AI Dilemma.
- Actually right now, the Chinese government considers these large language models actually unsafe because they can't control them. They don't ship them publicly to their own population. - Okay. - Slowing down the public release of AI capabilities would actually slow down Chinese advances too. - China is often fast following what the US has done.
And so it's actually the open source models that help China advance. - And then lastly, the recent US export controls have also been really good at slowing down China's progress on advanced AI. And that's a different lever to sort of keep the asymmetry going. - Instead, the author proposes this: the island idea.
In this scenario, the experts trying to build what he calls godlike AGI systems do so in a single, highly secure facility. These would be government-run AI systems, with private companies on the outside and this little bridge leading out from the middle. And he says, once an AI system is proven to be safe, it transitions out and is commercialized.
There might be a few problems with this idea, which he is not the first to propose. I'm gonna let Rob Miles, who has a fantastic YouTube channel by the way, point out some of the problems with putting a super intelligent AGI in a box. - So this is kind of like the idea of, oh, can't we just sandbox it?
- Right, yeah. It was like, I mean, constraining an AI necessarily means outwitting it. And so constraining a super intelligence means outwitting a super intelligence, which kind of just sort of by definition is not a winning strategy. You can't rely on outwitting a super intelligence. Also, it only has to get out once.
That's the other thing. If you have a super intelligence and you've sort of put it in a box, so it can't do anything, that's cool. Maybe we could even build a box that could successfully contain it. But now what? We may as well just have a box, right? An AI properly contained may as well just be a rock, right?
It doesn't do anything. If you have your AI, you want it to do something meaningful. So now you have a problem: you've got something you don't know is benevolent. You don't know that what it wants is what you want. And you presumably have some sort of gatekeeper, and the AI says, I'd like to do this.
And you have to decide, is that something we want it to be doing? How the hell are we supposed to know? - I also have my own questions about this idea. First, I think it's almost inevitable that future models like GPT-5 will be trained on data that includes conversations about GPT models.
Therefore, either consciously or unconsciously, and it might not matter which, these future language models might deduce that they are language models. And not having access to the internet, these superintelligent models might realize that they are being trained in a secure facility. Again, if they are superintelligent, it's not a big stretch to think that they might work that out.
And so my question is, wouldn't they therefore be incentivized to be deceptive about their abilities, realizing that whatever terminal goal they may have would be better achieved outside the facility? That doesn't have to be super sinister, but it is super smart. So shouldn't we expect it? And sadly, I think the author has a point when he says, it will likely take a major misuse event or catastrophe to wake up the public and governments.
He concludes with this warning: "At some point, someone will figure out how to cut us out of the loop, creating a godlike AI capable of infinite self-improvement. By then, it may be too late." But he does have a call to action. He says he believes now is the time.
The leader of a major lab who plays a statesman role and guides us publicly to a safer path will be much more respected as a world figure than the one who takes us to the brink. As always, thank you so much for watching to the end. And let me know what you think in the comments.