AGI Inches Closer - 5 Key Quotes: Altman, Huang and 'The Most Interesting Year'

00:00:00.000 | It has been an interesting few days for the pursuit of Artificial General Intelligence.

00:00:05.300 | So I wanted to give you some of the highlights from GPT 4 point something to recursively improving semiconductor manufacturing.

00:00:14.820 | We got at least five revealing quotes from Jensen Huang, Sam Altman and others.

00:00:20.340 | And the summary is this.

00:00:22.260 | If you thought, or know someone who thinks, that artificial intelligence peaked with ChatGPT, you or they are going to have to weather exponential increases in computational power through at least the rest of this decade.

00:00:37.020 | As Sam Altman just said of 2024, and I agree, this is the most interesting year in human history, except for all future years.

00:00:46.140 | So let's start with Sam Altman, who said that OpenAI's goal is to avoid shocking updates.

00:00:52.660 | Our goal is not to have shock updates to the world, but that's what we're trying to do.

00:00:56.860 | That's like our stated strategy.

00:00:58.500 | And I think we're somehow missing the mark.

00:01:00.620 | So maybe we should think about, you know, releasing GPT-5 in a different way or something like that.

00:01:05.020 | Yeah, 4.71, 4.72.

00:01:08.540 | And what does he mean by releasing iteratively without shock updates?

00:01:12.500 | Well, probably something similar to another co-founder of OpenAI, Greg Brockman.

00:01:16.780 | He said that their plan for safety involved deploying GPT-5 in stages, essentially creating a continuum of incrementally better AIs, such as by deploying subsequent checkpoints of a given training run.

00:01:29.780 | Think of that like saves on the way to completing a video game.

00:01:33.580 | In short, it's highly likely now that we will be getting something equivalent to GPT-4.5 before we get GPT-5.

00:01:40.460 | As to whether the marketing department comes up with a better name than GPT-4.5, well, let's see.

00:01:46.020 | And on timing, he said this.

00:01:48.020 | Blink twice if it's this year.

00:01:50.140 | I also...

00:01:56.380 | We will release an amazing new model this year.

00:02:00.660 | I don't know what we'll call it.

00:02:02.140 | We'll release over in the coming months.

00:02:06.060 | Many different things, I think they'll be very cool.

00:02:08.540 | I think before we talk about, like, a GPT-5-like model, called that or not called that, or a little bit worse or a little bit better than what you'd expect from a GPT-5,

00:02:17.660 | I know we have a lot of other important things to release.

00:02:20.460 | And don't forget that not all progress depends on new models.

00:02:23.140 | We can have new systems like Let's Verify or Q* based on existing models.

00:02:28.300 | Sam Altman practically confirmed the existence of Q* in this interview with Lex Fridman from yesterday.

00:02:34.180 | Can you speak to what Q* is?

00:02:36.060 | We are not ready to talk about that.

00:02:37.820 | See, but an answer like that means there's something to talk about.

00:02:40.660 | It's very mysterious, Sam.

00:02:42.980 | I mean, we work on all kinds of research.

00:02:45.820 | Yeah.

00:02:46.820 | I've done an entire video gathering the best evidence as to what Q* is, so do check that one out.

00:02:54.180 | But if you want the massively condensed TL;DR, it's this.

00:02:57.940 | Models can essentially think or compute for longer on harder questions and generate thousands of example answers

00:03:05.780 | and have internal systems for checking which answer is best and only showing you that best answer.

00:03:10.980 | A system two to complement the base system one thinking, if you will.
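As a rough illustration of that "generate many answers, check them, show only the best" idea, here is a minimal Python sketch. It is not a description of how Q* or Let's Verify actually work. The functions `generate_candidates` and `verifier_score` are hypothetical stand-ins for a base model and a learned checker, and the toy task (multiplying two numbers) is an assumption chosen only so the example runs on its own.

```python
import random

def generate_candidates(question, n, rng):
    """Hypothetical stand-in for sampling n answers from a base model.

    Each sample is the true product plus a small random error, so many
    samples are wrong, loosely mimicking fast 'system one' guesses.
    """
    a, b = question
    return [a * b + rng.choice([-2, -1, 0, 0, 1, 2]) for _ in range(n)]

def verifier_score(question, answer):
    """Hypothetical stand-in for a learned verifier: higher is better.

    A real verifier would not know the true answer; this toy one does,
    purely to keep the sketch self-contained.
    """
    a, b = question
    return -abs(a * b - answer)

def best_of_n(question, n, seed=0):
    """Sample n candidates and return only the one the verifier ranks highest."""
    rng = random.Random(seed)
    candidates = generate_candidates(question, n, rng)
    return max(candidates, key=lambda ans: verifier_score(question, ans))

if __name__ == "__main__":
    # More samples means more compute spent on the same question.
    print(best_of_n((12, 13), n=1000))
```

The design point is that the extra compute goes into sampling and checking at answer time, on top of an existing model, rather than into training a new one.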

00:03:15.020 | But before we leave that interview, there were two more fascinating quotes, at least from my perspective, that I want to show you.

00:03:20.460 | One involved the possible social response to ever improving AI and the chances of it going theatrically wrong.

00:03:27.300 | I worry about that for AI.

00:03:28.460 | I think some things are going to go theatrically wrong with AI.

00:03:32.860 | I don't know what the percent chance is that I eventually get shot, but it's not zero.

00:03:38.780 | I'll come back to social responses later in this video, but that was a startling moment.

00:03:45.500 | At the moment, at least, thankfully, the power struggle for AGI is only financial.

00:03:51.340 | Here is Demis Hassabis essentially laughing when an interviewer said that OpenAI was a non-profit.

00:03:58.140 | A lot of AI labs have been grappling with governance and what is the best structure for something like AGI to emerge.

00:04:05.820 | You just mentioned the possibility of some sort of international collective or cooperative that would handle this.

00:04:11.340 | But, you know, across the industry, like OpenAI has set itself up as a non-profit with a for-profit subsidiary.

00:04:17.780 | Anthropic is a public benefit corporation.

00:04:20.660 | So before we get to NVIDIA's GTC, let's linger for a moment on AGI, its definition and recent updates to the timeline to AGI.

00:04:30.020 | Yesterday, Andrej Karpathy, who was until recently at the very top of OpenAI, said this about the definition of AGI.

00:04:37.660 | He thinks of it, as the OpenAI charter does, as an autonomous system that surpasses humans in most economically valuable tasks.

00:04:45.260 | And is it me or does that definition not automatically foreshadow economic strife?

00:04:50.020 | In other words, definitionally, AGI won't have arrived until it can do the work of at least half of all humans.

00:04:56.460 | Now, every word matters when we're defining something as consequential as AGI.

00:05:01.220 | And Google DeepMind, led by Demis Hassabis, moderated OpenAI's definition.

00:05:05.500 | They said we'll count it as having achieved AGI if we have systems that are technically capable of performing economically important tasks,

00:05:12.900 | but don't necessarily realize that economic value, as in they might not actually be deployed in the workforce for legal, ethical or social reasons.

00:05:20.940 | But imagine the economic incentives in that scenario.

00:05:24.420 | AGI would be here and be capable of realizing trillions of dollars of economic value.

00:05:30.020 | And these companies are supposed to hold back from deploying it.

00:05:33.580 | Would Google allow that? Would Microsoft? Or would the definition conveniently change?

00:05:38.820 | But even under that wider definition, when does Demis Hassabis think that AGI will arrive?

00:05:44.460 | I will say that when we started DeepMind back in 2010, you know, we thought of it as a 20 year project.

00:05:49.660 | And actually, I think we're on track, which is kind of amazing for 20 year projects, because usually they're always 20 years away.

00:05:55.940 | So that's the joke about, you know, whatever it is, quantum, AI, you know, take your pick.

00:06:00.660 | And but I think we, you know, I think we're on track.

00:06:03.100 | So I wouldn't be surprised if we had AGI like systems within the next decade.

00:06:07.820 | Others think that that moment, which again would have colossal economic ramifications, will come before 2030.

00:06:14.100 | Here's one alignment researcher at OpenAI.

00:06:17.020 | He thinks that there's around a two-thirds chance of AGI before 2028.

00:06:22.380 | And he goes on that he can't talk about all the reasons why he has this timeline,

00:06:26.620 | but mostly it should be figureoutable from publicly available information.

00:06:30.820 | I'm guessing that's an oblique reference to Q* or Let's Verify.

00:06:34.620 | He also returns to the economic definition of AGI.

00:06:38.220 | When I say AGI, I mean something which is basically a drop-in substitute for a human remote worker circa 2023.

00:06:45.780 | And not just a mediocre one, a good one, e.g. an OpenAI research engineer.

00:06:50.700 | Notice, though, he's focusing on remote work and even Karpathy limits his comments to digital work.

00:06:56.180 | But as we'll see at the end of this video, even physical tasks might be automated sooner than you think.

00:07:01.860 | Before we leave Daniel Kokotajlo, though, there's one more quote of his I want to show you.

00:07:06.420 | I think in this one, he's feeling somewhat panicked.

00:07:09.060 | Probably there will be AGI soon, literally any year now.

00:07:12.340 | And probably whoever controls AGI will also be able to use it to get to artificial super intelligence shortly thereafter.

00:07:19.860 | He says maybe in another year, give or take a year.

00:07:22.780 | Now, if you do the maths of that comment, give or take a year means that it could be instantaneous or it could take two years from AGI, according to him.

00:07:31.700 | At least according to Google DeepMind, an artificial super intelligence would involve outperforming 100% of humans,

00:07:38.500 | just like in their respective domains, AlphaZero and Stockfish already do.

00:07:42.860 | And in the light of these shortening timelines, some AI researchers are already adapting their behavior.

00:07:48.620 | One lead researcher at OpenAI said this.

00:07:51.180 | The closer we get to the singularity (that's the moment when progress is so fast humans can't even keep up),

00:07:56.140 | the lower, he said, my risk tolerance gets.

00:07:58.540 | I'd already ruled out skydiving and paragliding.

00:08:01.380 | Last year, I started wearing a helmet consistently while cycling.

00:08:05.100 | And he ended, I think this year might be the year I give up skiing.

00:08:08.660 | In other words, if you think AGI, ASI and the singularity are going to happen in the 2020s, it will be kind of a pity to die before that date, probably.

00:08:17.860 | But at this point in the video, and I promise you I will get to the GTC straight after this, things are getting kind of heavy.

00:08:23.860 | So I want to bring in a paper I read that's on a lighter note.

00:08:27.860 | What the paper says, essentially, is that peer reviewers are now starting to use ChatGPT wholesale to do peer review.

00:08:34.940 | How did they discover this?

00:08:36.060 | Well, through mentions of words like commendable, innovative, meticulous, intricate, notable and versatile.

00:08:42.060 | Now, I think those are words that I use all the time, but maybe not everyone does.

00:08:45.540 | Previously, they were incredibly rare in peer reviews, but they became somewhat common.

00:08:51.180 | Hmm. Makes you wonder.

00:08:52.660 | They go on that the estimated fraction of large language model generated text is higher in reviews which report lower confidence.

00:09:00.100 | That kind of makes sense, right?

00:09:01.140 | If you're not confident, you're going to use an LLM to help you.

00:09:03.940 | But the next bit is funny.

00:09:04.980 | It was also higher in reviews submitted close to the deadline.

00:09:07.140 | So you have these panicked peer reviewers who are like, oh, no, the deadline's coming.

00:09:11.020 | Let's use ChatGPT to do it.

00:09:12.580 | And the other correlation was it was more common from reviewers who are less likely to respond to author rebuttals.

00:09:18.900 | Now, that seems somewhat unfair to me.

00:09:21.020 | You don't even bother to write the peer review yourself and you don't even reply when the author replies to you.

00:09:26.900 | These were peer reviews of prominent deep learning conferences, and the rates were 10 and almost 17 percent.

00:09:34.340 | And we're not talking about spell checks.

00:09:35.820 | We're talking about being substantially modified by ChatGPT.

00:09:39.260 | Obviously, now is not the time to go through this paper, but I thought it's worth showing you.

00:09:42.620 | I mean, it's one more effect of AGI, right?

00:09:44.540 | The whole peer review system might become the AGI review system.

00:09:48.900 | So, on to the conference from around 24 hours ago. There's obviously way too much to get to in this video.

00:09:54.140 | But I'm going to give you the five moments that stood out for me.

00:09:56.860 | First, the obvious one, the announcement of the Blackwell GPU.

00:10:01.100 | Over the course of the last eight years, we've increased computation by 1000 times.

00:10:06.940 | Eight years, 1000 times.

00:10:08.500 | Remember back in the good old days of Moore's law, 10x every five years, 100 times every 10 years.

00:10:14.940 | 100 times every 10 years in the middle of the heydays of the PC revolution.

00:10:22.860 | Now, this graph does involve some hype because it's not comparing the same level of precision, FP16 to FP4.

00:10:29.260 | But the point still stands that we are exceeding Moore's law.
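To make that comparison concrete, the quoted figures can be turned into yearly growth rates. The short Python sketch below assumes smooth compounding and takes the numbers exactly as quoted (1000x in eight years, and Moore's law as 10x in five years or 100x in ten); it does not correct for the FP16 versus FP4 precision difference mentioned above.

```python
import math

def annual_growth(total_factor, years):
    """Constant yearly multiplier that compounds to total_factor over `years`."""
    return total_factor ** (1 / years)

def doubling_time(total_factor, years):
    """Years needed to double at that constant rate."""
    return years * math.log(2) / math.log(total_factor)

# Figures as quoted in the keynote clip, not independently verified.
claims = {
    "GPU compute, 1000x in 8 years": (1000, 8),
    "Moore's law as quoted, 10x in 5 years": (10, 5),
    "Moore's law as quoted, 100x in 10 years": (100, 10),
}

for label, (factor, years) in claims.items():
    print(f"{label}: {annual_growth(factor, years):.2f}x per year, "
          f"doubling every {doubling_time(factor, years):.2f} years")
```

On these numbers, the GPU claim works out to roughly 2.4x per year against about 1.6x per year for the quoted Moore's law pace, and the two Moore's law phrasings are the same rate stated twice.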

00:10:32.780 | Here's another example.

00:10:33.820 | The Blackwell Superchip system isn't just two times better at inference, that is, at actually generating tokens.

00:10:39.660 | It's 30 times more performant than the H100 series.

00:10:44.020 | In short, there's going to be a lot more generations from generative AI.

00:10:47.660 | The cost and energy consumption also drop by a major factor.

00:10:51.540 | And of course, almost every CEO in the world that you can think of lined up to praise and get in the queue for these Blackwell Superchips.

00:10:59.580 | Next, of course, the model sizes that these systems can serve just keep getting bigger.

00:11:04.660 | Remember, GPT-3 was trained at 175 billion parameters, then GPT-4 at a rumored 1.8 trillion or so.

00:11:11.700 | That's 10 times bigger.

00:11:12.940 | Well, notice how as we proceed, we're not doubling or 3Xing.

00:11:16.860 | We're talking about another tenfold increase.

00:11:19.420 | NVIDIA said that their server clusters could deploy a 27 trillion parameter model.

00:11:24.260 | Now, of course, just because NVIDIA can deploy that size of model doesn't mean that the AGI labs will create one that big.

00:11:30.140 | I think a more reasonable estimate for the next generation of models would be around 10 trillion parameters.

00:11:35.700 | But the point still stands, we're not doubling each time.
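The size of each jump is easy to check with a few lines of Python. The parameter counts below are the ones mentioned here; note that the GPT-4 figure is a rumor and the 10 trillion figure is only an estimate, so the ratios are illustrative rather than confirmed.

```python
# Parameter counts as mentioned in the video. The GPT-4 number is a rumor
# and the next-generation number is a guess, not a confirmed figure.
PARAMS = {
    "GPT-3": 175e9,
    "GPT-4 (rumored)": 1.8e12,
    "next generation (estimate)": 10e12,
    "NVIDIA deployable ceiling": 27e12,
}

def ratio(larger, smaller):
    """How many times more parameters `larger` has than `smaller`."""
    return PARAMS[larger] / PARAMS[smaller]

for larger, smaller in [
    ("GPT-4 (rumored)", "GPT-3"),
    ("next generation (estimate)", "GPT-4 (rumored)"),
    ("NVIDIA deployable ceiling", "GPT-4 (rumored)"),
]:
    print(f"{larger} vs {smaller}: {ratio(larger, smaller):.1f}x")
```

On these figures, GPT-3 to GPT-4 is a bit over 10x, the 27 trillion ceiling is 15x the rumored GPT-4 size, and the 10 trillion estimate is between 5x and 6x. Every one of those is well beyond a doubling.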

00:11:37.860 | In case you're not familiar, by the way, the number of parameters is like the number of dials in a model that you can tune to better match deep, intricate patterns and patterns within patterns.

00:11:47.260 | Of course, those patterns have to be found within the data that you give it.

00:11:50.380 | So garbage in, garbage out.

00:11:51.980 | But of course, everyone's working on getting higher quality data.

00:11:55.260 | The next interesting moment I think many people might have slept on.

00:11:58.420 | NVIDIA have built a platform that accelerates the compute intensive part of lithography.

00:12:04.260 | That's the key process in making new and more advanced chips.

00:12:08.580 | And in this announcement, NVIDIA say that TSMC are already going into production with this platform.

00:12:14.820 | They're going to be accelerating manufacturing and pushing the limits of physics for the next generation of advanced semiconductor chips.

00:12:21.820 | Not only that, but these 40x or 60x improvements also utilize Gen AI.

00:12:27.100 | As best I can tell, they're using generative AI to create a mask, which is key in lithography.

00:12:33.180 | Think of that mask as a template that transfers a pattern onto the chip.

00:12:37.700 | I'm reading the book Chip War at the moment, and lithography is absolutely key to the latest chips.

00:12:43.180 | But the bigger point is this.

00:12:44.300 | They're using Gen AI for ideation suggestions, but then the actual mask is derived by traditional physically rigorous methods.

00:12:51.700 | It's that marrying of Gen AI to suggest and traditional systems to check that we'll see again in a moment.

00:12:57.580 | But there is another obvious point here.

00:12:59.260 | This is somewhat recursive.

00:13:00.860 | We have better chips creating better, cheaper, faster generative AI.

00:13:05.100 | And now more and more, we're getting generative AI helping in the creation of new and better chips.

00:13:10.820 | And of course, those new chips might be with us sooner.

00:13:13.740 | Photo masks that took two weeks can now be processed overnight.

00:13:16.980 | Here's what the CEO of ASML, which, of course, is the company that's integral to the creation of semiconductors, said.

00:13:23.460 | This collaboration will bring tremendous benefit to computational lithography and therefore to semiconductor scaling.

00:13:30.460 | If you thought things were already progressing fast, it'll get even faster for the rest of this decade.

00:13:35.940 | One quick point to make here is that we're actually still lagging the front edge of what's computable by about two years.

00:13:42.420 | Most people, if they're not still using the original ChatGPT, are using GPT-4, which finished training two years ago, or Gemini 1 Ultra.

00:13:50.740 | But I spotted recently from Hassabis that Gemini 1 Ultra actually used just the same compute as was rumored for GPT-4.

00:13:58.420 | That's 2022 compute levels.

00:14:00.860 | Actually, Gemini 1 used roughly the same amount of compute, maybe slightly more than what was rumored for GPT-4.

00:14:06.060 | I don't know exactly what was used, so I think it was in the same ballpark.

00:14:09.820 | In other words, the public hasn't begun to grasp what even 2023 levels of compute could do for training a language model.

00:14:17.780 | But there was, of course, one more announcement that I simply cannot ignore.

00:14:21.700 | Project GR00T.

00:14:24.020 | This is NVIDIA Project GR00T, a general-purpose foundation model for humanoid robot learning.

00:14:36.900 | The GR00T model takes multimodal instructions and past interactions as input and produces the next action for the robot to execute.

00:14:47.100 | We developed Isaac Lab, a robot learning application to train GR00T on Omniverse Isaac Sim.

00:14:53.420 | We can train GR00T in physically based simulation and transfer zero-shot to the real world.

00:15:00.980 | The GR00T model will enable a robot to learn from a handful of human demonstrations so it can help with everyday tasks.

00:15:10.660 | And emulate human movement just by observing us.

00:15:15.940 | This is made possible with NVIDIA's technologies that can understand humans from videos, train models in simulation and ultimately deploy them directly to physical robots.

00:15:26.260 | As Jensen Huang said, humanoid robots will at first just watch us and learn from imitation data.

00:15:31.620 | But embodied learning does have one advantage over large language models.

00:15:35.300 | Embodied models can use reinforcement learning in simulation: try tasks in simulation, see how they work out and then perform in the real world.

00:15:42.900 | I actually discussed this with two leading figures at NVIDIA, for AI Insiders on Patreon.

00:15:48.340 | But let me leave you with this thought.

00:15:50.020 | If you think these robot imitations look kind of cute and rubbish at the moment, think about GPT-2 or maybe the first version of Bard that you interacted with.

00:15:58.980 | And now think of GPT-4 or Claude 3.

00:16:02.340 | In those cases, they were learning to imitate human text.

00:16:05.660 | In this case, it will be human actions.

00:16:07.780 | But the lesson is the same.

00:16:08.900 | These models can improve much faster than you might think.

00:16:11.980 | And don't forget with all of this, as yet another OpenAI employee put it, hope you enjoyed some time to relax.

00:16:18.340 | It'll have been the slowest 12 months of AI progress for quite some time to come.

00:16:23.460 | Hopefully you'll join me as I cover that progress in the coming months.

00:16:27.140 | Thank you as always for watching to the end and have a wonderful day.