back to indexSelf-Driving Cars: State of the Art (2019)
Chapters
0:0 Introduction
1:53 2018 in review
4:49 Fatalities
8:29 Taxi services
10:54 Predictions
16:55 Human-centered autonomy
19:42 Levels of autonomy and proliferation strategies
24:48 Out-of-the-box ideas
27:28 Who will be first?
29:26 Historical context
31:5 Underlying beliefs of the industry and public
32:32 Driving is hard
35:32 Humans are amazing
37:10 Humans and automation don't mix well?
41:55 Two approaches: Lidar vs Vision
49:54 In the meantime… data
52:49 The road ahead
00:00:14.720 |
as we also get to hear from the different perspectives 00:00:30.520 |
for many it may be about entrepreneurial possibilities 00:00:34.800 |
but really it's about improving access to mobility, 00:00:39.600 |
moving people around the world that don't have that ability, 00:00:54.840 |
in the time we spend in traffic and transportation. 00:00:58.880 |
One of the most hated things in terms of stress and motion, 00:01:09.180 |
So the ability to convert that into efficiency, 00:01:18.960 |
at least for me and for many of us working in this space, 00:01:29.880 |
somebody in the world dies in a car auto crash. 00:01:54.220 |
with that base of mission that we're all working towards 00:01:57.400 |
from the different ideas and different perspectives, 00:02:00.560 |
I would like to review what happened in 2018. 00:02:09.800 |
in deploying and testing their vehicles in various domains 00:02:13.880 |
and have in October reached the mark of 10 million miles 00:02:18.480 |
driven autonomously, which is an incredible accomplishment. 00:02:21.240 |
It's truly a big step for fully autonomous vehicles 00:02:26.240 |
in terms of deployment and obviously is growing 00:02:30.580 |
and growing by day and we'll have Drago here from Waymo 00:02:40.240 |
that's the pair, that's the mirror side of this equation. 00:02:49.040 |
is the one billion mile mark reached by Tesla 00:02:58.840 |
its position in the lane, center itself in the lane, 00:03:01.440 |
it's able to control the longitudinal movement, 00:03:12.500 |
is the ability to do so for many minutes at a time, 00:03:15.240 |
even hours at a time, especially on highway driving, 00:03:19.120 |
And the fact that they've reached one billion 00:03:21.480 |
with a B miles is an incredible accomplishment. 00:03:24.940 |
All of that from the machine learning perspective is data, 00:03:32.440 |
are driven with the primary sensor being a camera, 00:03:39.480 |
And how does computer vision work in modern day? 00:03:42.080 |
Especially with the second iteration of Autopilot hardware, 00:03:50.400 |
That is probably the largest deployment of neural networks 00:03:55.400 |
in the world that has a direct impact on a human life 00:04:17.120 |
on ImageNet and you sit there with a TensorFlow 00:04:26.960 |
You take from that a step towards there's a human life, 00:04:31.560 |
your parents driving, your grandparents driving this, 00:04:39.400 |
and there's a neural network making the decision 00:04:43.120 |
So that one billion mark is an incredible accomplishment. 00:04:54.800 |
There's been two fatalities that happened in March of 2018. 00:05:00.720 |
with Uber in Tempe, Arizona hitting a pedestrian 00:05:08.320 |
And on the semi-autonomous side of Tesla Autopilot, 00:05:12.400 |
the third fatality that Tesla Autopilot led to 00:05:15.680 |
and the one in 2018 is in Mountain View, California 00:05:20.560 |
when Tesla slammed into a divider, killing his driver. 00:05:33.600 |
as we talk about the progression of autonomous vehicles, 00:05:38.000 |
proliferation in our world is our response as a public, 00:05:48.640 |
And obviously there's a disproportionate amount 00:06:03.760 |
in order to design successful autonomous vehicles, 00:06:25.800 |
can have significant detrimental effect to the progress 00:06:32.320 |
So that's something we really have to think about. 00:06:37.720 |
So the question was, do we know the rate of fatalities 00:06:47.800 |
So there's about 80, 90, 100 million miles driven 00:06:51.560 |
in manually controlled cars at every fatality. 00:06:55.280 |
So one fatality per, depending on which numbers you look at, 00:07:04.880 |
the fatality is, well, we could just take the one billion 00:07:14.160 |
And that's something actually that we're working on 00:07:22.240 |
that directly are comparable to the autopilot miles. 00:07:26.480 |
So autopilot is a modern vehicle that's much safer. 00:07:31.080 |
than the general population of manually driven vehicles. 00:07:34.440 |
Autopilot is driven on only particular kinds of roads, 00:07:42.720 |
all these kinds of factors need to be considered 00:07:54.780 |
And for anyone that's ever taken a statistics class, 00:08:03.720 |
by which to make any significant conclusions. 00:08:15.120 |
Which PR and marketing aspects of these different companies 00:08:26.480 |
But it's something that we have to think about. 00:08:51.260 |
In many of these cases, it's very small scale, 00:08:58.200 |
in a constrained environment, in a constrained community. 00:09:02.000 |
And almost always, really always with a safety driver, 00:09:06.480 |
there's a few exceptions for demonstration purposes. 00:09:09.240 |
But there's always an actual driver in the seat. 00:09:11.640 |
Some of the brilliant folks representing these companies 00:09:20.640 |
Awesome work they're doing in villages in Florida. 00:09:37.520 |
Waymo One that's gotten some publicity in Phoenix, Arizona. 00:09:50.100 |
So we didn't say it has to be delivering humans, 00:09:58.840 |
its autonomous vehicle taxi service testing in Pittsburgh 00:10:34.920 |
the head of Tesla Autopilot that launched Aurora, 00:10:37.560 |
and the Chris Hermsen behind this young upstart company 00:10:41.800 |
is doing testing in San Francisco and Pittsburgh. 00:10:44.080 |
And then Cruise, Kyle will be here to talk from GM, 00:10:47.240 |
is doing testing in San Francisco, Arizona, and Michigan. 00:10:56.440 |
when we're going to have autonomous vehicles. 00:10:59.360 |
And when you yourself think about what it means, 00:11:06.600 |
such that the Uber that you call will be autonomous 00:11:14.720 |
is what we think about how we define autonomous, 00:11:34.880 |
and it will give you a ride in a particular location. 00:11:46.920 |
than most of the companies that we're talking about today. 00:11:55.880 |
it says 10,000, where there's a meaningful deployment. 00:11:59.740 |
When it's truly going beyond that prototype demo mode 00:12:06.320 |
to where it's really touching the general population 00:12:09.300 |
in a fundamental way, scale is everything here. 00:12:16.200 |
there's 46,000 active Uber drivers in New York City. 00:12:40.180 |
has to make a prediction, but they all seem to. 00:12:43.660 |
All the major automakers have made a prediction 00:12:47.320 |
when they will be able to deploy autonomous vehicles. 00:12:56.700 |
a prediction that they will have autonomous vehicles 2018. 00:12:59.780 |
In 2018, they've now adjusted the prediction to 2019. 00:13:04.260 |
Nissan, Honda, Toyota have made prediction for 2020 00:13:27.700 |
So there is the predictions that are extremely optimistic 00:13:41.340 |
that they're at the cutting edge of innovation. 00:13:44.220 |
And then there is many of the leading engineers 00:13:49.100 |
including Carl and Yamaha and Gil Pratt from MIT, 00:14:04.460 |
to remove the human from the loop of automation. 00:14:20.300 |
And that's how he thinks about autonomous vehicles, 00:14:22.780 |
even with greater and greater degree of automation, 00:14:25.020 |
there's still going to have to be a human in the loop. 00:14:33.980 |
and they're making some announcements at CES, 00:14:49.560 |
is one of the sort of the great roboticists in the world 00:14:53.900 |
and a lot of our progress, historically speaking, 00:14:59.740 |
we can think of it as the Elon Rodney spectrum 00:15:07.300 |
The Elon Musk, who's extremely bold and optimistic 00:15:18.360 |
the impossible is possible in order to make it happen. 00:15:26.180 |
the former head of C-Cell, the AI laboratory here, 00:15:37.160 |
For Rodney, the vehicles are really fully autonomous, 00:16:02.800 |
when you ban manually driven vehicles in certain parts. 00:16:09.380 |
majority of US cities will ban manually driven vehicles. 00:16:18.180 |
is that my guess is that in probably 10 years, 00:16:27.300 |
So we also have to think about the long tail of the fact 00:16:32.300 |
that many people drive cars that are 10 years old, 00:16:48.320 |
to take a little pause into the ridiculous and the fun 00:16:53.600 |
Yes, that is me playing guitar in our autonomous vehicle. 00:17:08.960 |
Now for those of you born in the 90s, that's classic rock. 00:17:15.000 |
So the point I'm trying to make beyond predictions 00:17:18.560 |
is that autonomous vehicles will not be adopted 00:17:24.120 |
in the next 10, 15 years, because they're safer. 00:17:33.440 |
that that's going to be the reason you adopt. 00:17:49.520 |
and therefore more annoying in the way we think about 00:17:53.680 |
We take risk, we drive assertively with speed 00:17:57.760 |
That is not how autonomous vehicles today operate. 00:18:06.760 |
really there's still significant investment going into them. 00:18:11.160 |
And there's not good economics in the near term 00:18:14.280 |
of how to make them obviously significantly cheaper. 00:18:22.360 |
the taxi service because of the human experience. 00:18:26.300 |
In the same way, autonomy will only take over 00:18:40.080 |
This video and many others that we're putting out 00:18:48.540 |
the ability of the car to sense everything you're doing 00:18:50.920 |
from the activity of the driver to the driver's attention 00:18:55.080 |
and being able to transfer control back and forth 00:18:57.680 |
in a playful way, but really in a serious way. 00:19:15.620 |
because many of the people that'll be speaking at this class 00:19:18.840 |
and many of the people that are working on this problem 00:19:27.000 |
that once we solve the autonomous vehicle problem, 00:19:31.480 |
I believe you first have to make it fun as hell 00:19:40.860 |
In the language that we're talking about here, 00:19:43.820 |
there's several levels of autonomy that are defined 00:19:50.900 |
level three, four and five, increasing automation. 00:19:53.420 |
So level two is when the driver is still responsible. 00:19:58.620 |
less and less responsibility, but really in three, four, five 00:20:07.180 |
So there's only really two, as far as I'm concerned, 00:20:09.760 |
levels, human-centered autonomy and full autonomy. 00:20:13.220 |
Human-centered means the human is responsible. 00:20:25.900 |
That means full autonomy does not allow for teleoperation. 00:20:37.000 |
because that means the human is still in the loop. 00:20:58.740 |
It has to be able to pull off to the side of the road 00:21:08.120 |
And so how do we envision these two levels of automation 00:21:12.780 |
proliferating society, getting deployed at a mass scale? 00:21:21.160 |
the way to think about it with the predictions 00:21:35.620 |
One is last mile delivery of goods and service, 00:21:42.780 |
These are zero occupancy vehicles delivering groceries 00:21:50.740 |
it's slow moving transport to the destination 00:21:54.640 |
where most of the tricky driving along the way 00:21:56.900 |
is done manually, and then the last mile delivery 00:22:01.220 |
is done by zero occupancy autonomous vehicles. 00:22:06.220 |
Trucking on the highway, possibly with platooning 00:22:09.980 |
where a sequence of trucks follow each other. 00:22:14.580 |
as a pretty well-defined problem of highway driving 00:22:25.420 |
and globally on the highway driving is automatable. 00:22:30.560 |
kind of like what a lot of these companies are working on, 00:22:38.380 |
There's certain pickup locations you're allowed to go to, 00:22:41.580 |
there are certain drop-off locations, that's it. 00:22:53.940 |
The closed communities, something Oliver Cameron 00:22:58.880 |
with Voyage is working on defining and Optimus Ride, 00:23:14.660 |
you have slow moving transport that gets people 00:23:30.480 |
and it comes to you autonomously with nobody in there. 00:23:36.460 |
So imagine a world where we have empty vehicles 00:23:39.980 |
driving around, delivering themselves to you. 00:23:58.140 |
kind of like what Tesla is working towards most recently, 00:24:06.740 |
liability-wise and in terms of just observing the vehicle 00:25:03.180 |
So here shown on the bottom is our conventional approach. 00:25:09.380 |
because of traffic lights that turn red, green, yellow, 00:25:18.620 |
to significantly increase the traffic flow through a city. 00:25:47.500 |
constraining the problem to such a degree that, 00:25:50.940 |
I mean, the idea of autonomy just is completely transformed 00:25:56.160 |
a car is able to transform itself into a mini train, 00:26:07.680 |
and not necessarily drive, be driven 200 miles an hour, 00:26:40.400 |
somebody without a pilot's license, for example, 00:26:45.280 |
Making that experience accessible to regular people 00:26:53.960 |
one of the companies really seriously working on this 00:27:02.240 |
and the idea is that you would meet your vehicle 00:27:08.200 |
you take an elevator, you meet them at the roof of the, 00:27:15.120 |
and they're seriously addressing this problem. 00:27:18.060 |
Many of the great solutions to the world's problems 00:27:23.600 |
So, let's not laugh too loud at these possibilities. 00:27:28.260 |
Back in my day, we used to drive in the street. 00:27:49.520 |
fully autonomous cars operating on public roads 00:28:15.200 |
saying no one in the next 50 years will do it. 00:28:30.280 |
and probably some people in this room have used, 00:28:34.140 |
Obviously, they were the leaders in the space, 00:28:45.440 |
and that's why there's a lot of autonomous vehicle companies. 00:28:47.880 |
Some companies are taking advantage of the hype 00:28:51.760 |
and the fact that there's a lot of investment in the space, 00:28:54.220 |
but some companies, like some of the speakers 00:29:15.160 |
working towards trying to become fully autonomous, 00:29:18.700 |
and Waymo, starting with the fully autonomous, 00:29:20.780 |
working towards achieving scale at the fully autonomous, 00:29:38.140 |
when Stanley from Stanford won a race to the desert. 00:29:46.980 |
that the autonomous vehicle problem is solved in 2005. 00:30:11.780 |
It's really a mechanical engineering problem. 00:30:16.700 |
it's not really an autonomous driving problem 00:30:22.020 |
the DARPA put together the Urban Grand Challenge, 00:30:30.260 |
the thought was, at that point, that's it, we're done. 00:30:51.460 |
with the DARPA Grand Challenge and solving that, 00:30:53.620 |
that we solve the fundamental problem of autonomy. 00:31:19.900 |
that the other idea is that humans are bad at driving. 00:31:57.180 |
So when you put a human in a room with a robot 00:32:11.840 |
People are, they tune out, they overtrust the system, 00:32:38.900 |
vehicle to pedestrian nonverbal communication 00:32:47.180 |
millions of times every single day in Boston. 00:32:50.740 |
Subtle nonverbal communication between vehicles. 00:32:55.360 |
You have to solve all the crazy road conditions 00:33:09.620 |
you have 100, 200 milliseconds to make a decision. 00:33:16.460 |
Then you have to deal with the nonverbal communication 00:33:26.780 |
You have to not only understand what they're, 00:33:30.260 |
the intent of the movement that's anticipated. 00:33:35.180 |
So anticipating the trajectory of the pedestrian, 00:33:37.180 |
you also have to assert yourself in a game theoretic way. 00:33:45.020 |
You have to take a risk that if I don't slow down, 00:34:03.660 |
based on the simple physics of the current velocity, 00:34:15.040 |
is something that we have to incorporate into algorithms 00:34:21.160 |
So if driving is easy, we have to solve that too. 00:34:26.320 |
with the coast runners and the boat going around 00:34:52.820 |
If there's any learning that as I showed yesterday, 00:35:06.620 |
So if the objective function is to maximize the reward, 00:35:09.060 |
you can slam into the wall over and over and over again, 00:35:11.500 |
and that's actually the way to optimize the reward. 00:35:20.100 |
to the objective function without a human in the loop. 00:35:36.280 |
The fact that we have 540 million years worth of data 00:35:40.720 |
means we don't understand how damn impressive it is 00:35:43.920 |
to be able to perceive and understand the scene 00:35:54.560 |
about anticipating the physics of the scene and so on. 00:36:01.020 |
The humans don't give enough credit to ourselves. 00:36:16.960 |
I think there's like four or five times he scores. 00:36:31.640 |
And that's all the movement and so on involved with that. 00:36:57.280 |
And the ones on the right are actually not fully autonomous. 00:37:05.240 |
in terms of our ability to understand the world 00:37:07.680 |
and in terms of our ability to act in that world. 00:37:10.360 |
And the fact that humans, the idea, the view, 00:37:29.080 |
Most of the experiments are actually in the lab. 00:37:34.900 |
you put a undergrad, grad student in a lab and say, 00:37:39.520 |
"Here, watch this screen and wait for the dot to appear." 00:37:45.740 |
But when it's your life and you're on the road, 00:37:48.440 |
it's just you in the car, it's a different experience. 00:37:51.640 |
It's not completely obvious the vigilance will be lost. 00:37:58.480 |
it's not completely obvious what the psychology, 00:38:05.240 |
So one of the things we did is we instrumented here 00:38:07.360 |
22 Teslas and observe people now over a period of two years 00:38:11.320 |
of what they actually do when they're driving on a pilot, 00:38:17.500 |
and cyan showed vehicle control on autopilot. 00:38:32.660 |
And in 26,000 moments of transfer of control, 00:38:47.260 |
to a critical situation, to a challenging road situation. 00:38:55.720 |
that's 0.1% or less than the full Tesla fleet 00:39:01.540 |
that has autopilot, but it's still an inkling. 00:39:06.100 |
to build a system that works together with a human being. 00:39:14.020 |
Some percentage, 90%, maybe less, maybe more. 00:39:17.420 |
When it can solve the problem of autonomous driving, 00:39:35.140 |
All the problems have to be solved exceptionally, 00:39:40.420 |
to the scene perception, to control, to planning, 00:39:43.620 |
to being able to find a safe harbor at any moment, 00:39:57.980 |
if you want to solve the fully autonomous problem. 00:40:04.380 |
And if you approach the shared autonomy side, 00:40:06.860 |
the semi-autonomous, where you're only responsible 00:40:09.620 |
for a large percentage, but not 100% of the driving, 00:40:14.640 |
the human interaction, the sensing what the driver is doing, 00:40:18.100 |
the collaborating, communicating with the driver, 00:40:20.800 |
and the personalization aspect that learns with the driver. 00:40:28.220 |
we have a lot of demonstrations of these kinds of ideas, 00:40:37.320 |
So it's as simple as, so this is just demonstration 00:40:47.860 |
of a vehicle taking control when the attention over time, 00:41:27.660 |
when it's not, and communicating effectively, 00:41:31.580 |
that problem is a fundamental machine learning problem. 00:41:40.620 |
So we have data, we have complicated human beings, 00:42:13.580 |
they're the highest resolution of information available. 00:42:20.020 |
which is exactly what neural networks love, right? 00:42:22.920 |
So to be able to cover all the crazy edge cases, 00:42:45.340 |
all the major data sets of visible light cameras. 00:42:53.340 |
whoever designed the simulation that we're all living in, 00:42:58.660 |
our world, our roads and our world is designed 00:43:11.660 |
most of the road textures that you use to navigate, 00:43:17.060 |
to drive are visible, are made for human eyes. 00:43:29.020 |
You make errors because driving is ultimately 00:43:45.140 |
taking a very particular constrained set of roads, 00:43:53.900 |
and then using the most accurate sensors available, 00:43:57.980 |
a Swedish sensors, but really LiDAR at the forefront, 00:44:06.540 |
especially when machine learning is not evolved, 00:44:16.620 |
It's not so much true for machine learning methods. 00:44:31.700 |
in perceiving the world using LiDAR primarily 00:44:37.220 |
And therefore they're not learning over time. 00:44:47.820 |
And there's only a tiny percentage of cars in the world, 00:45:04.140 |
it's kind of like the offensive line of football. 00:45:06.900 |
They're actually the ones that do all the work 00:45:14.780 |
to actually do the detection in terms of obstacle, 00:45:17.780 |
the most critical, safety critical obstacle avoidance. 00:45:30.660 |
to achieve any kind of degree of high autonomy. 00:45:53.700 |
visualized LiDAR on the right, radar on the left. 00:45:57.900 |
The resolution is just much higher and is improving 00:46:07.100 |
in terms of the amount of information transferred per frame. 00:46:11.540 |
And everybody, you know, really the scale of, 00:46:16.540 |
the number of vehicles that have this equipped is humongous. 00:46:19.860 |
So it's ripe for application of deep learning. 00:46:42.540 |
So LiDAR works in the dark, variable lighting conditions, 00:46:47.180 |
has pretty good resolution, has pretty good range, 00:46:55.100 |
and it doesn't provide rich textural contrast information. 00:47:00.900 |
And it's also sensitive to fog and rain conditions. 00:47:04.460 |
Now, ultrasonic sensors catch a lot of those problems. 00:47:09.900 |
They're high resolution in objects that are close, 00:47:16.740 |
in the sensor fusion package for an autonomous vehicle. 00:47:20.740 |
They really catch a lot of the problems that radar has. 00:47:42.300 |
There's very little information being provided. 00:47:50.620 |
They're cheap, they're small, range is great. 00:47:59.780 |
but doesn't work in the dark in extreme conditions, 00:48:01.900 |
and it's just susceptible to all these kinds of problems. 00:48:12.780 |
and everybody works together to build an entire picture. 00:48:22.660 |
Tesla is using, which is ultrasonic, radar, and camera, 00:48:29.900 |
that actually the suite of camera, radar, and ultrasonic 00:48:37.100 |
So those are the two comparisons that we have. 00:48:40.580 |
You have the costly non-machine learning way of LiDAR, 00:48:44.540 |
and you have the cheap, but needs a lot of data, 00:48:56.460 |
Now, of course, way more, so we'll talk about 00:49:12.900 |
When you say, oh, crap, I don't know what to do, 00:49:26.140 |
Fail safe is maps, that you can't rely on the human, 00:49:36.180 |
that you're able to, you have such good maps, 00:49:41.220 |
that the fundamental problem of obstacle avoidance, 00:49:44.060 |
which is what safety is about, can be solved. 00:49:49.060 |
The question is, what kind of experience that creates. 00:49:58.620 |
Ford F-150 is still the most popular car in America. 00:50:06.220 |
Semi-autonomous cars, every company is now releasing 00:50:38.740 |
very excited about, and machine learning researchers 00:50:42.140 |
are excited about, which fundamentally relies on huge data 00:50:46.780 |
The neural networks that are running inside the Tesla, 00:50:51.600 |
and with their new, it's kind of the same kind of path 00:50:58.500 |
Tesla is taking from the NVIDIA Drive PX2 system, 00:51:08.580 |
That kind of path that others are beginning to embrace, 00:51:15.020 |
And then people that are maybe more grounded, 00:51:19.020 |
and actually wanna really value safety, reliability, 00:51:30.220 |
it's difficult to work with, it's not reliable, 00:51:35.220 |
and so in that sense we have to have a sensor suite 00:51:48.960 |
Stop signs and traffic lights, pedestrians and so on. 00:51:52.040 |
Some of them, if you hit them, it's a problem. 00:51:55.480 |
Some of them are a bag flying through the air, 00:51:58.540 |
and all have different visual characteristics, 00:52:14.280 |
I think fog or smoke, these are interesting things. 00:52:17.840 |
They might look like an object to certain sensors 00:52:20.200 |
and not to others, but the traffic light detection problem, 00:52:25.200 |
luckily with cameras, is pretty solved at this point. 00:52:33.880 |
The hard part is when you have a green light, 00:52:36.080 |
and there's a drunk, drug, drowsy, or distracted, 00:52:47.660 |
So the road ahead for us, as engineers, as scientists, 00:52:57.200 |
from having these, even if they're large, toy data sets, 00:53:10.640 |
all the exciting deep RL stuff that we'll talk about 00:53:13.520 |
in the future weeks, really are toy examples. 00:53:18.300 |
But taking those algorithms and putting them in cars 00:53:29.180 |
for artificial intelligence in the 21st century, 00:53:39.400 |
We get to deal with the psychology, the philosophy, 00:53:44.420 |
how we socially think about it, to the robotics problem, 00:53:50.940 |
And we have many guest speakers exploring that 00:53:54.780 |
in different ways, and that's really exciting 00:53:57.900 |
to see how these people are trying to change the world. 00:54:01.220 |
So with that, I'd like to thank you very much.