back to indexGavin Miller: Adobe Research | Lex Fridman Podcast #23
00:00:00.000 |
The following is a conversation with Gavin Miller. 00:00:04.840 |
Adobe has empowered artists, designers, and creative minds 00:00:07.720 |
from all professions working in the digital medium 00:00:10.560 |
for over 30 years with software such as Photoshop, 00:00:13.680 |
Illustrator, Premiere, After Effects, InDesign, Audition, 00:00:17.600 |
software that work with images, video, and audio. 00:00:21.300 |
Adobe Research is working to define the future evolution 00:00:24.520 |
of these products in a way that makes the life 00:00:26.800 |
of creatives easier, automates the tedious tasks, 00:00:29.780 |
and gives more and more time to operate in the idea space 00:00:35.120 |
This is where the cutting edge deep learning methods 00:00:42.140 |
Gavin is the embodiment of combining tech and creativity. 00:00:48.940 |
and builds robots, both things that are near and dear 00:00:57.860 |
If you enjoy it, subscribe on YouTube, iTunes, 00:01:06.060 |
And now, here's my conversation with Gavin Miller. 00:01:12.940 |
leading a lot of innovative efforts in applications of AI, 00:01:20.200 |
but you're also yourself an artist, a poet, a writer, 00:01:28.760 |
that I will not spend the entire time we have together 00:01:33.560 |
I have to sprinkle it in at least a little bit. 00:01:35.880 |
So, some of them are pretty deep and profound, 00:01:40.520 |
Let's start with a few lines from the silly variety. 00:01:56.840 |
So, it opens with, "And now, dessert is near. 00:02:10.820 |
So, where does that love for poetry come from for you? 00:02:17.740 |
how does it all fit together in the bigger puzzle 00:02:25.520 |
That was a poem I wrote when I'd been to my doctor 00:02:28.120 |
and he said, "You really need to lose some weight 00:02:30.880 |
And whilst the rational part of my brain wanted to do that, 00:02:35.160 |
the irrational part of my brain was protesting 00:02:41.520 |
Taken to an extreme, I thought it would be funny. 00:02:43.580 |
Obviously, it's a serious topic for some people. 00:02:46.320 |
But I think for me, I've always been interested in writing 00:02:55.920 |
And sometimes there are parallel strands in your life 00:02:58.320 |
that carry on and one is more about your private life 00:03:01.440 |
and one's more about your technological career. 00:03:05.800 |
And then at sort of happy moments along the way, 00:03:15.000 |
- Do you think your writing, the art, the poetry 00:03:17.120 |
contribute indirectly or directly to your research, 00:03:24.520 |
imagine a future in a science fiction kind of way. 00:03:30.140 |
I think, well, why shouldn't I just build that? 00:03:35.820 |
when realistic voice synthesis first started in the 90s 00:03:44.140 |
I sort of sat down and started writing a poem 00:03:46.340 |
which each line I would enter into the voice synthesizer 00:03:49.380 |
and see how it sounded and sort of wrote it for that voice. 00:03:53.240 |
And at the time the agents weren't very sophisticated. 00:04:09.660 |
sort of telling the user to go home and leave them alone. 00:04:23.880 |
it's becoming more within the bounds of possibility. 00:04:31.140 |
I had a project at home where I did sort of a smart home. 00:04:45.420 |
and I'd leave the clothes in there for three days 00:04:49.740 |
it would say, "Don't forget the washing," and so on. 00:04:58.580 |
would send that over wireless radio to the agent 00:05:01.140 |
who would then play sounds that matched the image 00:05:05.180 |
So I was kind of in love with this idea of magical realism 00:05:07.960 |
and whether it was possible to do that with technology. 00:05:14.820 |
sort of intrigued me from a literary point of view 00:05:18.860 |
I think more recently, I've also written plays. 00:05:24.660 |
and obviously you write a fixed set of dialogue 00:05:29.180 |
But with modern agents, as you design a personality 00:05:35.460 |
I kind of have imaginary dialogue in my head. 00:05:41.580 |
but for it to really know what it's talking about? 00:05:51.660 |
But you rapidly realize that it's kind of just 00:05:57.940 |
real world knowledge about the thing it's describing. 00:06:07.260 |
So it sounds as though it really understands it. 00:06:13.140 |
But if it only has one way of referring to something, 00:06:22.220 |
and give a similar kind of response that people would, 00:06:40.320 |
Being able to generate different kinds of statements 00:06:43.060 |
about the same feature. - Right, so in my team, 00:06:50.460 |
or making up full sentences about what's in the image, 00:06:56.300 |
finding another image that matches the new sentence 00:07:04.820 |
and it synthesizes an asset that matches the description. 00:07:11.440 |
My early days in my career were about 3D computer graphics, 00:07:24.460 |
where people would model light and color and shape 00:07:39.100 |
in computer image generation using AI algorithms. 00:07:43.780 |
- So the creative process is more in the space of ideas 00:08:15.540 |
and then flow after you've made the brushstroke, 00:08:20.500 |
who want to create something really expressive 00:08:26.860 |
And then as certain other tasks become automated, 00:08:30.740 |
it frees the artists up to focus on the inspiration 00:08:35.660 |
So I'm thinking about different ideas, obviously. 00:08:49.080 |
And that used to take up an awful lot of time for artists. 00:08:55.260 |
And one of the targets of AI is actually to reason about, 00:08:59.500 |
from the first example of what are the lightning intent 00:09:09.460 |
so that it looks nicely artistic in that way? 00:09:26.940 |
to just make it easier and faster and cheaper to do 00:09:33.580 |
So everyone wants beautiful artwork for everything 00:09:41.820 |
as some of these things have automatic versions of them, 00:09:50.860 |
to being either the art director or the conceptual artist. 00:10:20.160 |
That said, everything I do is really manually. 00:10:24.700 |
And I set up, I use this old school Kinesis keyboard 00:10:33.340 |
of just making sure there's as few clicks as possible 00:10:45.580 |
where does AI, if you could speak a little more to it, 00:11:04.980 |
So we have a very rich array of algorithms already 00:11:09.060 |
in Photoshop, just classical procedural algorithms 00:11:14.620 |
In some cases, they end up with a large number of sliders 00:11:20.100 |
So one way in which AI can help is just an auto button, 00:11:31.900 |
So that's a very kind of make life easier for people 00:11:36.140 |
whilst making use of common sense from other example images. 00:11:43.420 |
Another one is something we've spent a lot of work 00:11:49.580 |
well 19, thinking about selection, for instance, 00:11:58.820 |
and figure out how to sort of flood fill into regions 00:12:04.780 |
But that algorithm had no visual common sense 00:12:12.940 |
And it was a big improvement over the previous work 00:12:16.180 |
where you had sort of almost click everything by hand, 00:12:26.940 |
to actually do a great job with say a single click, 00:12:48.420 |
Or they can be a great start for like tweaking it. 00:12:59.300 |
and essentially remove the background behind you. 00:13:10.580 |
it may come again in the future, but for now. 00:13:13.620 |
- So that sometimes makes it a little more challenging 00:13:17.420 |
How difficult do you think is that problem for AI 00:13:20.020 |
for basically making the quick selection tool 00:13:25.140 |
- Well, we have a lot of research on that already. 00:13:28.420 |
If you want a sort of quick, cheap and cheerful, 00:13:37.540 |
And you can do that today with a single click 00:13:44.260 |
with a little bit more guidance on the boundaries, 00:13:46.580 |
like you might need to touch it up a little bit. 00:13:49.980 |
We have other algorithms that can pull a nice mat 00:13:54.420 |
So we have combinations of tools that can do all of that. 00:14:14.420 |
well, pull at least a selection mask from a moving target, 00:14:17.980 |
like a person dancing in front of a brick wall or something. 00:14:21.060 |
And so it's going from hours to a few seconds 00:14:27.820 |
And then you might go in and touch up a little. 00:14:33.460 |
You know, there's like a journey for an idea, right? 00:14:40.260 |
has elements of just sort of, it inspires the concept. 00:14:44.540 |
It can work pretty well in majority of cases. 00:14:50.940 |
how do you make something that works maybe in all cases 00:15:00.500 |
between academic research and industrial research. 00:15:09.420 |
And we certainly love to be those people too, 00:15:30.820 |
And that might be a case where it's not about 00:15:43.380 |
So we have the luxury of very talented customers. 00:15:50.700 |
But if they can go in and just take it from 99 to 100 00:16:06.420 |
tedious task all the time to much less often. 00:16:19.780 |
- So on that thread, maybe you can untangle something. 00:16:23.820 |
Again, I'm sort of just speaking to my own experience. 00:16:29.140 |
- Maybe that is the most useful I can get at. 00:16:32.020 |
So I think Photoshop, as an example, or Premiere, 00:16:36.380 |
has a lot of amazing features that I haven't touched. 00:16:44.300 |
helping make my life or the life of creatives easier, 00:16:49.300 |
how, this collaboration between human and machine, 00:17:00.140 |
Is it something where you have to watch tutorials 00:17:25.940 |
And we're just like, think of it as Adobe University, 00:17:28.500 |
where you use the tool long enough, you become an expert. 00:17:35.860 |
but you know the important ones you need to get to. 00:17:40.860 |
which actually look at the thousands of hours 00:17:51.500 |
given the last three or four actions you did, 00:17:57.540 |
So if you want some inspiration for what you might do next, 00:18:00.820 |
or you just want to watch the tutorial and see, 00:18:03.300 |
learn from people who are doing similar workflows to you, 00:18:09.340 |
So really trying to use the context of your use of the app 00:18:19.140 |
or in a more assistive way where it could say, 00:18:28.780 |
which is if we really deeply understand the domain 00:18:47.020 |
or just showing the results of if you try this. 00:18:51.820 |
I was in a meeting today thinking about these things. 00:19:05.340 |
And the right thing to do is to give enough at each stage 00:19:14.540 |
- Are you aware of this gigantic medium of YouTube 00:19:19.260 |
that's creating just a bunch of creative people, 00:19:22.860 |
both artists and teachers of different kinds? 00:19:27.180 |
And the more we can understand those media types, 00:19:29.900 |
both visually and in terms of transcripts and words, 00:19:33.220 |
the more we can bring the wisdom that they embody 00:19:35.780 |
into the guidance that's embedded in the tool. 00:19:39.740 |
To remove the barrier from having to yourself 00:19:54.100 |
but does it modify the interface to be simpler? 00:20:04.460 |
I like to say that if you add a feature to a GUI, 00:20:12.740 |
Whereas if you have an assistant with a new skill, 00:20:15.780 |
if you know they have it, so you know to ask for it, 00:20:18.260 |
then it's sort of additive without being more intimidating. 00:20:28.260 |
of being able to master that complex interface 00:20:31.180 |
and keyboard shortcuts like you were talking about earlier, 00:20:40.540 |
And other people just want to get something done quickly 00:20:54.840 |
Whereas somebody who's in a deep post-production workflow 00:20:58.780 |
maybe want to be on a laptop or a big screen desktop 00:21:07.260 |
to really express the subtlety of what they want to do. 00:21:25.780 |
automatic image captioning, like we mentioned, 00:21:40.060 |
Can you talk through a favorite or some of them 00:21:53.660 |
'cause there's visual elements to all of them 00:21:59.100 |
- Why they're interesting for different reasons 00:22:08.420 |
It's almost like a, if you think of an assembly language, 00:22:28.300 |
It then mattes in the sky behind the foreground, 00:22:38.220 |
to recolor the foreground of the image that you're editing. 00:22:41.380 |
So if you say go from a midday sky to an evening sky, 00:22:53.820 |
and he has a number of paintings where it's surrealism 00:23:10.700 |
but we'd rather have it be natural by default 00:23:15.780 |
and then you have to do a whole bunch of post-production 00:23:18.740 |
So that's a case where we're kind of capturing 00:23:23.620 |
and doing it in about a second rather than a minute or two. 00:23:27.900 |
And when you do that, you can not just do it once, 00:23:30.300 |
but you can do it for say like 10 different backgrounds 00:23:33.060 |
and then you're almost back to this inspiration idea of, 00:23:43.180 |
as close to final production value as possible. 00:23:47.340 |
you might go back and slightly tweak the selection mask 00:23:49.620 |
just to make it perfect and do that kind of polish 00:23:52.540 |
that professionals like to bring to their work. 00:23:58.660 |
replacing it to different stock images of the sky. 00:24:15.540 |
- Right, so that was something we called concept canvas. 00:24:18.660 |
So normally when you do a, say an image search, 00:24:27.820 |
and it would find the nearest one that had those tags. 00:24:35.020 |
to be able to say, I want a big person in the middle 00:24:36.780 |
or in a dog to the right and umbrella above the left, 00:24:39.420 |
'cause you want to leave space for the text or whatever. 00:24:42.940 |
And so concept canvas lets you assign spatial regions 00:24:48.220 |
And then we've already pre-indexed the images 00:24:50.900 |
to know where the important concepts are in the picture. 00:24:54.140 |
So we then go through that index matching to assets. 00:24:58.140 |
And even though it's just another form of search, 00:25:01.260 |
because you're doing spatial design or layout, 00:25:05.980 |
You sort of feel oddly responsible for the image 00:25:10.940 |
- So it's a good example where giving enough control 00:25:15.940 |
starts to make people have a sense of ownership 00:25:20.620 |
And then we also have technologies in Photoshop 00:25:22.460 |
where you physically can move the dog in post as well. 00:25:25.740 |
But for concept canvas, it was just a very fast way 00:25:29.100 |
to sort of loop through and be able to lay things out. 00:25:31.940 |
- In terms of being able to remove objects from a scene 00:25:45.660 |
And that's so neural networks are stepping in there. 00:25:51.300 |
- Yes, the GANs for doing that is definitely one approach. 00:26:23.020 |
So you don't see any brightness contrasts in that region. 00:26:25.820 |
Or you've gradually ramped from one from dark to light 00:26:30.020 |
Where it gets complicated is if you have to infer 00:26:34.020 |
invisible structure behind the person in front. 00:26:37.540 |
And that really requires a common sense knowledge 00:26:53.540 |
and it looks like it's just sort of crumpled or messed up. 00:26:57.260 |
And so what GANs and neural nets bring to the table 00:27:00.700 |
is this common sense learned from the training set. 00:27:03.740 |
And the challenge right now is that the generative methods 00:27:08.740 |
that can make up missing holes using that kind of technology 00:27:15.640 |
And so you either need to then go from a low resolution 00:27:18.180 |
to a high resolution using some other algorithm, 00:27:22.180 |
and it's still in research to get to that point. 00:27:44.560 |
You really need a very diverse training set of images. 00:27:51.540 |
where you put it out there with some guardrails 00:28:15.560 |
either they vote to say how confident they are 00:28:22.540 |
"You're good at houses, you're good at trees." 00:28:25.680 |
And so, I mean, it's all this adds up to a lot of work 00:28:30.080 |
'cause each of those models will be a whole bunch of work. 00:28:32.320 |
But I think over time, you'd gradually fill out the set 00:28:39.420 |
and then sort of branch out as you get more capable. 00:28:44.080 |
and have you considered maybe looking far into the future, 00:28:54.520 |
a huge amount of people that use Photoshop, for example, 00:29:00.400 |
being able to collect the information by which they, 00:29:05.400 |
basically get information about their workflows, 00:29:11.840 |
whether it is houses or octopus that people work on more. 00:29:16.480 |
- Like basically getting a beat on what kind of data 00:29:20.400 |
is needed to be annotated and collected for people 00:29:23.520 |
to build tools that actually work well for people. 00:29:28.280 |
and the whole world of AI is what data can you gather 00:29:39.800 |
but we want them to teach us what's important 00:29:43.580 |
At the same time, we want to respect their privacy. 00:29:51.480 |
And I think the modern spirit of the age around this 00:29:57.560 |
how they're benefiting from sharing their data 00:30:06.440 |
or if they're friendly to your cause or your tool 00:30:11.800 |
'cause they depend on you for their livelihood, 00:30:14.360 |
they may be willing to share some of their workflows 00:30:18.040 |
or choices with the dataset to be then trained. 00:30:23.040 |
There are technologies for looking at learning 00:30:27.560 |
without necessarily storing all the information permanently 00:30:36.680 |
So we're definitely exploring all of those possibilities. 00:30:39.240 |
- And I think Adobe exists in a space where Photoshop, 00:30:43.200 |
like if I look at the data I've created and own, 00:30:55.080 |
there's an obvious benefit for sharing the data 00:31:02.720 |
because it's helping improve the workflow in the future. 00:31:06.840 |
it's not clear what the benefit is in social networks. 00:31:11.280 |
I mean, I think there are some professional workflows 00:31:16.200 |
such as if I was preparing evidence for a legal case, 00:31:22.720 |
phoning home to help train the algorithm or anything. 00:31:28.840 |
having a trial version or they're doing some, 00:31:33.240 |
where somebody has a more permissive relationship 00:31:36.400 |
with Adobe where they explicitly say, I'm fine. 00:31:44.480 |
and in exchange for some benefit, tangible or otherwise, 00:31:54.160 |
is to capture relatively crude high-level things 00:31:57.600 |
from more people and then more detailed knowledge 00:32:02.280 |
We do that today with explicit customer studies 00:32:20.000 |
because one of the things people treasure about Adobe 00:32:34.560 |
of thinking about AI rather than an afterthought. 00:32:37.520 |
- Well, when you start that program, sign me up. 00:32:42.480 |
- Is there other projects that you wanted to mention 00:32:53.520 |
because you might think of Adobe as only thinking in 2D, 00:33:01.920 |
where we're actually thinking more three-dimensionally 00:33:04.280 |
about how to assign features to faces so that we can, 00:33:07.440 |
you know, if you take, so what Puppetron does, 00:33:09.580 |
it takes either a still or a video of a person talking, 00:33:13.680 |
and then it can take a painting of somebody else 00:33:20.840 |
And it's unlike a sort of screen door post filter effect 00:33:30.480 |
It really looks as though it's sort of somehow attached 00:33:36.240 |
And so that's the case where even to do a 2D workflow 00:33:39.440 |
like stylization, you really need to infer more 00:33:44.280 |
And I think as 3D computer vision algorithms get better, 00:33:48.680 |
initially they'll focus on particular domains like faces 00:33:52.000 |
where you have a lot of prior knowledge about structure 00:33:54.400 |
and you can maybe have a parameterized template 00:34:05.000 |
that you're doing 3D reconstruction, but under the hood, 00:34:08.840 |
but it might then let you do edits much more reliably 00:34:15.920 |
- And, you know, the face is a very important application, 00:34:23.800 |
If you do something uncanny, it's very disturbing. 00:34:30.080 |
So in the space of augmented reality and virtual reality, 00:34:39.600 |
in the content we consume as people, as consumers 00:34:48.880 |
So I think VR and AR serve slightly different purposes. 00:34:52.880 |
So VR can really transport you to an entire immersive world, 00:35:01.000 |
To that extent, it's a bit like a really, really 00:35:03.240 |
widescreen television where it sort of snaps you out 00:35:08.280 |
And I think it's still evolving in terms of the hardware. 00:35:14.560 |
trying to solve the latency and sort of nausea problem, 00:35:17.200 |
which we did, but it was very expensive and a bit early. 00:35:22.720 |
I think, and increasingly those devices are becoming 00:35:25.560 |
all in one rather than something that's tethered to a box. 00:35:28.920 |
I think the market seems to be bifurcating into things 00:35:32.800 |
for consumers and things for professional use cases, 00:35:47.480 |
So I think for that, where you need a sense of scale 00:35:52.400 |
I think AR holds the promise of sort of taking 00:35:57.200 |
digital assets off the screen and putting them 00:36:02.040 |
on the table in front of you, on the wall behind you. 00:36:04.880 |
And that has the corresponding need that the assets 00:36:12.400 |
I mean, it's a bit like having a live theater troupe 00:36:20.040 |
at stately homes in England for the National Trust, 00:36:24.800 |
and even they'd walk the audience through the rooms 00:36:33.240 |
And I think AR will have the same issue that, 00:36:36.920 |
if you have a tiny table in a big living room or something, 00:36:43.360 |
And there's a little bit of a tension between fidelity, 00:36:47.600 |
where if you captured Sayin Uriah doing a fantastic ballet, 00:36:52.600 |
you'd want it to be sort of exactly reproduced, 00:37:00.560 |
might be walking around the room doing some gestures, 00:37:07.880 |
- And do you think fidelity is that important in that space, 00:37:12.840 |
- I think it may depend on the characteristic of the media, 00:37:17.960 |
then it may be that you want to catch every nuance, 00:37:20.120 |
and they don't want to be reanimated by some algorithm. 00:37:32.080 |
then it doesn't matter if the frog moves in a different way. 00:37:35.640 |
I think a lot of the ideas that have sort of grown up 00:37:42.080 |
once they're needing adaptive characters in AR. 00:37:47.880 |
that allow creators to create in the augmented world, 00:37:52.560 |
basically making a Photoshop for the augmented world? 00:38:03.000 |
That's actually been shown publicly as one example in AR. 00:38:06.280 |
Where we're particularly excited at the moment is in 3D. 00:38:14.800 |
and we believe that it's a worthwhile experiment 00:38:25.840 |
just like applications that we're talking about? 00:38:32.240 |
you'd sort of have a plan view and a side view 00:38:34.960 |
and you'd sort of be dragging it around with a mouse, 00:38:39.560 |
Whereas if you were really laying out objects, 00:38:48.040 |
They'd be in stereo, so you'd have a sense of depth, 00:38:50.920 |
'cause you're already wearing the depth glasses, right? 00:38:53.880 |
So it would be those sort of big, gross motor, 00:39:09.680 |
like very accurate constraints on, say, a CAD model 00:39:13.640 |
or something that may be better done on a desktop, 00:39:16.960 |
but it may just be a matter of inventing the right UI. 00:39:22.440 |
because there will be this potential explosion of demand 00:39:29.560 |
and more real-time animation on conventional screens, 00:39:40.920 |
- You've mentioned quite a few interesting new ideas. 00:39:43.680 |
And at the same time, there's old timers like me 00:39:52.600 |
But the opposed all change at all costs kind of. 00:39:56.480 |
Is there, when you're thinking about creating new interfaces, 00:40:00.760 |
do you feel the burden of just this giant user base 00:40:09.800 |
any new idea comes at a cost that you'll be resisted? 00:40:14.080 |
- Well, I think if you have to trade off control 00:40:26.280 |
you have more convenience and just as much control, 00:40:43.040 |
And to some extent, there's been a lot of brilliant thought 00:40:53.360 |
like a single click is good enough to select an object 00:40:58.560 |
that actually fits in quite nicely to the existing tool set, 00:41:02.600 |
either as an optional mode or as a starting point. 00:41:05.840 |
I think where we're looking at radical simplicity, 00:41:09.120 |
where you could encapsulate an entire workflow 00:41:14.160 |
then sometimes that's easier to do in the context 00:41:16.640 |
of either a different device, like a mobile device, 00:41:19.600 |
where the affordances are naturally different, 00:41:22.080 |
or in a tool that's targeted at a different workflow, 00:41:35.080 |
video editing for a certain class of media output 00:41:48.720 |
if I'm feeling like doing Premiere, big project, 00:41:56.800 |
But if I want to do something to show my recent vacation, 00:42:05.600 |
rather than the four hours I'd need to do it at work. 00:42:12.600 |
it really is much faster to get the same output, 00:42:16.920 |
have a much richer toolkit and more flexibility 00:42:21.720 |
- And then at the same time, with the flexibility control, 00:42:27.200 |
of using AI to coach you to like what Google has, 00:42:39.240 |
And then you almost, that's almost an educational tool. 00:42:43.560 |
- To show, because sometimes when you have all this control, 00:43:05.480 |
Where I'm stuck, I need to know what to look for, 00:43:20.800 |
of workflows and everything to maybe suggest something 00:43:23.080 |
to go and learn about or just to try or show the answer. 00:43:43.480 |
So first few lines of a recent poem of yours, 00:43:52.480 |
Today I left my phone at home and went down to the sea. 00:44:02.760 |
So this is a poem about you leaving your phone behind 00:44:11.880 |
and let's see if we can talk about it, figure it out. 00:44:32.300 |
is part of doing this, creating this illusion. 00:44:38.020 |
but how do you think we should adjust as human beings 00:44:41.340 |
to live in this digital world that's partly artificial, 00:44:45.140 |
that's better than the world that we lived in 00:44:49.740 |
a hundred years ago when you didn't have Instagram 00:44:59.720 |
- We're using the tooling of modifying the images 00:45:19.560 |
I actually wonder if 18th century aristocrats 00:45:22.440 |
who commissioned famous painters to paint portraits of them 00:45:30.280 |
- So human desire to put your best foot forward 00:45:35.120 |
I think it's interesting, you sort of framed it in two ways. 00:45:43.000 |
and visualize them, is that a good or bad thing? 00:45:48.000 |
and words and poetry, which still resides sometimes 00:45:51.120 |
on websites, but we've become a very visual culture 00:45:56.600 |
In the 19th century, we're very much a text-based culture. 00:46:10.240 |
I think it depends on how harmless your intent. 00:46:22.400 |
that you pick out of the photos that you have 00:46:24.360 |
in your inbox to say, "This is what I look like," 00:46:31.360 |
If someone's gonna judge you by how you look, 00:46:34.820 |
then they'll decide soon enough when they meet you 00:46:43.880 |
hold themselves up to an impossible standard, 00:46:46.180 |
which they then feel bad about themselves for not meeting. 00:46:53.240 |
But I think the ability to imagine and visualize 00:47:00.360 |
sometimes which you then go off and build later, 00:47:14.880 |
I think, I used to worry about exploration actually, 00:47:28.600 |
was to know what it would look like when you got there. 00:47:38.200 |
that it may take the edge off actually wanting to go, 00:47:44.720 |
you know, by the time we finally get to Mars, 00:47:53.320 |
I mean, I think Pluto was a fantastic recent discovery 00:47:57.040 |
where nobody had any idea what it looked like 00:47:59.080 |
and it was just breathtakingly varied and beautiful. 00:48:01.960 |
So I think expanding the ability of the human toolkit 00:48:06.640 |
to imagine and communicate on balance is a good thing. 00:48:10.880 |
I think there are abuses, we definitely take them seriously 00:48:17.600 |
I think there's a parallel side where the public needs 00:48:21.080 |
to know what's possible through events like this, right? 00:48:24.320 |
So that you don't believe everything you read 00:48:27.600 |
in print anymore and it may over time become true 00:48:57.120 |
and then Henry VIII wasn't pleased and, you know. 00:49:00.800 |
- History doesn't record whether Anne was pleased, 00:49:02.600 |
but I think she was pleased not to be married 00:49:06.080 |
So, I mean, this has gone on for a long time, 00:49:08.040 |
but I think it's just part of the magnification 00:49:14.440 |
- You've kind of built up an amazing research environment 00:49:37.160 |
and you tend to keep revisiting them through time. 00:49:40.440 |
If you're lucky, you pick one that doesn't just get solved 00:49:43.160 |
in the next five years and then you're sort of out of luck. 00:49:50.120 |
From the point of view of industrial research, 00:50:00.480 |
you end up investing a lot in a particular idea. 00:50:05.140 |
And if you're not careful, people can get too conservative 00:50:12.000 |
And interns let you explore the more fanciful 00:50:16.040 |
or unproven ideas in a relatively lightweight way, 00:50:20.280 |
ideally leading to new publications for the intern 00:50:24.520 |
And it gives you then a portfolio from which to draw 00:50:27.600 |
which idea am I gonna then try to take all the way through 00:50:30.100 |
to being robust in the next year or two to ship. 00:50:40.480 |
Many of our greatest researchers were former interns. 00:50:46.400 |
so we can get to know and build an enduring relationship 00:50:51.160 |
whom we often do academic gift funds to as well 00:50:54.120 |
as an acknowledgement of the value the interns add 00:51:01.480 |
And then the long-term legacy of a great research lab 00:51:04.920 |
hopefully will be not only the people who stay 00:51:09.280 |
and then go off and carry that same model to other companies. 00:51:12.840 |
And so, we believe strongly in industrial research 00:51:18.560 |
And we hope that this model will continue to propagate 00:51:23.640 |
which makes it harder for us to recruit, of course, 00:51:28.760 |
And a rising tide lifts all ships in that sense. 00:51:32.440 |
- And where's the idea born with the interns? 00:51:37.160 |
Is there discussions about, you know, like what- 00:51:50.960 |
- A question I ask at the beginning of every summer. 00:51:54.160 |
So, what will happen is we'll send out a call for interns. 00:51:59.080 |
They'll, we'll have a number of resumes come in. 00:52:15.800 |
And we think we'd love to do one of those projects too. 00:52:18.960 |
And then the intern stays in touch with the mentor, 00:52:26.480 |
at the end of two weeks, they have to decide. 00:52:47.160 |
and sometimes it takes a giant left turn and we go, 00:52:49.600 |
that sounded good, but when we thought about it, 00:52:51.640 |
there's this other project, or it's already been done, 00:52:57.200 |
So, it's pretty, pretty flexible at the beginning. 00:53:10.520 |
And so, in Adobe, we push the needle very much 00:53:23.640 |
So, if the projects ultimately end up impacting the products 00:53:28.840 |
And so, the alternative model, just to be clear, 00:53:33.920 |
who thinks he's a genius and tells everybody what to do, 00:53:47.640 |
there are strategic priorities for the company, 00:53:49.960 |
and there are areas where we do want to invest 00:53:58.680 |
And so, you don't tell people you have to do X, 00:54:01.480 |
but you say X would be particularly appreciated this year. 00:54:05.720 |
And then people reinterpret X through the filter 00:54:07.960 |
of things they want to do and they're interested in. 00:54:10.080 |
And miraculously, it usually comes together very well. 00:54:30.360 |
Once in a while, the product teams sponsor an extra intern 00:54:40.560 |
And then we sort of say, great, I get an extra intern. 00:54:43.120 |
We find an intern, he thinks that's a great problem. 00:54:54.160 |
It's really hard to predict at the beginning of the summer 00:55:05.120 |
Others turn out not to be as novel as we thought 00:55:07.960 |
but they'd be a great feature but not a paper. 00:55:11.440 |
And then others, we make a little bit of progress 00:55:16.400 |
And maybe we revisit that problem several years in a row 00:55:22.120 |
and then it becomes more on track to impact a product. 00:55:26.200 |
- Jumping back to a big overall view of Adobe Research, 00:55:31.200 |
what are you looking forward to in 2019 and beyond? 00:55:35.400 |
What is, you mentioned there's a giant suite of products. 00:55:46.080 |
Where do you think, what do you think the future holds? 00:55:52.040 |
- In terms of the technological breakthroughs? 00:55:56.440 |
especially ones that will make it into product, 00:56:01.760 |
- So I think the creative or the analytics assistants 00:56:04.920 |
that we talked about where they're constantly trying 00:56:09.480 |
and how can they be helpful and make useful suggestions 00:56:14.000 |
And it's very unpredictable as to when it'll be ready 00:56:25.320 |
like generative adversarial networks are immensely promising 00:56:29.400 |
and seeing how quickly those become practical 00:56:53.640 |
which is a way in which we're pulling our neural nets 00:56:57.920 |
and other intelligence models into a central platform 00:57:05.120 |
So we're in the middle of transitioning from a, 00:57:10.720 |
and they, you sort of hand design it for that use case 00:57:20.600 |
which should mean that the time between a good idea 00:57:22.840 |
and impacting our products will be greatly shortened. 00:57:28.680 |
many of the other products can just leverage it too. 00:57:35.800 |
but that combination of this sort of renaissance in AI, 00:57:42.480 |
and other really exciting emerging technologies. 00:57:47.360 |
you'll sort of basically be dancing with light, right? 00:57:50.400 |
Where you'll have real-time shadows, reflections 00:57:56.320 |
but then with all these magical properties brought by AI 00:57:58.720 |
where it sort of anticipates or modifies itself 00:58:01.760 |
in ways that make sense based on how it understands 00:58:06.640 |
- That's a really exciting future for creative 00:58:12.160 |
So first of all, I work in autonomous vehicles. 00:58:16.320 |
And I think you have a fascination with snakes, 00:58:23.600 |
I mean, their movement is beautiful, adaptable. 00:58:37.640 |
Saw that there's one that's 25 feet in some cases. 00:58:56.440 |
- Well, it actually came out of my work in the '80s 00:58:58.640 |
on computer animation where I started doing things 00:59:01.400 |
like cloth simulation and other kind of soft body simulation 00:59:04.880 |
and you'd sort of drop it and it would bounce 00:59:08.760 |
And I thought, well, what if you animate the spring lengths 00:59:18.680 |
on called the motion dynamics of snakes and worms. 00:59:26.120 |
and then did some of the early computer animation 00:59:30.520 |
- So your interest in robotics started with graphics. 00:59:40.440 |
we actually did a movie called "Her Majesty's Secret Serpent" 00:59:42.960 |
which is about a secret agent snake that parachutes in 00:59:46.240 |
and captures a film canister from a satellite 00:59:48.600 |
which tells you how old fashioned we were thinking back then 00:59:51.080 |
sort of classic 19 sort of 50s or 60s Bond movie 00:59:55.000 |
And at the same time, I'd always made radio controlled ships 01:00:02.120 |
And I thought, well, how hard can it be to build a real one? 01:00:08.200 |
like a 15 year obsession with trying to build 01:00:16.600 |
Then I added wheels and building things in real life 01:00:21.920 |
The thing that appeals to me is I love creating 01:00:25.840 |
the illusion of life, which is what drove me to animation. 01:00:31.520 |
of coordinated freedom that move in a kind of 01:00:34.400 |
biological way, then it starts to cross the uncanny valley 01:00:37.520 |
and to see me like a creature rather than a thing. 01:00:40.920 |
And I certainly got that with the early snakes by S3. 01:00:45.200 |
I had it able to sidewind as well as go directly forward. 01:00:50.000 |
My wife to be suggested that it would be the ring bearer 01:00:53.400 |
So it actually went down the aisle carrying the rings 01:00:56.120 |
and got in the local paper for that, which was really fun. 01:01:09.600 |
These things, the whole idea is that you would, 01:01:18.240 |
And so the very first one, I actually built the controller 01:01:21.840 |
from discrete logic, 'cause I used to do LSI, 01:01:25.360 |
you know, circuits and things when I was a teenager. 01:01:37.680 |
So they were radio controlled rather than autonomous 01:01:58.640 |
that you venture into and learn things about backlash 01:02:04.720 |
which is why what seemed like a good idea doesn't work. 01:02:08.920 |
And then more recently I've been building S9, 01:02:11.920 |
which is a much better engineered version of S3 01:02:15.160 |
where the motors wore out and it doesn't work anymore. 01:02:18.520 |
which is sad given that it was such a meaningful one. 01:02:29.640 |
I, unlike the typical roboticist, I taper my snakes. 01:02:33.560 |
There are good mechanical reasons to do that, 01:02:44.400 |
It actually saves weight and leverage and everything. 01:02:50.360 |
at the International Spy Museum in Washington, DC. 01:02:54.800 |
It was on YouTube and it got its own conspiracy theory 01:03:00.200 |
'cause I work at Adobe, it must be fake graphics. 01:03:03.080 |
And people would write to me, "Tell me it's real." 01:03:05.480 |
You know, they say, "The background doesn't move." 01:03:46.200 |
is now opening up a whole world of true autonomy 01:03:52.320 |
and still having that sort of biomimetic quality 01:03:56.640 |
that people, that appeals to children in particular, 01:04:02.920 |
but children actually think they look charming. 01:04:05.680 |
And I gave a series of lectures at Girls Who Code 01:04:10.600 |
to encourage people to take an interest in technology. 01:04:14.280 |
And at the moment, I'd say they're still more expensive 01:04:22.920 |
It makes me think about doing that very early thing I did 01:04:28.120 |
at Alias with changing the muscle rest lengths. 01:04:31.120 |
If I could do that with a real artificial muscle material, 01:04:37.600 |
rather than motors and gearboxes and everything. 01:04:52.000 |
It forced me to read biology and be curious about things 01:04:55.920 |
that otherwise would have just been, you know, 01:04:59.760 |
Suddenly I'm thinking, how does that snake move? 01:05:03.400 |
I look at the trails that sidewinding snakes leave in sand 01:05:06.200 |
and see if my snake robots would do the same thing. 01:05:17.120 |
which is where the intelligent agent research 01:05:19.160 |
will converge with the vision and voice synthesis 01:05:36.760 |
especially if you have a reasonably good sense 01:05:40.360 |
So not trying to have it so a stranger could walk up 01:05:45.040 |
and have one, but so as a pet owner or a robot pet owner, 01:05:53.840 |
- Or sometimes just a meaningful interaction. 01:05:56.240 |
If you have the kind of interaction you have with a dog, 01:06:03.000 |
- And nevertheless, it feels like a meaningful connection. 01:06:06.960 |
- And one of the things that I'm trying to do 01:06:16.680 |
why it knows something or why it thinks something. 01:06:21.360 |
that it really does know what it's talking about, 01:06:48.040 |
- Do you ever, do you see a future where Adobe 01:06:51.160 |
even expands into the more physical world perhaps? 01:07:04.080 |
- I'd have to say at the moment, it's a twinkle in my eye. 01:07:06.520 |
I think the more likely thing is that we will bring 01:07:15.000 |
And many of the ideas that might take five years 01:07:19.520 |
you can do in a few weeks with digital assets. 01:07:30.440 |
because we'll have been living with those personalities 01:07:35.400 |
And then they'll just say, oh, it's Siri with legs 01:07:47.800 |
and we don't know quite what the experience will be like. 01:07:53.840 |
all of these different strands of my career converge. 01:08:06.360 |
the last few lines of my favorite poem of yours 01:08:13.280 |
as our ideas live through the ideas of others, 01:08:33.000 |
to join a jostling lifting throng as others danced in me. 01:08:45.240 |
millions of people like myself for creating amazing stuff. 01:08:49.560 |
- Oh, thank you. It was a great conversation.