AI Engineering Without Borders

00:00:00.000 | I've been thinking a lot about borders recently for no particular reason and borders are a very

00:00:18.420 | innately human thing if I don't have the right piece of paper I cannot cross this line in the

00:00:23.320 | sand like it's a very very real problem that I face and many many people face every single day

00:00:27.640 | and borders are just one kind of constraint that humans just make up and I think that's very

00:00:33.360 | interesting that we respect borders so much but AI does not AI is a border disrespector it is very

00:00:38.040 | very easily multilingual so if you trained an LLM on mostly mostly English text corpus it's going to

00:00:44.360 | learn other languages just as a side effect it's going to be very natively multimodal because it

00:00:50.680 | can you can turn llama into a vision language model with just like 100 bucks of just post training

00:00:55.320 | there's it's very disrespecting of ground truth borders because it can just doesn't know the

00:01:01.360 | difference between hallucination and memorizing from a world model and it also doesn't respect

00:01:06.560 | copyright which is a whole other topic that we won't get into today but it's also super fascinating

00:01:10.340 | and how does that relate to do with AI engineering right like I think a lot of you here are here

00:01:15.400 | because you are interested in that concept at least maybe you identify as an AI engineer maybe you're

00:01:19.200 | trying to hire an AI engineer so there are a lot of definitions floating around and I confess that

00:01:25.220 | you know I've contributed to that is engineering an API line right that's the that's the line a lot of

00:01:30.600 | people have and that's come under some debate recently and yeah that's one form of AI engineering and I think

00:01:35.660 | that is useful to some people for understanding like where the responsibilities in a team might stop

00:01:40.280 | in and start with the other people in the team or maybe it's there's different subtypes right like

00:01:46.220 | last AI engineer summit I talked about the three types of AI engineer that I was seeing emerge the AI

00:01:51.760 | enhanced engineer the AI products engineer and the non-human agentic AI engineer or it could be a job

00:01:59.280 | description that you try to sort of list out and this is something that on the latent space podcast we

00:02:03.860 | recently went through with illicit talking about the different roles that they see within their teams

00:02:08.860 | as well so okay if I broadly have any of these three things do I have I nailed down a good definition of

00:02:15.360 | AI engineer that is workable yes right but is that something that we're happy with this is something

00:02:20.200 | that we can is there nothing left to explore I think the answer is no I think there's more to explore

00:02:24.580 | I think the very easy cop-out as well for people discussing this is that you have your opinion I have my

00:02:31.160 | opinion you come from your point of view I come from my point of view we agree to disagree or we agree

00:02:35.760 | that you know the different different strokes different folks and we move on I don't really

00:02:40.500 | like that just because there's no shared agreement on the things that is ground truth to everybody

00:02:45.960 | so I want to raise that challenge a little bit more I was in a podcast with Fraza Habib who's one of

00:02:52.340 | the speakers today talking about what this conference is and why this conference is does what it does

00:02:57.700 | and I always say that AI engineer conferences are effectively my highest stakes expression of what

00:03:03.220 | I think the state of AI engineering is so this time last year 2023 AI years are two times of human years

00:03:10.580 | we had a few tracks and we had a few topics that were up for debate we had RAG code gen and then agents

00:03:17.440 | and multi-modality all those tracks are repeated here today I have some speakers illustrated here just for

00:03:23.000 | illustrative purposes these obviously are not every not everyone involved but I think just like

00:03:28.520 | the inside-out metaphor that I've been thinking a lot about as the engineer matures so there's the

00:03:34.040 | number of concerns that you have to juggle in your head so this year you know after you're a competent

00:03:38.900 | AI engineer this year you're now faced with like okay I have to migrate to open models I have to build up

00:03:44.120 | my evals maybe I should have done that first that's a whole topic of discussion maybe I should scale up my

00:03:48.920 | inference or maybe I should deploy it to the fortune 500 and maybe on the management side of things I

00:03:54.120 | should be hiring teams of AI engineers and managing AI strategy for my company I think the last track

00:04:00.360 | you know like I talked about the nine tracks in in the engineers it's always about the network the

00:04:03.720 | community the network that we're building that's probably the single most important part of this

00:04:08.360 | conference and that's the part that we cannot sell we cannot I cannot put on the website hey we have

00:04:11.480 | good community because no one will believe us you have to come and see for yourself but please for those of you who've been uploading to the google photos album

00:04:17.320 | we've been tweeting out your photos and sharing them on LinkedIn please keep doing that that's a

00:04:22.520 | way for myself and everyone else who is not at the conference to try to join in on the fun

00:04:27.720 | the reason I'm not comfortable with any of these tracks because I is because I know how they were made

00:04:33.640 | because I made them up and I know that because I was looking at the the original document for the rise of the

00:04:40.840 | AI engineer it's we're celebrating the one year anniversary today and just down in the document

00:04:47.160 | somewhere I just listed out the you know disciplines that I thought the engineer would have and those

00:04:50.760 | eventually became mostly mapping to the tracks that I have been exploring in these conferences and the

00:04:55.640 | meetups that I do and it's arbitrary like why is there a separate agents track from code gen why is

00:05:01.880 | there a separate rag track from open models like these are all related what you know they're all of a kind

00:05:07.720 | and obviously as a competent engineer you should be familiar with all these things and that brings us

00:05:13.480 | back to this mindset of having boundaries and borders right these are all made up by someone I made them up

00:05:19.640 | for this one but you know you're going to live in a world where your boss made it up at your company

00:05:23.640 | and these are not reflective of how reality actually has to operate right if you start with all the rules

00:05:31.160 | and a different group of people came in would they agree on the same rules probably not just because

00:05:35.640 | they're made up but the laws of nature are hard to make up because the reality actually works that way

00:05:40.680 | if you had an alien civilization come down to earth they would discover that gravity works the same

00:05:44.360 | the sun the energy from the sun works the same the magnetic field works the same so what are the

00:05:49.640 | laws of AI engineering we'll come we'll come to that via definitions of software engineering and real

00:05:54.920 | engineering so I went and looked up definitions of software engineering and IEEE talks about the application

00:06:00.200 | of engineering to software with the design implementation testing and documentation of

00:06:03.960 | software Google developers talks about mostly the same thing they also mentioned software life cycle

00:06:08.760 | management okay like really reasonable definitions of kind until you look at real engineers real

00:06:15.800 | engineers talk about applying natural science sciences utilizing them for benefit of humanity

00:06:21.560 | both the IEEE and the national association of engineers agree on that and it's curiously missing from

00:06:27.160 | software engineering like the benefiting humanity part the understanding natural laws part completely

00:06:31.480 | missing from software engineering so my proposal to you is that AI engineering is somewhere in between

00:06:35.400 | right the the the the software engineering but then encountering a lot of the more of the real world

00:06:40.280 | constraints than you would in a typical software engineer career so if we know the laws of earth and

00:06:45.880 | they are independently derived they cannot make no matter what point of view you're looking at they all

00:06:49.960 | are the same laws then what are the equivalent laws of AI engineering I have a few you can come up with more but I'm just going to propose some to start off the base

00:06:57.080 | there's constants so for example if you're designing for humans you should respect the fact that

00:07:03.000 | humans only speak at 80 words per minute but they read at 200 words per minute right so there's an inherent

00:07:08.360 | disparity there there's also constant contingent facts that things that are true for now instead of true

00:07:13.560 | forever and true for now facts are for example like the apple intelligence when they ship a local model on

00:07:19.480 | every phone then that inference speed of 30 tokens per second that they're advertising becomes the baseline speed limit

00:07:25.400 | a speed barrier of what intelligence that is too cheap to meter should look like

00:07:29.960 | and these things they're not set in stone they're not actual physical laws so they also trend over time

00:07:36.040 | due to forces and momentum and I want to establish a little bit like I think it's under it's very

00:07:41.400 | beneficial for AI engineers to understand what the Moore's laws of AI is so that you can plan for them so

00:07:46.120 | that you don't have to make the bad bets that are not going to last just obviously just because of overwhelming evidence

00:07:51.960 | the first bet is the improving of context right a year ago I was interviewing mosaic and talking about

00:07:59.080 | mpc7b with their whopping 60 000 70 000 token context with a lot of loss today sitting in the audience we

00:08:05.640 | have people who have trained million token context windows and we've also uh have from anthropic which

00:08:12.840 | just released cloud three five years last week um the the fact that you know that we have complete

00:08:17.160 | utilization it's not just about the length of context it's also about the utilization of context and I've been

00:08:21.000 | Greg who's sitting in the audience as well would be very happy with how Claude is improving on their

00:08:25.240 | utilization of their very very long context windows there's also the cost of intelligence the commodification

00:08:31.480 | of intelligence so in the past two years we've seen a 99.55 percent decline in the cost of GPT-3 level

00:08:37.880 | intelligence the cost of GPT-4 level intelligence has probably come down maybe 90 percent maybe maybe 80 percent

00:08:43.240 | 80 percent um from from GPT-4 to Lama 3 and now all the the newer models that are finally I think it's worth

00:08:49.240 | commenting a little bit on where AI engineering stands in contrast to other AI philosophies there's EA versus

00:08:54.680 | EAC and maybe we're in the middle like we we care about safety but we also want to accelerate right it's

00:08:59.640 | kind of like a weird combination of the two things my proposal is that that one dimension isn't enough to

00:09:04.680 | express how AI engineer differs from the other philosophies and actually need to add a second dimension to talk about

00:09:09.720 | utility we are utility maxis above all else we see what's out there and we want to use it to benefit

00:09:14.360 | humanity so my message to everyone at the world's fair is to try to disrespect borders a little bit

00:09:20.280 | try to avoid your own dogmatic beliefs lazy consensus of other people or passive reactions

00:09:24.680 | and in other words try to disagree disagree more disagree with your own conclusions disagree with

00:09:29.400 | each other productively and disagree with the status quo and I think there you'll find that this

00:09:34.680 | conference becomes more of a useful landmark in your careers rather than just the party which it can very

00:09:40.360 | well be but my final analogy which I really like is that AI engineers are the kind of person that looks

00:09:44.840 | at shoggoth and sees instead of a monster that cannot be tamed they want to turn them into

00:09:49.480 | mass rapid transit and the kind of person that looks at that looks at you know a force of nature and wants to

00:09:55.880 | turn it into tools that are useful for people is the kind of engineer that I would love to speak to and welcome

00:10:00.680 | at conferences like this one so that's my view of what borders and engineering without borders should look like

00:10:07.400 | I very much encourage you to jump between tracks to jump between friend groups to jump between disciplines

00:10:11.560 | and modalities because here's the one place that you can do that outside of your work and to mingle

00:10:16.440 | with everyone else that we've gathered so I hope you enjoy doing that I wish I was there in person but

00:10:20.840 | just share them online and I hope to see you in person at the next one bye

AI Engineering Without Borders — swyx