back to indexHypermode Launch: Kevin Van Gundy

Chapters
0:0 Intro
0:34 Iterating
1:21 Getting it wrong
2:2 Integration is simple
2:55 Developer experience
3:18 Memory embedding
3:48 Intuition
4:5 Outro
00:00:00.000 |
Nick DiBona: Before hypermode, I worked at Vercel. 00:00:16.800 |
We had an office down the street above a pizzeria, 00:00:21.760 |
One, we were losing to other JavaScript frameworks. 00:00:25.520 |
Two, we were losing badly to other hosting providers. 00:00:29.320 |
And three, I was losing to my diet of exclusively pepperoni 00:00:34.980 |
Eventually, we started to win, not because we were smart 00:00:40.400 |
but because we dealt this core competency of iterating 00:00:47.760 |
but we figured if we just tried more things faster 00:00:51.700 |
be able to adapt and figure out the right products and strategies 00:00:57.960 |
Iteration is the compound interest of software. 00:01:00.960 |
Keep doing it long enough, and eventually really good stuff 00:01:05.460 |
Because we tried a lot of things really quickly, 00:01:10.200 |
One, developers want to incrementally adopt new technologies. 00:01:14.080 |
And two, they don't want to commit to architectural patterns 00:01:17.080 |
before they know how their application is actually going to work. 00:01:19.580 |
But iteration can't happen if you're afraid of getting it wrong. 00:01:26.580 |
The same thing that has held back web is also holding back AI. 00:01:31.080 |
And if I'm honest, there are even more things for us to get wrong about Gen AI. 00:01:34.580 |
When I think about it, I'm grossly overwhelmed. 00:01:43.080 |
Everyone here knows a horror story of someone with a runaway bill, 00:01:48.080 |
a hallucinating chatbot, a project that took months and months 00:01:53.580 |
And in the end, we need to build systems that de-risk getting it wrong. 00:02:02.080 |
Picking the wrong model doesn't matter if there's no friction to switching it out. 00:02:06.580 |
Integration is simple when your classical systems and your AI systems use the same APIs. 00:02:11.580 |
You can fearlessly make changes to prompt strategies, data mixes, 00:02:15.580 |
if you can trace that inference step by step by step. 00:02:18.580 |
At HyperMode, we care deeply about making AI approachable. 00:02:23.580 |
Everyone here should be able to put AI in their apps without specialized skills. 00:02:31.580 |
It allows you to easily integrate models and data into AI functions. 00:02:35.580 |
We then surround that runtime with a bunch of tools that make it easy for you to rapidly iterate 00:02:43.580 |
We make it easy to get started, incrementally adopt AI as appropriate, 00:02:48.580 |
and then as your team develops those skills, reimagine those applications as AI native. 00:02:54.580 |
First and foremost, we want to make the developer experience of developing with AI a lot less terrible. 00:03:01.580 |
When it comes to adding a new model to your service, 00:03:04.580 |
you probably don't want to read a bunch of pages of docs to figure out the temperatures on a 0 to 2 rather than a 0 to 1 or a 1 to 10. 00:03:11.580 |
With HyperMode, we provide you type ahead and your favorite code editor right out of the box. 00:03:19.580 |
Then when you do ship to prod, we give you strong defaults just to get started. 00:03:23.580 |
Or if you have your own stack, bring it along. 00:03:26.580 |
In either case, we'll remove a lot of that complexity for you. 00:03:28.580 |
For example, traditional RAG requires N+1 requests. 00:03:32.580 |
You need to make an additional call to embed the inputs. 00:03:41.580 |
We've built an in-memory embedding and search service that will allow you to do that and save a couple of milliseconds per request. 00:03:47.580 |
Finally, building intuition around non-deterministic systems is hard. 00:03:53.580 |
Each model has its own personality, and we make it really easy for you to quickly compare different inferences, different tunes, different models. 00:04:01.580 |
And you can then export this data set to fine tune. 00:04:05.580 |
On Monday, your boss is going to ask you, what did you learn at AI World Fair? 00:04:10.580 |
If you come by our workshop after lunch, I'll prove to you that you can make iteration velocity of core competency. 00:04:18.580 |
The team that built all this amazing stuff will be there. 00:04:21.580 |
We'll show you how to build natural language search, intelligently sort every data list in your product, detect outliers, catch bad guys. 00:04:28.580 |
You'll walk over the demo that you're proud of and a plan to put something like it in prod by the end of next month. 00:04:33.580 |
And if seeing my happy face again and building something really cool is not enough, we'll give you $1,000 in HyperMode credits to get started.