Nick DiBona: Before hypermode, I worked at Vercel. We had an office down the street above a pizzeria, and we had three big problems. One, we were losing to other JavaScript frameworks. Two, we were losing badly to other hosting providers. And three, I was losing to my diet of exclusively pepperoni pizza.
Eventually, we started to win, not because we were smart or we knew all the right answers, but because we dealt this core competency of iterating really, really, really quickly. We didn't know the optimum strategy, but we figured if we just tried more things faster than everyone else, we'd eventually be able to adapt and figure out the right products and strategies to figure out what the market wanted.
Iteration is the compound interest of software. Keep doing it long enough, and eventually really good stuff starts to happen. Because we tried a lot of things really quickly, we eventually figured out two things. One, developers want to incrementally adopt new technologies. And two, they don't want to commit to architectural patterns before they know how their application is actually going to work.
But iteration can't happen if you're afraid of getting it wrong. The same thing that has held back web is also holding back AI. And if I'm honest, there are even more things for us to get wrong about Gen AI. When I think about it, I'm grossly overwhelmed. What's the right hardware?
What's the right model? What's the right prompt? How do I integrate? How do I monitor? How do I improve? Everyone here knows a horror story of someone with a runaway bill, a hallucinating chatbot, a project that took months and months and never delivered any value. And in the end, we need to build systems that de-risk getting it wrong.
Because we are going to get it wrong a lot. Picking the wrong model doesn't matter if there's no friction to switching it out. Integration is simple when your classical systems and your AI systems use the same APIs. You can fearlessly make changes to prompt strategies, data mixes, if you can trace that inference step by step by step.
At HyperMode, we care deeply about making AI approachable. Everyone here should be able to put AI in their apps without specialized skills. At its core, HyperMode is a runtime. It allows you to easily integrate models and data into AI functions. We then surround that runtime with a bunch of tools that make it easy for you to rapidly iterate and observe those AI functions in prod.
We make it easy to get started, incrementally adopt AI as appropriate, and then as your team develops those skills, reimagine those applications as AI native. First and foremost, we want to make the developer experience of developing with AI a lot less terrible. When it comes to adding a new model to your service, you probably don't want to read a bunch of pages of docs to figure out the temperatures on a 0 to 2 rather than a 0 to 1 or a 1 to 10.
With HyperMode, we provide you type ahead and your favorite code editor right out of the box. No SDKs, nothing to download. Then when you do ship to prod, we give you strong defaults just to get started. Or if you have your own stack, bring it along. In either case, we'll remove a lot of that complexity for you.
For example, traditional RAG requires N+1 requests. You need to make an additional call to embed the inputs. Go talk to your vector store with HyperMode. You can do that all in one request. We've built an in-memory embedding and search service that will allow you to do that and save a couple of milliseconds per request.
Finally, building intuition around non-deterministic systems is hard. Each model has its own personality, and we make it really easy for you to quickly compare different inferences, different tunes, different models. And you can then export this data set to fine tune. On Monday, your boss is going to ask you, what did you learn at AI World Fair?
If you come by our workshop after lunch, I'll prove to you that you can make iteration velocity of core competency. The team that built all this amazing stuff will be there. We'll show you how to build natural language search, intelligently sort every data list in your product, detect outliers, catch bad guys.
You'll walk over the demo that you're proud of and a plan to put something like it in prod by the end of next month. And if seeing my happy face again and building something really cool is not enough, we'll give you $1,000 in HyperMode credits to get started. Thank you all so much.
Thank you all so much. Thank you all so much. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you. Thank you.
Thank you. We'll see you next time.