back to indexRevenue Engineering: How to Price (and Reprice) Your AI Product — Kshitij Grover, Orb

Chapters
0:0 Introduction
1:12 Replet
2:26 Pricing
4:43 Pricing principles
12:8 Pricing caveats
13:33 Predictions
00:00:00.000 |
Thanks for coming to my talk. I'm Shitej. I'm one of the co-founders at ORB, and I'm going to be 00:00:20.540 |
talking about how to think about pricing. Maybe top-level takeaway from this talk is that pricing 00:00:26.460 |
is a deep, complicated topic. We're going to cover some examples. We're going to cover some 00:00:30.260 |
tactical advice. But in general, the way you should think about pricing is pricing is a form of friction 00:00:35.020 |
for your product. And sometimes that friction can be applied for a very good reason. Sometimes that 00:00:40.480 |
friction can just prevent people from using your product. And so you really have to think about the 00:00:44.680 |
value your product is delivering, as well as the audience you're delivering it to, and the way that 00:00:49.040 |
you're building and innovating in your product to start really thinking about pricing. Just a quick 00:00:53.780 |
intro to ORB. ORB is a usage-based billing infrastructure company, so we help companies 00:00:58.780 |
at lots of layers of the stack think about pricing and billing. Nowadays, a lot of our conversations 00:01:05.220 |
first center around monetization and pricing as a practice, and then we get into the weeds of how do 00:01:09.860 |
we actually bill for this stuff. So maybe an intro tweet to start. This is Amjad. He's the CEO and 00:01:19.420 |
founder of Replit. Replit is one of ORB's customers. And this is a tweet actually pretty recent, starting 00:01:24.520 |
to think about how Replit agent should price itself. And so a pretty short tweet, but a lot of complexity 00:01:30.060 |
actually packed in here. Really starting to think about, as you have Replit, which is a very popular 00:01:35.920 |
agent that does programming, is starting to build full-stack web apps. How should a company like Replit 00:01:42.580 |
price their agent? Should it be like a super simple, every time the agent checkpoints, it charges you a 00:01:47.740 |
fixed amount? Or should it start aligning with the complexity of the change it made? And as you can 00:01:53.780 |
start thinking through this question, you have to think through considerations like who is Replit built 00:01:57.840 |
for, right? What are the costs that Replit incurs? But also just the user experience of if I get charged 00:02:03.800 |
some amount at the end of something that this agent has done without me really having control over what 00:02:08.720 |
the agent did other than the initial prompt, is that going to be surprising? Is that not going to be 00:02:12.840 |
surprising, right? So these considerations apply particularly to agents because the work that the 00:02:18.760 |
agent is doing is fairly opaque or can be opaque, but they actually apply at every layer of the stack, 00:02:23.800 |
including the kind of infrastructure layer of the AI stack. So monetization as a whole is changing. 00:02:29.560 |
Maybe let's start with some kind of traditional principles in pricing and how pricing has always been 00:02:35.320 |
thought about, right? So we're maybe coming from way back in the day, you know, you're selling a fixed 00:02:40.080 |
license, you go to like Fry's Electronics and buy your TurboTax license. And then of course, now you have 00:02:46.600 |
seat-based pricing subscription, maybe you're paying $30 a seat, all the way to you have this usage-based 00:02:52.380 |
pricing, which is very dynamic. And then maybe you can consider outcome-based pricing as a further 00:02:56.600 |
evolution of that. So in traditional pricing, you have a few key principles. You want to make sure that the 00:03:02.460 |
pricing is simple so that users can actually understand it. They can grok your pricing. They 00:03:05.600 |
know how they're going to pay for your tool. You want to produce some friction because it determines 00:03:10.680 |
the value of your product. Like willingness to pay is a very real signal. And then you want to protect 00:03:14.520 |
your margins, right? Traditionally, software margins have been very healthy. You know, a good software 00:03:20.540 |
margin looks like 80%. Obviously, as you start getting closer to the infrastructure layer and as you start 00:03:26.160 |
moving into the world of AI, these margins are much, much more variable, not necessarily unhealthy, but 00:03:31.360 |
there's a lot more degenerate workloads that you can incur with an agent. So now starting to think 00:03:37.080 |
about AI native pricing, predictability matters a lot, right? I think especially when you're selling 00:03:42.420 |
to more mature companies that need to budget and think about their, you know, cost profile using your 00:03:47.540 |
tool, they need to be able to predict how that cost is going to scale, not just on an individual 00:03:53.680 |
developer, how that cost might scale, but just generally throughout the company's buying process. 00:03:58.380 |
AI is very early. So speed and just showing the value of your product and people being able to play 00:04:06.740 |
around and experiment matters a lot. Oftentimes, at least in this space, it's not like people are coming 00:04:12.000 |
to you buying a very established product. It's kind of your burden of proof to say, you know, here's how 00:04:17.100 |
our agent works. Just go try it and then start thinking about pricing. So maybe the friction point and 00:04:21.940 |
where it is has changed. And then I was just saying, um, cogs are variable, but, but not only are they 00:04:27.780 |
higher, they're also changing very, very quickly. I'm sure everyone in this room seen, okay, great. Um, 00:04:32.440 |
you know, open AI, uh, cut, cut model costs by a third overnight. Uh, that has a pretty material impact 00:04:38.960 |
on like potentially the underlying cogs of many of these agentic tools. So, um, the three key principles 00:04:45.760 |
that, that I think matter here are like really thinking about the audience, who your product is 00:04:50.140 |
optimized for, what the value delivery mechanism looks like your margin structure. Um, and, and I say 00:04:56.320 |
margin structure and not actually margins, because again, the underlying costs are changing very, very 00:05:01.060 |
rapidly. So you want to think about what are the axes of scaling rather than what is the literal margin 00:05:05.980 |
on day one, especially if you're an earlier product. And then you want to give yourself the 00:05:10.140 |
flexibility to experiment over time. This comes with a lot of complexity, but I think it's very important 00:05:14.520 |
because you're just not going to get it right the first time. Um, so, so these are kind of the principles 00:05:18.980 |
to keep in mind, maybe going through one by one. Um, the, the way to kind of think about pricing for your 00:05:25.080 |
audiences, um, you know, what is their buying journey and what are they coming to you to, to buy and what value 00:05:32.540 |
are they looking to get out of it? Right. Uh, if you're like, I, I, we, we talked to lots of people 00:05:37.000 |
who are like, Oh, you know, service now is pricing is so silly. You go to their pricing page and it says 00:05:41.680 |
like contact sales. It's like, well, that's, that's a very, very different audience that they're selling 00:05:47.240 |
to where they're probably cross selling some, some existing contracts. That is a big enterprise commit. 00:05:52.160 |
They want you to talk to sales because they really want to tailor the value to their sales motion. 00:05:57.320 |
Right. And then on the other hand, companies that are selling to individual developers, 00:06:00.800 |
there's not a procurement team. There's no purchasing process. They want you to click, 00:06:04.740 |
try it and start using the product immediately. So, so, um, don't just think about the kind of 00:06:11.100 |
point of entry, but think about who's behind that at the company. Uh, is it going to be a cross 00:06:15.280 |
functional decision? Is it something that someone can just insert into, you know, as, as a vendor on day 00:06:20.500 |
one, or is there going to be more of a process around it? And then also, uh, start to kind of think 00:06:25.760 |
about, uh, what users are seeking as quickly as possible. If they're seeking a 00:06:30.620 |
proof point, then perhaps you want to be able to give them that proof point before you insert 00:06:34.240 |
pricing into the equation. Here's, here's an example, again, using replet, um, couple of things 00:06:39.280 |
to note here. Uh, you'll see that there's one, a lot of tiers and pricing is actually quite 00:06:45.360 |
transparent. Obviously it's a free tier, right? Indicating that they want you to see the value 00:06:49.560 |
without having to do a ton of work. Think about the monetization. Um, and, uh, one thing that's kind 00:06:56.000 |
subtle is oftentimes pricing and packaging really starts to, uh, dictate what sort of use case they 00:07:03.680 |
are imagining you to use, right? Like, um, you know, what models you have access to, how many agent 00:07:09.260 |
checkpoints you have access to included, how many seats you might have included. Uh, it kind of starts 00:07:13.900 |
positioning your product. Is it a multiplayer product? Is it something that people are going to 00:07:18.660 |
be using daily or is it something that you're going to be using, you know, maybe like five times a month, 00:07:22.620 |
uh, the way you package it really determines the incentives that you're pushing onto your users 00:07:27.020 |
and obviously controlling for the costs that you might pay on the backend. Here's another example 00:07:31.660 |
from Unify, which is, I think, uh, uh, agent agent to company, but it's like targeted at revenue and go 00:07:36.780 |
to market teams. One, you'll see the, like the price point is a lot higher. Um, I want you to like 00:07:41.480 |
forget whether it's monthly or annually or any of that, just like you come on this page, the price point 00:07:45.320 |
is higher that communicates something psychologically. And then they're like second duty years are custom, 00:07:50.180 |
right. Um, again, I, and, and maybe the third thing to point out here is that 50,000 credits, 00:07:56.180 |
200,000 credits, 600,000 credits. I actually don't know what that means, but that seems like a large 00:08:00.840 |
number that communicates something about the workflows they expect to participate in and like 00:08:04.720 |
what that might mean for how you use Unify. Right. Um, of course, like it maybe goes instead, 00:08:10.100 |
but the set of logos you put on your pricing page is also very important. Um, are you tailoring it 00:08:14.820 |
towards, okay, we, we service the best developer brands or we service the like fortune 100 enterprises. 00:08:19.620 |
Um, and so pricing is, is not just the price point, but, but of course also the, the whole experience 00:08:25.060 |
of what someone sees when they land on this page. Um, let's, let's talk a little bit about margins. Um, 00:08:31.040 |
you know, as, as, uh, the, the underlying architecture of your product changes, of course, 00:08:37.320 |
your, your margins are going to change. These inputs are changing very quickly. Um, one, 00:08:40.820 |
one, one interesting example here is, uh, you, you, as much as possible, you want to think about, 00:08:46.520 |
uh, the differentiation of your product also in R and D innovation and try to pass that on to your users as 00:08:52.020 |
basically pricing leverage. Uh, a good example of this is like Cloudflare has built their 00:08:57.420 |
infrastructure on and their workers infrastructure on these like isolates, right? So, so they're kind 00:09:01.880 |
of lightweight compute machines. And so they can charge for CPU milliseconds. And so if someone is 00:09:07.640 |
building like an agentic app on Cloudflare and it makes like a, you know, call out to open AI or 00:09:12.700 |
Anthropic, you're not getting charged for that because Cloudflare is just charging you for CPU 00:09:16.340 |
milliseconds instead of wall time. I think that's a really good example of them, you know, taking an 00:09:20.820 |
architectural bet, um, which, which might be good for many, many reasons, but it's a really good fit 00:09:25.500 |
for building AI agents or generally AI software on top of Cloudflare. So that's, I see that as an 00:09:30.860 |
example of like passing down technical innovation to the end user. Um, and then the other thing to 00:09:35.840 |
consider about your margins is you don't have to protect them at all costs. You just have to think 00:09:40.920 |
about what are the extreme edge cases and what are you doing to like prevent those outcomes, right? So 00:09:45.940 |
if you have these potentially really degenerate workloads, uh, how do you incentivize reasonable 00:09:50.880 |
usage? Do you put in rate limits or guardrails? Um, you don't have to kind of linearly necessarily 00:09:56.100 |
scale costs with usage. You, you can protect your margins potentially in, in different ways. Um, 00:10:02.220 |
this is an example from Jasper. I think this, this blog post and pricing has changed since, 00:10:06.600 |
since I pulled this example, but you know, Jasper realized like, Hey, we're, we're targeted toward 00:10:11.180 |
marketing teams. Marketing teams don't want to count credits. Like when they're, you know, 00:10:14.560 |
basically writing marketing copy. And so, uh, they, they talk a little bit in this blog post about 00:10:20.560 |
how they've architected around being able to like switch between models under the hood seamlessly. 00:10:25.580 |
You don't have to choose the right model for the job. And as a result of that, they went unlimited 00:10:29.780 |
on, uh, each of the three tiers. So you're, you're no longer counting credits as you're using Jasper. 00:10:34.960 |
Now, uh, this kind of highlights the core value prop of the product, which is they want to be 00:10:40.060 |
a really critical part of that workflow. And they don't want you to have to like pick and choose when 00:10:44.680 |
you use Jasper because you might have credits or might run out of credits. So again, kind of trying to 00:10:50.220 |
take, is there like an architectural, uh, advantage you have and how can you pass that on to your 00:10:55.480 |
pricing model? Finally, I want to talk about flexibility. Um, you know, you, you all, as, 00:11:01.500 |
as you're working in your teams, uh, you're constantly improving your product. You're putting 00:11:05.280 |
a lot of R and D into it. Um, one thing that is used to be true, especially in the seat based world 00:11:10.440 |
is like, you'd have R and D over here shipping a ton, um, increasing the product value. And then you'd 00:11:16.080 |
have like a once a year, okay, we're changing our seat price from $20 to $25. Um, one advantage 00:11:20.980 |
you have in the AI world is like, uh, the value is much closer to the end users. It's not necessarily 00:11:26.500 |
like a $20 a seat thing. It's like someone is literally seeing, okay, I'm, I'm taking this 00:11:30.780 |
action. This agent is giving me this output. Okay. It's getting better. It's getting, it's 00:11:34.760 |
better, it's getting better. So I think not only do you as a business have a responsibility 00:11:38.740 |
to incrementally evolve your price as, uh, you know, you invest R and D, but I think it's, 00:11:44.780 |
it's honestly a lot more understandable for customers why that's happening because the, 00:11:49.000 |
the monetization is much closer to the value of the product. Um, so, so, uh, you know, 00:11:54.420 |
obviously in orbs customer base, but in general, we are seeing people, uh, make many, many more 00:12:00.020 |
price changes, not just once a year or twice a year, but in fact, some customers making price 00:12:04.100 |
changes, uh, two, three times a month. Um, and again, there's, there's caveats that come 00:12:09.760 |
with that. Uh, you have to think about, you know, what, what is that complexity that introduces in 00:12:14.500 |
the, in the mental model of your end users, how are you going to manage change? Um, and a kind of 00:12:20.120 |
good internal dynamic example of this is, uh, if your go to market team is, you know, commissioned 00:12:25.900 |
and, and they're selling these big enterprise contracts and your pricing is changing, or you're 00:12:29.480 |
going to like a pay as you go model, you need to think about like, what does that mean for your 00:12:33.040 |
sales team? What does that mean for the incentives that they have in selling your product? 00:12:37.020 |
classic example of this is a lot of usage based companies now commission their sales reps, not 00:12:41.620 |
on the initial contract land, but on the expansion of that account over time. And then, okay, second 00:12:47.480 |
order effect. How do you think about customer success versus sales as an organization? And how do you 00:12:51.320 |
organize that internally? Right? So, so repricing can have lots of impacts, not just on your end user, 00:12:56.300 |
but also in the internal dynamics of your org. Um, finally, the thing I'll say is, uh, as you're 00:13:01.800 |
iterating, you, you really want to, as much as possible, be able to simulate the impact that this 00:13:07.740 |
pricing change is going to have on your users. In particular, I think, uh, you want to be able to 00:13:12.460 |
simulate, uh, different usage patterns. So if you think about your users and cohorts, um, some might 00:13:17.840 |
be using it for very different use cases. What is that revenue mix going to look like? What is your top, 00:13:22.840 |
uh, revenue generating customer going to, how are they going to change with a new pricing model? You want to 00:13:27.920 |
be able to do that in a, in a data informed way, especially if you're running at a, at a reasonable 00:13:32.420 |
scale. Here's some predictions I have as, as we're wrapping up for AI agent pricing. I think, um, you 00:13:39.380 |
know, the, the price wars will continue. We're seeing agents get cheaper and cheaper. Um, and, and, you 00:13:44.780 |
know, I think you're, you, we should expect people to try to give effectively unlimited plans. I think 00:13:51.040 |
there'll still be caps. I think there'll still be guardrails, but I think, um, we, we will move to a 00:13:55.700 |
world where, uh, you know, people just want these agents embedded in every workflow. So we'll try to, 00:14:01.380 |
try to converge to effectively unlimited. Um, I think outcome-based pricing is going to get more 00:14:07.260 |
real. We are seeing some companies, uh, do that already, but the, the, the hard part about outcome 00:14:12.700 |
based pricing is defining what the actual SLA is like, what, what is the outcome and how do you measure 00:14:18.300 |
it? And does it make sense to the end user? Um, I think like, we're going to see a lot of clear 00:14:23.780 |
definition of success language and contracts and like people really trying to get, uh, precise about 00:14:30.260 |
what the outcome is that they're charging for. And then, um, I think there's going to be a lot more 00:14:36.040 |
real-time visibility, spend management, uh, balance alerts. I think the kind of workflow around pricing 00:14:41.860 |
and monetization is going to get, uh, more and more sophisticated. You think about like, you know, 00:14:46.860 |
obviously in, in database queries, you have like query planners. I think it's quite possible that, 00:14:50.700 |
you know, you, you prompt an agent, it tells you like, here's my estimate for how much, uh, how many 00:14:55.780 |
credits I'm going to use, or, Hey, here's three options for, um, how I might execute, how I might plan 00:15:02.800 |
this workload. One's going to take 10,000 credits. The other is going to take, you know, 4,000 credits. 00:15:08.280 |
I think that customer experience of like being able to control the workflow around the spend 00:15:13.300 |
is likely to get much, much more sophisticated, um, this year. So, uh, just, just final note, 00:15:19.200 |
um, we have a whole ebook on how to price your AI agent that goes into much more tactical advice, 00:15:23.440 |
talks about specific, you know, threshold billing, fraud, prepaid pricing models. Um, 00:15:28.300 |
if you want to check that out, that is at this link. Thank you.