back to indexSmall AI Teams with Huge Impact — Vik Paruchuri, Datalab

00:00:00.000 |
Okay, my name is Vikas. I'm the CEO of Datalab, and today I'm going to talk about how we got to 00:00:19.460 |
40k GitHub stars, seven-figure ARR, and trained state-of-the-art models with a team of three. 00:00:25.660 |
So I spent the last year training these models, like Brittany mentioned, Marker, and Surya. 00:00:30.960 |
I also built repositories around them. I left my AI research job, and I started a company and raised a seed round. 00:00:37.180 |
I did not get enough sleep. It's important. And this is Datalab. So we made our first hire in January. 00:00:44.620 |
We're now a team of four. Faraz is new enough that he's not pictured. We've grown revenue 5x since January. 00:00:50.260 |
We're at seven-figure ARR, and our customers include Tier 1 AI labs, universities, Fortune 500, and AI startups, 00:00:57.320 |
including Gamma, who I used to make this presentation. 00:01:01.100 |
So today's focus, I'm going to talk about how we've grown with a small team. 00:01:05.280 |
I'm going to talk about my philosophy on building teams and why I think we're at kind of an inflection point 00:01:11.260 |
in how we think about building teams. And I'm really going to talk about this idea that headcount does not equal productivity. 00:01:17.180 |
There's like this really persistent notion in Silicon Valley that you raise money, you hire a bunch of people, 00:01:21.520 |
and you build more. But it almost never, in my opinion, works out perfectly that way. 00:01:26.240 |
All right. So my last company was called DataQuest. I'm very fond of the data prefix, apparently. 00:01:32.040 |
And we scaled to 30 people and 4 million ARR bootstrapped during COVID. It was an online education startup. 00:01:37.860 |
And then, unfortunately, we had to do two rounds of layoffs post-COVID when online education kind of tanked. 00:01:44.060 |
We went from 30 to 15, and then again from 15 to 7. And it was obviously awful for the people we had to lay off. 00:01:50.100 |
But I noticed something really interesting. Productivity and happiness increased a couple of months after both layoffs, 00:01:56.160 |
to the point where we were actually much more productive after both cycles than we were at the beginning. 00:02:00.920 |
And I started to wonder why that was. Like, how could reducing the team so much actually improve productivity? 00:02:07.540 |
And I came up with these four hypotheses. One, we'd hired a lot of specialists. 00:02:12.420 |
So as you scale, like Grant mentioned in the earlier talk, you end up building these very specialized functions and teams. 00:02:18.220 |
And those specialists often can't flex across the company to solve the key issues of the company. 00:02:23.340 |
Two, we were a remote team, which required a lot of intentional process and heavy syncing, 00:02:29.180 |
which just eats into your time and just makes it really hard to get on the same page. 00:02:33.940 |
Because of that, we had a lot of meeting overload. And especially once we got middle management in place, 00:02:38.640 |
people whose job is kind of professionally to manage, we ended up with just a lot of meetings on people's calendars 00:02:45.700 |
And then senior people, we hired kind of a mix of experience, like most companies do. 00:02:52.320 |
And then senior people ended up getting kind of tied down and doing a lot of work to manage the more junior people. 00:02:58.780 |
We actually had a case where we had a three-person team, and we cut it down to one. 00:03:02.620 |
And the team actually got much more productive because it freed up the senior person's time. 00:03:08.280 |
And kind of every company, I feel like, goes through this journey. 00:03:11.500 |
There's this initial golden period when everyone is aligned, you're on the same page, 00:03:15.640 |
you're building this amazing stuff, and that's really when you build the core thing of your company. 00:03:20.460 |
Like Google with Search or Microsoft with Windows. 00:03:24.380 |
It's kind of when you figure out your business model. 00:03:26.460 |
And then you hire a bunch to fill out the edges around it. 00:03:29.020 |
Like you hire a bunch of enterprise sales, you hire a bunch of marketing, you hire a bunch of engineers 00:03:33.740 |
who are kind of in very small boxes to build very small features. 00:03:36.680 |
I had a friend at Amazon who worked there for two years and built a shopping cart button. 00:03:42.860 |
But, I mean, at that scale of org, that's kind of the tiny box you get fit in. 00:03:46.640 |
And you end up with a lot of bureaucracy, a lot of sinks, a lot of unclear priorities. 00:03:50.300 |
And this pattern is unfortunately very common. 00:03:53.360 |
But I started to think, what if that golden period just lasted forever? 00:03:58.860 |
And as I started working with Jeremy Howard at Answer AI, 00:04:03.680 |
I got to understand his philosophy for building a company a little bit better. 00:04:07.360 |
And his idea is basically hire less than 15 generalists. 00:04:11.520 |
So people who can really do everything across the stack 00:04:14.400 |
and really understand all aspects of the company, 00:04:16.740 |
fill in the edges with AI and internal tooling. 00:04:19.340 |
So Jeremy's invested a lot recently in fast HTML and things like Monster UI 00:04:24.200 |
because he sees them as kind of building block libraries 00:04:26.680 |
to really build out the other tools that the company's working on. 00:04:33.060 |
You don't need a Kubernetes cluster when you're a three-person company. 00:04:37.820 |
But this unfortunately requires kind of a high cultural bar for folks. 00:04:42.240 |
You need people who really want to and can understand everything you're doing. 00:04:49.000 |
You need go-to-market people who actually build. 00:04:56.020 |
So basically, you need people who are in it because they're building something together 00:05:00.580 |
and not in it for other reasons like politics or personal advancement, etc. 00:05:05.100 |
And everyone needs to really care about the customers and focus on them. 00:05:08.520 |
I think these are the prerequisites for this kind of team working, 00:05:12.180 |
this less-than-15-person team of generalists. 00:05:20.220 |
We recently shipped it but have not announced it yet. 00:05:23.380 |
So it's 500 million parameters that supports 90 languages 00:05:25.740 |
and 99% accuracy on our challenging internal benchmarks that include math. 00:05:29.920 |
And it also does some features that no other model does, 00:05:34.820 |
It uses PDF text as grounding at a line level. 00:05:40.080 |
And in order to do it, Tharun, who's a research engineer at Datalab, 00:05:43.700 |
and I had to handle the entire process from end to end. 00:05:46.740 |
So that included talking to customers, figuring out what they wanted. 00:05:52.060 |
and figuring out the right architecture, prototyping, 00:06:00.440 |
So building a data pipeline library, building out the data sets. 00:06:07.980 |
get the inference written for all our customers, 00:06:17.240 |
And every time you hand off between teams in a traditional company, 00:06:23.800 |
lossily communicate it to the people who build, 00:06:26.900 |
who lossily communicate it to the people who train the model. 00:06:31.940 |
You end up eating a lot of time in just syncing context. 00:06:36.200 |
You're not able to build a great end-to-end experience as a result. 00:06:39.120 |
And you have very slow feedback loops, right? 00:06:42.280 |
and it might impact your model training in months. 00:06:45.260 |
Whereas if you have generalists who can work across the stack, 00:06:49.720 |
You never need to share context and do inefficient syncing. 00:06:52.180 |
You get a really tight integration between all aspects of the company 00:06:57.500 |
And the reason we were able to do this is we used AI 00:07:00.440 |
to take kind of the easy, low-leverage pieces of this, 00:07:05.700 |
or helping us really figure out how to integrate it into the API, 00:07:09.200 |
whereas we did the higher-level work in each of these silos. 00:07:11.640 |
So if you get one thing from this talk, this is the thing. 00:07:16.840 |
More people does not equal more productivity. 00:07:24.000 |
So the first thing you have to do is hire senior generalists. 00:07:27.080 |
And senior to me does not mean years of experience. 00:07:30.640 |
You need people who can look at a problem and say, 00:07:35.840 |
And I really care enough to iterate with the customer to solve it. 00:07:45.420 |
Like, hey, let me deploy this Kubernetes cluster 00:07:47.560 |
and multi-stage pipeline to solve like a data extraction problem. 00:07:50.700 |
But in reality, you need people who can go back 00:07:53.240 |
and like kind of set aside the fixation on shiny tech 00:07:58.200 |
which usually is I'm just going to write a shell script 00:08:01.340 |
There's that famous like Hadoop versus shell script blog post 00:08:05.620 |
when you like you could replace a whole Hadoop cluster 00:08:12.140 |
And you need to work in person, I personally think. 00:08:17.000 |
but it's not great for a small team that needs to move fast 00:08:24.660 |
of this really fast collaboration and tight feedback loop. 00:08:33.060 |
but you have to reuse components aggressively. 00:09:02.640 |
All right, so basically, keep everything simple. 00:09:06.760 |
Architecture, as few moving pieces as possible. 00:09:12.600 |
Minimize bureaucracy, high trust, continuous discussions. 00:09:28.360 |
So a challenge we're going to face as we scale 00:09:31.240 |
is this idea that we're a document processing 00:09:34.640 |
And every customer has a slightly different way 00:09:38.140 |
And if you go back kind of to the last generation 00:09:43.180 |
is they hired a bunch of forward-deployed engineers. 00:09:44.800 |
You sat at a client site and you just kind of iterated 00:09:48.980 |
But in the future, you can really train a model 00:09:53.100 |
Like we can train a model to essentially loop over 00:09:55.280 |
customer outputs until it gets to the right state. 00:09:59.880 |
forward-deployed engineering side of the org. 00:10:06.620 |
I don't know exactly when this model falls apart. 00:10:08.880 |
But Gamma, as we just saw, is a great example 00:10:12.080 |
of a small team with very, very meaningful growth in ARR. 00:10:16.020 |
I think the key is being able to say no, right? 00:10:20.240 |
You can choose to go hire a bunch of forward-deployed engineers 00:10:23.920 |
or you can choose to solve it a different way. 00:10:25.840 |
And maybe that different way is slightly less efficient 00:10:27.980 |
in terms of revenue, but it might be more efficient 00:10:30.780 |
in terms of your long-term company trajectory and health. 00:10:33.220 |
So it's really unknown if this will work forever, 00:10:51.360 |
so that explains why this slide is not so well done. 00:11:03.820 |
and research engineer and full-stack engineer 00:11:21.320 |
Like we want people who only care about the work, 00:11:23.340 |
the people around them, and customers, right? 00:11:26.900 |
You need some ego to kind of advance your own ideas, 00:11:29.900 |
but not so much that you're willing to fight for them 00:11:32.140 |
at the detriment of kind of the health of the company. 00:11:52.080 |
Like if you come in, you get to work across the stack, 00:12:08.320 |
And that's another downside of remote culture, 00:12:11.260 |
in my opinion, it gets very hard to tell the two apart. 00:12:22.620 |
okay, let me find the best person and hire them. 00:12:25.940 |
Even though I may not necessarily have a role today, 00:12:29.500 |
This is actually a big debate in NBA and NFL drafting too, 00:12:33.720 |
like best player available versus drafting for fit. 00:12:38.840 |
I think the thing to think about as you scale 00:12:40.560 |
is like, how do we scale productivity, not headcount? 00:12:45.040 |
Like you can raise salary bands as the company grows. 00:12:52.780 |
Like a one researcher with access to eight GPUs 00:12:54.920 |
is less productive than one with access to 64 GPUs. 00:12:57.980 |
You can invest in AI tools that multiply productivity, right? 00:13:03.960 |
that can abstract away a lot of these edges for you. 00:13:17.680 |
Yes, I think we do the microphone for questions, right? 00:13:26.160 |
So when you went from 30 to 15 and then to seven, 00:13:40.300 |
on reducing the domains that you were focusing on 00:13:44.600 |
or it was like basically your same product offering 00:13:50.980 |
So at a very high level, we offered the same product, 00:13:53.660 |
but we cut some features that were less relevant. 00:13:58.240 |
that you kind of like end up building over the years. 00:14:01.340 |
And we ended up slicing a lot of those edges. 00:14:03.480 |
So I think what happens when you hire a lot of people 00:14:10.860 |
that actually aren't that useful to the customer. 00:14:14.820 |
that you actually have to ruthlessly prioritize. 00:14:16.700 |
And I think you always want to be in that zone. 00:14:36.040 |
of a giant company that's been around for 100 years, 00:14:42.900 |
It's got super comfortable with revenue stream. 00:14:48.380 |
And they're clearly folding over on themselves 00:14:57.560 |
I would say the people who want to change the culture, 00:15:02.080 |
go start a small company and build the same thing, 00:15:12.920 |
I've worked at the State Department, Pepsi, UPS. 00:15:24.340 |
and they start to buy up those small startups 00:15:35.780 |
how you source these really good generalists. 00:15:55.200 |
But yeah, I don't have a great answer to that,