back to index

The Future of Work: Toran Bruce Richards, Silen Naihin et al


Whisper Transcript | Transcript Only Page

00:00:00.000 | Thank you, San Francisco, for the warm welcome.
00:00:17.160 | I'm Torrin, the creator of AutoGBT, and I'm excited to show you all what the brilliant minds at AutoGBT have been working on over the past months.
00:00:27.560 | I'm going to hand off the stage now to Selen, one of our founding AI engineers.
00:00:32.320 | Thank you, Torrin.
00:00:35.500 | Thank you.
00:00:37.200 | There you go.
00:00:37.920 | Thank you, Torrin.
00:00:39.560 | I want to talk about something that I think not many of you realize.
00:00:44.580 | I didn't realize this for a long time.
00:00:47.040 | We're not achieving the peak of our potentials.
00:00:49.720 | We can all work faster, we can work better, and we can do more.
00:00:55.120 | With less time and less stress.
00:00:57.540 | Let's take this spreadsheet, for example.
00:01:00.440 | I don't know about you, but I've stared at this interface for hours on end.
00:01:04.780 | And I'm sick of it.
00:01:07.160 | How would you go about filling out this spreadsheet?
00:01:09.580 | It's the lead generating.
00:01:11.500 | Name of the company.
00:01:12.480 | You've got the links.
00:01:13.880 | What you'd probably do is you go on Google, you search, copy, paste, maybe go on LinkedIn, copy, paste, back to Google, over and over and over again for hours, going back to the same interface, going back to the same websites.
00:01:28.520 | But what if instead of all that, you can just chat?
00:01:32.760 | And you get the same end result, a filled out spreadsheet with all the leads.
00:01:36.820 | All right, let me give you another example.
00:01:38.520 | We all have unread messages, right?
00:01:41.580 | Not because we're lazy, allegedly, but because we're overwhelmed.
00:01:46.800 | Now, how would you go about cleaning out your inbox?
00:01:48.720 | You'd sit there for hours and hours, sending the same variation of the same email.
00:01:54.920 | But what if you could just chat?
00:01:59.420 | Last example, I promise.
00:02:01.540 | Actually, these emails will now be leads in your inbox instead of just unread emails.
00:02:09.800 | Last example, say you're a company or a developer, you spend millions of dollars developing apps that take weeks, months, sometimes even years, sitting there, copy-pasting anyways, because you're probably using ChatGPT.
00:02:26.020 | I know I am, copy-pasting.
00:02:27.560 | But what if, instead of all that effort, you just chat?
00:02:32.020 | I think you get the point.
00:02:34.140 | There's a reason you've heard of AutoGPT.
00:02:39.260 | ChatGPT inspired the minds of millions.
00:02:41.540 | It gave hope to what a world could look like where we all reach our full potential, the light at the end of the tunnel.
00:02:48.280 | You could see the sparks of digital artificial intelligence.
00:02:52.140 | And in this world, everyone goes from using their minds mostly to execute menial tasks with only 10% of their brains being used for creative work,
00:03:03.820 | to becoming creative masterminds, orchestrating the peak potential of their lives.
00:03:09.220 | And in this world, we're all AI engineers, whether you know it or not.
00:03:14.420 | And people have noticed, AutoGPT was the fastest repository to 100,000 stars.
00:03:22.460 | Every major news network picked up on this.
00:03:26.500 | Everyone understands what the potential of this is.
00:03:30.260 | And it kicked off a whole new field of development, a whole new paradigm of augmenting humans to give them time back and live a more stress-free life.
00:03:40.000 | And even the major players in the space all realized how big of a deal it is and now work on these agents.
00:03:47.580 | And so, I want to hand off to the primary open-source developer at AutoGPT to talk a little bit more about the open-source repo.
00:03:56.560 | Thank you, thank you, and hello, San Francisco!
00:04:09.080 | I think all of us being here is a real testament to the power of open-source.
00:04:17.540 | And on that note, we have some really exciting news to share.
00:04:22.200 | Because just last week, our open-source repo, AutoGPT, hit 150,000 stars on GitHub.
00:04:31.520 | Of course, metrics are fun, but to me, it is so much more than just a number.
00:04:50.740 | It is the 150,000 people who took an interest in what we're doing and decided to click that button.
00:04:57.900 | It is also the 460-plus contributors who took their time and effort submitting thousands of pull requests and issues in the process.
00:05:16.260 | And to all of them as well, thank you so much!
00:05:19.500 | It is also the 47,000 members of our online community, and all of the interesting and insightful interactions that they've given us.
00:05:31.660 | It's been a wild ride at times, but it has allowed us to do and learn so much in the past six months.
00:05:41.260 | And I'm extremely excited for what is to come, based on that.
00:05:47.820 | Now, I've already said it, but we could not have done this without our community.
00:05:56.540 | And community matters.
00:05:57.740 | So, we are committed to fostering, to growing, and to empower this community, and to build the future together.
00:06:09.820 | And I'll hand it back to Slam to tell you what that means.
00:06:19.020 | Thank you, Boots.
00:06:19.740 | And we haven't stayed stagnant since the open-source agent originally came out.
00:06:24.460 | We've continued to work on it, and we've continued to improve its capabilities and implement the latest cutting-edge research.
00:06:29.820 | But we've also been working on some other things.
00:06:32.060 | To show our commitment to the agent space and the open-source ecosystem, we've built a forge, which is a template for any agent creator to have a better time to develop their agents with a standardized template.
00:06:46.380 | We also built a DevTool UI to easily interact with and iteratively improve your agent using an intuitive interface.
00:06:52.940 | All of these tools are built on top of the agent protocol from the AI Engineer Foundation and other industry standards to maximize compatibility and interoperability.
00:07:02.700 | Anyone who implements this protocol can use our benchmark, front-end DevTool, and other offerings built on top of this protocol.
00:07:11.580 | And while this DevTool template is in beta or in alpha, it has served our participants of the current hackathon we're running, where we have $30,000 in cash on the line.
00:07:20.460 | And we've learned a lot from this.
00:07:22.540 | We've received a lot of great feedback.
00:07:24.220 | We've received a lot of bug fixes and insights that we're going to take into the future.
00:07:28.060 | One of those insights is that code is king.
00:07:35.340 | We've realized that coding agents are the fundamental agents of the world.
00:07:40.300 | Let me tell you, the digital fabric, the fundamental digital fabric is code.
00:07:48.220 | Our goal is to build a generalist agent, yes, but code is the stepping stone to AGI.
00:07:56.060 | A motivated coder can get anything done, except for get a bed frame.
00:08:01.260 | Another thing that we've learned over time is that without a compass, you don't know where you're going.
00:08:08.860 | You know, at the start of the repo, we were getting thousands of pull requests, and that's a pull request every two hours.
00:08:17.260 | We had no way to know whether the pull requests were good, and how do we even test these pull requests.
00:08:22.460 | We didn't have a real direction.
00:08:23.580 | It took time to test these, and it was unnecessarily costly.
00:08:27.420 | And so we created a compass.
00:08:29.660 | We created a benchmark to direct the development of the open source repo and quantitatively know if we were improving.
00:08:35.580 | It's an easy way to know if your agents are improving down different categories, and people are currently benefiting from this for the virtual hackathon.
00:08:44.700 | And, this is just cool, we've been running this in our CI pipeline for the past couple months on different open source agents within the ecosystem.
00:08:52.940 | And what the tests have shown is that we're on the brink of something special.
00:08:56.940 | These agents have showed continual improvement, and don't worry, I wouldn't put this in a research paper.
00:09:01.820 | It's very noisy, it's very messy, but there is a continuous trend from 35 to 55 percent.
00:09:07.660 | This is just a graph of the success rate on the benchmark over time, over the month of August.
00:09:13.180 | Another thing that we're committed to is safety.
00:09:18.300 | As the ecosystem grows, and as the capabilities of agents increase, there's always questions of trust and reliability, and these are problems that AutoGPT is committed to.
00:09:29.820 | One of these problems is prompt injection, which will always be there.
00:09:34.140 | OWASP, one of the big security organizations, has talked about this, and said this is one of the big problems that not just language models face, but also agents.
00:09:43.660 | It's essentially when agents visit a website, what all agents need to do, and the website has something malicious.
00:09:52.060 | And then the LLM is like, all right, I need to be doing that now.
00:09:54.780 | And you can see that there in this example.
00:09:57.580 | Then there's this other category that I like to call innocently malicious, where agents are just bad sometimes.
00:10:07.500 | It's the truth.
00:10:09.180 | And in this example behind me, this person asked an open source agent to delete all the JSON files within a directory, a specific one, and the agent ended up deleting all the JSON files on a laptop.
00:10:22.060 | And this is going to continue to be a problem.
00:10:25.180 | If we want agents to do the things that humans can do, they will need root access.
00:10:30.380 | And so within AutoGPT, we're committed to and think about these problems extensively.
00:10:34.060 | And we've been working on a research paper to solve some of these issues.
00:10:37.820 | And in order for agents to be commercially viable and trusted, these safety problems need to be solved.
00:10:45.420 | You can't have a 99% success rate.
00:10:48.220 | It has to be 100%.
00:10:50.140 | That one email that's sent could be a lost contract or a lost lead.
00:10:53.340 | And so this is fundamental, not just to the development of the open source agent, but to all agents out there.
00:10:59.260 | And so this is a digital AGI, to augment all of humanity.
00:11:05.740 | And I'm going to invite Craig to announce some exciting news regarding some developments with AutoGPT.
00:11:12.860 | Hello.
00:11:17.660 | So, it's been a wild journey from zero to here in six months.
00:11:26.860 | And we keep stressing this because it's so important to us.
00:11:29.580 | We're only here because of our community, because of that shared passion in pushing the frontier of what AI agents can do.
00:11:38.220 | So we're really excited to announce that Redpoint Ventures has invested $12 million in turning this vision to a reality.
00:11:48.540 | Now, this isn't just funding.
00:11:57.980 | This is them showing their deep belief in our mission and their dedication to open source.
00:12:06.780 | That's why we went with them, because they are so dedicated to staying open source.
00:12:11.820 | And that's really important to every single one of us working on this project.
00:12:17.500 | Now, this is where we need you.
00:12:18.780 | With this funding, we want to grow our team and add more passionate individuals.
00:12:24.700 | So, join us, message us, join our Discord community, and let's all help make the world's best open source generalist agent.
00:12:37.100 | Together, we can redefine the future of work.
00:12:40.380 | Thank you.