back to indexSynthetic data + tool use for LLM improvements 🦙
00:00:00.000 |
What I'm excited about is, we had this survey on augmented LLM a year ago, and all the idea 00:00:06.600 |
is like, if you augment your LLM with something else, it can be a retriever, it can be search, 00:00:11.960 |
it can be a tool, it can be a calculator, it can be a code execution, then you are not 00:00:17.560 |
just distillating, like doing some data augmentation with your model, but you're actually adding 00:00:23.440 |
some expert skills that possibly goes beyond the model weights. 00:00:28.200 |
For instance, if your model can calculate something it was wrong before, and now it 00:00:34.520 |
has access to a calculator, and you can retrain your model on that, then you're learning something 00:00:39.840 |
If your model didn't know something about LLM 2, it probably doesn't know a lot about 00:00:43.480 |
LLM 3, but now if it can search online about it, and then you train the model on that, 00:00:48.760 |
Then you have a positive feedback loop like what we call expert iteration.