You can now fine-tune an open-source LLM without writing a single line of code.
This is unprecedented.
No-code fine-tuning is a breakthrough in the open-source world, and it will help companies adopt AI at lightning speed.
Let me show you how you can do this.
I taught a model to recognize negative content on Twitter. Setting everything up took me 10 minutes. It took another 45 minutes to have the model ready.
If you aren't familiar with the term "fine-tuning," it’s the process we use to teach a model how to solve a specific task. Large Language Models have general knowledge but struggle to solve particular problems.
Fortunately, we can fine-tune these models and make them very good at solving specific tasks. In this example, that task is to analyze the sentiment of tweets.
But there’s a massive problem:
Fine-tuning a model is a complex, expensive process. It takes a lot of time, effort, and GPU computing. It's also hard to find experienced people who know how to do it.
The team
@monsterapis built the first platform that offers no-code fine-tuning of open-source models, which changes everything. That’s what I’m using here.
Here is what you need to do:
1. Sign up here:
monsterapi.ai/signup, and use the code SANTIAGO during your purchase to get an 80% discount.
2. Go to the FineTuning option and select your model. I'm using the Falcon 7B model, but you can pick any of the following options:
• Falcon 7B
• LLaMA 7B
• Open LLaMA 3B, 7B
• OPT 125M, 350M, 1.3B, 2.7B, 6.7B
• GPT J 6B
• Stable LM 3B, 7B
• GPT 2 XL
3. Select your task. I'm using "Text Classification" since we want to classify different tweets as Positive or Negative.
4. The last step is to select your dataset. I used a twitter-sentiment-analysis dataset from HuggingFace. They have data for almost anything you can think of, but you can also upload your own.
I didn't change any of the default hyperparameters, and 45 minutes later, I had my fine-tuned version of Falcon 7B ready to go! I spent 2,320 credits in the process, equivalent to $2.50. A fine-tuned state-of-the-art model for the price of a cup of coffee!
That’s one of the
@monsterapis’ advantages: Besides not dealing with code, complexity, or hardware, their pricing is very competitive, thanks to their decentralized GPU platform.
ALT Fine-tuning a Large Language Model diagram.