some thoughts on the business of fine-tuning
1) thinking machines acquired workshop labs, which was doing llm personalization via fine-tuning
2) thinking machines probably acquired workshop labs to build out an application level product on top of their fine tuning apis
3) it's not quite clear whether this product is intended to be b2b or b2c; workshop labs was agnostic; thinking feels b2b so probably b2b
4) anyway, b2b fine-tuning has always been a hard market and no one has really seen success in it yet despite many attempts
5) the first issue is that fine-tuning tends to provide quickly depreciating value even if you get it right
6) if you use open models then you are already behind the frontier and so your fine tuning has to be able to bridge and then exceed that capability gap
7) and, once you do your fine-tuning, a new better model will be released soon anyway, so to retain any benefit you need a whole fine-tune and deploy cycle
8) and, this is assuming that the new model just doesn't already have all the capabilities of your data, such that the fine tuning still provides an additional advantage
9) the second issue is that the value of fine-tuning is expensive to capture in the first place; it is hard to collect data and do deployments
10) organizations are not designed to mine their own task data and create benchmarks; this requires special expertise companies don't tend to have
11) i also believe that this is process is likely to be organizationally difficult for a company to manage and work through; since it involves collaboration between different teams
12) also, if you want to use multiple different fine-tuned models across your application, you have just made that application much more complicated to update
13) so forward deployment seems very important to support enterprise fine-tuning in high value industries like drug development, semiconductors, finance, etc...
14) because you need to help solve their data collection, benchmark creation, model deployment and organizational difficulties in order to help them get value
15) so, my feeling is that anyone that wants to work on this should focus on high value, specialized industries, that the labs will not commoditize and which can afford forward deployment
16) and, you need to develop very comprehensive forward deployment that helps the customer ideate on problems, solve organization difficulties, and do process mining to collect the data
17) if you can solve this for very large scientific and financial customers, you can probably begin to build automated agents that would help extend this to a wider range of specialized enterprise customers
18) anyway, this would be my advice to thinky or anyone that wants to do enterprise fine tuning, develop a forward deployed team, with a set of enabling software solutions for process mining, rubric creation, etc...