Hello again, everyone! Welcome, Qwopus 3.5-Coder 4B!
Lots of awesome model drops are coming out, so we've got so many great new candidates for fine-tuning and dataset generation. We're so pumped and have a lot of great experiments running currently!
We've put together this significantly smaller coder model, Qwopus Coder 4B, and it seems to be impressive for something that could run well on most smartphones, or really fast on older GPUs.
It scored a 43.5% on a 225 slice of swe bench mini for completed patches, 32.5% for all patches, including empties due to missing the specific format required by swe, but on the ones that it output patches, it performed surprisingly well at 73/168 patches submitted for 43.5%
Bear in mind, this is a tiny 4b model with additional coding training and COT improvements. I was able to make a neon snake game (HF space link in comments to try) in just a few turns of the model. It's laser fast running at 270tps at Q8 with MTP on my 5090, with tons of headroom for concurrent instances! I was able to get over 500tps aggregate with parallel requests running SWE bench with it!
It also shows improvement in
@stevibe's BenchLocal agent and coding benchmarks! Check out the full results in the model card!
If you want to do some simple HTML game coding at lightning speeds on older hardware or less VRAM, I strongly recommend playing with it! Or if you want an intelligent model to do some serious swarm data cleaning or large dataset processing, this could be an excellent option!
Blessed to be here; you all are so enjoyable to engage with! Please let us know your thoughts in the comment section, and let us know what use cases jump out to you for a small 4b model like this one!
huggingface.co/Jackrong/Qwop…