I recently switched from Qwen 3.5 9B to LFM2.5-8B-A1B by
@liquidai, and it's quickly become my default local model in Hermes Agent Desktop.
For agentic tasks, it's one of the strongest local models I've used so far. It's surprisingly fast, reliable, and works really well with tools.
Coding is still where it struggles the most.
Other than that, it's been consistently solid and easily one of my favorite local models right now.
Today, we're releasing LFM2.5-8B-A1B, a device-optimized model designed to power real-life applications on phones, laptops, PCs, robots, and fast & lightweight server-side use-cases.
> 8B MoE, 1.5B active
> Expanded 128K context
> LFM2.5 flagship hybrid MoE architecture
> Trained on 38T tokens large-scale RL
> fast, reliable tool calling, punching above its weight, comparable to models with up to 4x its size
> customizable on a single GPU for any specialized task
> LFM2 open-weight license
🧵