NO ONE CARES ABOUT CODE

Joined March 2018
100 Photos and videos
MiniMax just dropped M3's open weights on a Friday, and the LLM wars just got real. Frontier-level coding practical 1M context at 15x lower cost than Opus. The #OpenWeights ceiling just broke. banandre.com/blog/minimax-m3…
1
2
67
Google DeepMind's #Gemma 4 12B just made your laptop a local multimodal AI workstation. Video, audio, text, all on 16GB RAM, no cloud banandre.com/blog/gemma-4-12…
44
Forced to use #Databricks for 100k rows in Postgres? The real cost isn't just $9k, $19k/year, it's the #PlatformOverhead killing your team's velocity. Push back with data banandre.com/blog/databricks…
15
Gemma 4 MTP in llama.cpp b9549 pushes 12B to 140 tok/s on a 12GB RTX 4070. No separate draft model needed, just a co-trained prediction head. #Gemma4MTP is turning dense models into #SpeedDemons But if you're on MoE, don't expect the same boost banandre.com/blog/gemma-4-mt…
36
#VoidZero joins #Cloudflare Vite's edge native tooling just got a global network. But does the build-to-deploy seamless flow mean vendor lock-in for JavaScript toolchains? The architecture is sound, but the gravitational pull is real banandre.com/blog/cloudflare…
24
Nvidia's 550B #Nemotron3Ultra MoE fits 8 H100s, 55B active params, 5x throughput, 1M context. It's not magic, just damn good engineering. Latent MoE and #MOPD training make frontier-level reasoning deployable on a single DGX node banandre.com/blog/nvidia-nem…
13
Google's #Gemma4 12B ditches separate encoders, bringing native vision & audio to 16GB laptops. Single #EncoderFreeTransformer handles text, images, and audio in one pass. 256K context, runs locally. The #LocalMultimodalAgent dream is real banandre.com/blog/google-gem…
2
56
#MiniMax M3 beats GPT-5.5 on SWE-Bench for $0.30/M input tokens. With 1M context and 9.4× speedup on CUDA kernels, the #OpenWeight model just rewrote the #AgenticAI cost math. Implications? banandre.com/blog/minimax-m3…
40