Local AI, LLMs, tech thinker & builder

Joined July 2022
DeepSeek-v4-Flash beats Step-3.7-Flash in a head-to-head tool-calling benchmark. Full results: github.com/MiaAI-Lab/DeepSee…
Local agentic 'Tool-Call Benchmark' comparing DeepSeek-v4-Flash and Step-3.7-Flash. Same host, same 69 scenarios, two models. Results:
DeepSeek-v4-Flash: 90/100 quality, 59 passed, 6 partial, 4 failed
Step-3.7-Flash: 87/100 quality, 55 passed, 10 partial, 4 failed 👇
Bottom line: DeepSeek-v4-Flash wins overall (90/100 vs 87/100) because it's more reliable across long chains and structured outputs. Step-3.7-Flash is competitive and actually safer and more disciplined in a few specific scenarios, but it records more partial passes (10 vs 6) and struggles more with multi-turn execution.
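The posts don't say how the 90/100 and 87/100 quality scores are computed. As a rough sanity check on the posted counts, here is a minimal sketch that computes a strict pass rate and a hypothetical half-credit score (full credit for a pass, half for a partial, none for a fail). The half-credit weighting and the function name are assumptions for illustration, not the benchmark's documented formula.

```python
def half_credit_score(passed: int, partial: int, failed: int) -> float:
    """Hypothetical score: a pass counts 1, a partial counts 0.5, a fail counts 0."""
    total = passed + partial + failed
    return 100 * (passed + 0.5 * partial) / total


# Counts as posted in the thread: (passed, partial, failed) over 69 scenarios.
results = {
    "DeepSeek-v4-Flash": (59, 6, 4),
    "Step-3.7-Flash": (55, 10, 4),
}

for name, (passed, partial, failed) in results.items():
    total = passed + partial + failed
    strict = 100 * passed / total  # only full passes count
    score = half_credit_score(passed, partial, failed)
    print(f"{name}: strict pass rate {strict:.1f}%, half-credit score {score:.1f}")
```

Under this assumed weighting the scores round to 90 and 87, which happens to match the posted quality numbers, though the real scoring may differ. The strict pass rates come out to roughly 85.5% and 79.7%.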
Running agentic coding benchmarks on DeepSeek-v4-Flash and Step-3.7-Flash. Will post results soon.

RepoPrompt for Windows
Open any project folder → select exactly which files matter → generate clean, LLM-optimized XML output.
📁 Open any project folder
✅ Select exactly which files matter
💰 Set your token budget
📋 Generate clean, LLM-optimized XML output
🪟 Built for Windows
🔒 Local & private
📦 Free
Try it out here: github.com/MiaAI-Lab/repopro…
Diffusion Gemma is 4x faster, but makes 6x more mistakes.
Jun 12
I just published Slate, a fast, lightweight, OLED-friendly Markdown/text editor. It supports editing all types of text-based files. One thing I really wanted: a proper OLED-friendly editor. Not “dark gray” but complete black, so it looks great on OLED displays and feels easy on the eyes at night. Fully developed by local AI. Currently Windows only. Feel free to fork and build for Mac/Linux. Feel free to test it, open issues, report bugs, or suggest ideas. github.com/MiaAI-Lab/Slate
Mia retweeted
Congrats to the @MiniMax_AI team on the release of MiniMax M3, a long-context multimodal model for text, image, and video reasoning. 🙌 Try it today with our free GPU-accelerated endpoint on build.nvidia.com. Details: nvda.ws/4v4BWhD
MiniMax M3, Open-Weight, Now On Hugging Face, with only ~428B parameters and ~23B activated parameters. Weights: huggingface.co/MiniMaxAI/Min… MiniMax Sparse Attention: huggingface.co/papers/2606.1…
Jun 12
Building the things you couldn't find anyone else building has never been easier.
Jun 12
👀
Jun 12
Monster. Can probably fit into 8x @NVIDIAAI DGX Sparks. Out of my reach, for now.
🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced! πŸ”· Improved coding & agent performance over K2.6: 21.8% on Kimi Code Bench v2, 11.0% on Program Bench, and 31.5% on MLS Bench Lite. πŸ”· Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6. πŸ”· Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates. ⚑️ 6x High-Speed Mode coming soon! πŸ”Œ Available today via Kimi API and Kimi Code. πŸ”— Kimi Code: kimi.com/code πŸ”— API: platform.moonshot.ai
Jun 12
Two concurrent sessions with DS4-Flash, getting more than 60 tok/s and insane prefill numbers. Running on 2x @NVIDIAAI DGX Sparks
Jun 12
Codex app on Windows running DeepSeek-v4-Flash through Codex Shim, running on 2x @NVIDIAAI DGX Sparks. @0xSero Works so well...