Mia

Mia

30 Photos and videos

Tweets

Mia

@MiaAI_lab

14h

DeepSeek-v4-Flash beats Step-3.7-Flash in head-to-head tool calling benchmark. Full results in: github.com/MiaAI-Lab/DeepSee…

2,302

Mia

Mia

@MiaAI_lab

15h

Local agentic 'Tool-Call Benchmark' between DeepSeek-v4-Flash to Step-3.7-Flash. Same host, same 69 scenarios, two models. Results: DeepSeek-v4-Flash: 90/100 quality, 59 passed, 6 partial, 4 failed Step-3.7-Flash: 87/100 quality, 55 passed, 10 partial, 4 failed 👇

1,312

more replies

Mia

Mia

@MiaAI_lab

15h

Bottom line: DeepSeek-V4-Flash wins overall (90/100 vs 87/100) because it’s more reliable across long chains and structured outputs. Step-3.7-Flash is competitive and actually safer/more disciplined in a few specific scenarios, but it drops more partials and struggles more with multi-turn execution.

160

Mia

Mia

@MiaAI_lab

14h

Full results here: github.com/MiaAI-Lab/DeepSee…

GitHub - MiaAI-Lab/DeepSeek-v4-Flash-vs-Step-3.7-Flash-Tool-Call-Benchmark: Head-to-head comparison...

Head-to-head comparison of DeepSeek-V4-Flash vs Step-3.7-Flash on tool-eval-bench v2.0.6 (69 scenarios). Full results, summary, and analysis. - MiaAI-Lab/DeepSeek-v4-Flash-vs-Step-3.7-Flash-Tool-Ca...

github.com

118

Mia

Mia

@MiaAI_lab

16h

Running agentic coding benchmarks on DeepSeek-v4-Flash and Step-3.7-Flash. Will post results soon.

1,707

Mia

Mia

@MiaAI_lab

15h

Full results here: x.com/MiaAI_lab/status/20658…

Mia

@MiaAI_lab

15h

126

Mia

Mia

@MiaAI_lab

16h

RepoPrompt for Windows Open any project folder → select exactly which files matter → generate clean, LLM-optimized XML output. 📁 Open any project folder ✅ Select exactly which files matter 💰 Set your token budget 📋 Generate clean, LLM-optimized XML output 🪟 Built for Windows 🔒 Local & private 📦 Free Try it out here: github.com/MiaAI-Lab/repopro…

GitHub - MiaAI-Lab/repoprompt-windows: RepoPrompt for Windows — Context Engineering Tool for AI...

RepoPrompt for Windows — Context Engineering Tool for AI Coding Agents. Windows port of repoprompt/repoprompt-ce. - MiaAI-Lab/repoprompt-windows

github.com

Mia

Mia

@MiaAI_lab

21h

Diffusion Gemma is 4x faster, but makes 6x more mistakes.

0:32

140

Mia

Mia

@MiaAI_lab

Jun 12

I just published Slate — a fast, light-weight OLED-friendly Markdown/text editor. It supports editing all types of text-based files. One thing I really wanted: a proper OLED-friendly editor. Not “dark gray” — complete black, so it looks great on OLED displays and feels easy on the eyes at night. Fully developed by local AI. Currently Windows only. Feel free to fork and build for Mac/Linux. Feel free to test it, open issues, report bugs, or suggest ideas. github.com/MiaAI-Lab/Slate

GitHub - MiaAI-Lab/Slate: A fast, light-weight OLED-friendly MD/text editor.

A fast, light-weight OLED-friendly MD/text editor. - MiaAI-Lab/Slate

github.com

160

NVIDIA AI

Mia retweeted

NVIDIA AI

@NVIDIAAI

Jun 12

Congrats to the @MiniMax_AI team on the release of MiniMax M3, a long-context multimodal model for text, image, and video reasoning. 🙌 Try it today with our free GPU-accelerated endpoint on build.nvidia.com. Details: nvda.ws/4v4BWhD

MiniMax (official)

@MiniMax_AI

Jun 12

MiniMax M3, Open-Weight, Now On Hugging Face , with only ~428B parameters and ~23B activated parameters Weights: huggingface.co/MiniMaxAI/Min… MiniMax Sparse Attention: huggingface.co/papers/2606.1…

116

1,315

134,408

Mia

Mia

@MiaAI_lab

Jun 12

Building the things you couldn't find anyone else building has never been easier.

Mia

Mia

@MiaAI_lab

Jun 12

👀

Mia

Mia

@MiaAI_lab

Jun 12

Monster. Can probably fit into 8x @NVIDIAAI DGX Sparks. Out of my reach, for now.

Kimi.ai

@Kimi_Moonshot

Jun 12

🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced! 🔷 Improved coding & agent performance over K2.6: 21.8% on Kimi Code Bench v2, 11.0% on Program Bench, and 31.5% on MLS Bench Lite. 🔷 Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6. 🔷 Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates. ⚡️ 6x High-Speed Mode coming soon! 🔌 Available today via Kimi API and Kimi Code. 🔗 Kimi Code: kimi.com/code 🔗 API: platform.moonshot.ai

188

Mia

Mia

@MiaAI_lab

Jun 12

Two concurrent sessions with DS4-Flash, getting more than 60 tok/s and insane prefill numbers. Running on 2x @NVIDIAAI DGX Sparks

4,420

Mia

Mia

@MiaAI_lab

Jun 12

Codex app on Windows running DeepSeek-v4-Flash through Codex Shim, running on 2x @NVIDIAAI DGX Sparks. @0xSero Works so well...

1,860