Mia

Mia

Users
Tweets

Mia

@MiaAI_lab

12m

Replying to @AlicanKiraz0

Is this with MTP?

Barbara Drnac

Barbara Drnac

@paradaplesa

14m

Vabim na ogled danes ob 18.45 na TV3:) ples ob drogu in izvrsten trio MTP;) paradaplesa.si/tristosestdes…

デストロヤ

デストロヤ

@destroyerOlibs

15m

This is why im putting all my flecktarn in a bag for disposal. DPM and MTP are cooler anyway

Ashley Kitsune🔆🏳️‍⚧️🇺🇦@val_kitsune

Jun 12

Local weirdo gets her jacket

Manas Muduli

Rum Boi retweeted

Manas Muduli

@manas_muduli

23h

A massive industrial project is taking shape in Gopalpur, Odisha. > Investment - Rs 2,675 crore > Capacity - 376,000 MTPA > Project status - Final stages of completion and commissioning > Product - Technical Ammonium Nitrate (TAN) > Investor - Smartchem Technologies Limited (STL), a 100% subsidiary of Deepak Fertilizers and Petrochemicals Corporation Limited (DFPCL)

452

10,069

ママファイ🐎

ママファイ🐎@mummify77

22m

阪神 11R 宝塚記念私の夢はmtp決着 ◎ダノンデサイル ◯レガレイラ 3連単2頭軸マルチ相手もクロワとタバルのみ JCから1着固定のダノンデサイル本命荒れなくていい

596

jello

Val Keynes retweeted

jello @KatSanrio38291

#albertjamesmoriarty #moriartythepatriot #mtp #yuumori i want more info on his mental state🙏

288

Польський карантин

Польський карантин @Gun_tello

26m

Replying to @sakurayukiai @LottoLabs

I’m running Qwen3.6-27b-MTP-UD-Q4-K_M with 96k context on my 3090 without KV cache quantisation. Non-MTP gets 128k context. KV at 8 bit gets more…

Han Xiao

keithofaptos retweeted

Han Xiao

@hxiao

Jun 12

Model using Qwen3.6-35b-A3b-Q3_K_XL-MTP from @UnslothAI which i found the best in both quality & speed on low-budget gpu like L4 24gb, which is also cheap to scale out. Source: github.com/hanxiao/knowledge… Demo: hanxiao.io/knowledge-graph/

GitHub - hanxiao/knowledge-graph-extractor: Turn any document or a whole zip into an interactive...

Turn any document or a whole zip into an interactive knowledge graph, using a self-hosted Qwen3.6-35B-A3B-MTP on a single NVIDIA L4 - hanxiao/knowledge-graph-extractor

github.com

2,407

Zhang Hao_Outfit

𝄞 retweeted

Zhang Hao_Outfit @ZhangHao_Outfit

260613 📌CASIO ⌚|MTP-M305D-1AV ©Chapters_Hao725 #ZHANGHAO #장하오 #章昊 #ジャンハオ #AND2BLE #앤더블

1,712

Pino

Pino @pinocookies

33m

Unsloth just showed Gemma 4 hitting 162t/s on 12B with MTP on 6gb RAM That number is absurd to see, but the real shift that it is showing is what happens when inference outpaces human reading speed. And we’re about already there for single consumer GPU.

Unsloth AI

@UnslothAI

Jun 11

Gemma 4 now runs 2x faster with MTP GGUFs! Run locally on just 6GB RAM. ⚡️ MTP enables Google Gemma 4 run ~1.4–2.2× faster with no accuracy loss. Gemma 4 12B MTP can run at 162 t/s vs. 52 t/s without MTP. 31B reaches 101 t/s. GGUFs Guide: unsloth.ai/docs/models/mtp

Han Xiao

Han Xiao

@hxiao

44m

Replying to @ApplyWiseAi

for mac/5090, 100 tps is already a very very optimistic for a 27b dense/35b-a3b with Q3/MTP/some kv-cache tricks and would only keep this high with <16K context. parallel execution or multi-slot is much much slower.

shinelyy.⚝

neyonta retweeted

shinelyy.⚝@s_shinelyy

Mar 22

#ReadAWrite #วิลเชอร์ #ไมอัล #willsher #myal #mtp ขอฝากฟิคเรื่องนี้ไว้ในใจทุกคนด้วยนะคะ👉🏻👈🏻🩷🩷 (อัปทุกวันเสาร์ค่ะ) #วิชาที่ชอบเธอที่ใช่ | วิลเชอร์ readawrite.com/a/d3555050dbd…

104

2,005

Thomas Xia

Thomas Xia @ThomasXia7ujq

Replying to @henghaer123

做infra吧，国内厂商都得买华为卡，华为卡下限低但是极致优化以后或许性能还算OK，但是太难优化了所以对手都不优化，自己针对自己的旗舰模型优化好了成本就能比对手低的多，那自己实质上就成了唯一一个能卖高性价比token的厂商，然后minimax的思路是把关键的MTP什么的全部藏起来

Asahi

Asahi @Asahi_PC_OTAKU

Replying to @accosan_

インテルMTPまで行かんからななぁ… 最大でも80W前後だから_(:3 」∠)_

Alok

priya joseph retweeted

Alok

@analogalok

Jun 12

unsloth/gemma-4-E2B-it-qat-GGUF at main mtp draft model available in the same repo huggingface.co/unsloth/gemma…

unsloth/gemma-4-E2B-it-qat-GGUF · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

313

Alok

priya joseph retweeted

Alok

@analogalok

Jun 12

You don't even need a laptop to learn local llms! I just ran Unsloth Gemma 4 E2B QAT Multi Token Prediction (MTP) - 12 tokens/sec on a 6 years old phone's cpu with llama.cpp and termux! Unsloth just dropped MTP draft assistant GGUFs for every Gemma 4 model. naturally I yolo'd it straight onto Android to see what happens. not the 2 bit quant. UD-Q4_K_XL. works on any phone with ≥8 GB RAM. # Device: Note 20 Ultra (6 years old) -without MTP -> 7-9 tok/s -with MTP -> 9-12 tok/s ~20-30% faster on a phone. free speedup. I'll take it. # copy the command: LD_LIBRARY_PATH=. ./llama-server \ -m ~/storage/shared/llm/gemma-4-E2B-it-qat-UD-Q4_K_XL.gguf \ --spec-type draft-mtp \ --spec-draft-model ~/storage/shared/llm/mtp-gemma-4-E2B-it.gguf \ --spec-draft-n-max 4 \ --spec-draft-p-min 0.6 \ -c 4096 -t 4 --port 8080 --no-mmap -v beginner friendly Termux guide to run ggufs with llama.cpp on android HuggingFace model link in the comments. no excuses.

0:20

Unsloth AI

@UnslothAI

Jun 11

147

23,582

Sim⁷ 🍓🍷

rumi🎀⁷ needs help🙏🏽 retweeted

Sim⁷ 🍓🍷@kvdotes

20h

i am studying the Medical Termination of Pregnancy (MTP) Act, 1971 right now and i hope NO ONE TOUCHES OR AMENDS THIS LAW EVER

915

26,032

Nimish Malde

Nimish Malde @MaldeNimish

Replying to @NagakrishnaCho1 @MTPHereToHelp @CPMumbaiPolice @myBESTBus @mybmc @mumbaimatterz @public_pulseIN @RoadsOfMumbai @ckdadar @mid_waytimes @Mumbaikhabar9 @mumbaivoice9 @mjdoshi

The educate class are responsible who drove high end cars but don't have common sense and the MTP, BEST, BMC who doesn't act and stop this problem

Williamliao

Williamliao @MagicWilliam

3/3 VRAM & Verdict In tight VRAM (e.g. 16GB) or long context, EAGLE-3's speculator cost can choke performance. If acceptance rate drops, it's slower than raw inference. MTP: Reliable daily driver. EAGLE-3: Specialized nitro-boost for structured outputs.