Filter
Exclude
Time range
-
Near
ranadeepreddybasani retweeted
After I complete my Local LLM series on @NVIDIAAI DGX Spark. Iw ill be writing 7 days of @GeminiApp! Are you excited? The Local LLM series has been doing good rounds on the internet! next is about the inference engines, LFG! I hope you like the animations ;)
1
3
17
502
Replying to @mr_r0b0t @NVIDIAAI
Ws in the chat
2
This is your 3rd spark 😱 how much is that in your local? Here we can’t find it.
4
Did you try with multiple DGX Sparks yet? My frieng bought one and has been very disappointed of the performance, memory bandwidth throttles is to just being a small demo of DGX
5
3 can't be used for TP though... you want it for more concurrent sessions and more kv cache?
7
$ICARUS is live . CA : AxXHcoX6y1GhebHg5dHhgw82yFaLBJmhbb5qNEmkpump 2x NVIDIA DGX boxes linked up to run the newest Nemotron model locally. had to buy another :) @NVIDIAAI
25
Replying to @mr_r0b0t @NVIDIAAI
Ohh my Bro is about to make his own data center 😆
16
88A_BTC Reddio KGeN MemHustle retweeted
Congrats to the @MiniMax_AI team on the release of MiniMax M3, a long-context multimodal model for text, image, and video reasoning. 🙌 Try it today with our free GPU-accelerated endpoint on build.nvidia.com. Details: nvda.ws/4v4BWhD
MiniMax M3, Open-Weight, Now On Hugging Face , with only ~428B parameters and ~23B activated parameters Weights: huggingface.co/MiniMaxAI/Min… MiniMax Sparse Attention: huggingface.co/papers/2606.1…
51
116
1,317
134,896
@NVIDIAAI I'm developing a system to run local AI. I was wondering if I could get a DGX Spark to figure out the real concurrency you can obtain from that machine
19
Is there any place to learn this ?
1
20
Replying to @mr_r0b0t @NVIDIAAI
🔥🔥🔥🔥We up
6
Replying to @mr_r0b0t @NVIDIAAI
I'm waiting for the amd Ryzen AI helo
8
Replying to @NVIDIAAI
Useful eval: compositional failure, not just FPS. When a requested behavior is missing or ambiguous, does the primitive refuse, recover, or choose a plausible wrong motion? For robotics, report contact-rich success under perturbation recovery latency.
6
also I just realized NVDA hosts free opensource models like k2.6 and m3 and glm5.1 low key waiting for k2.7 to drop in build.nvidia.com @NVIDIAAI
11