Saiyam Pathak

ranadeepreddybasani retweeted

Saiyam Pathak

@SaiyamPathak

After I complete my Local LLM series on @NVIDIAAI DGX Spark, I will be writing 7 days of @GeminiApp! Are you excited? The Local LLM series has been doing good rounds on the internet! Next is about the inference engines, LFG! I hope you like the animations ;)

Vasko

@RoliumGens

Replying to @mr_r0b0t @NVIDIAAI

Ws in the chat

Erman Eroğlu

@geldeki

Replying to @mr_r0b0t @Tech2Wild @NVIDIAAI

This is your 3rd spark 😱 how much is that in your local? Here we can’t find it.

"Andy" Antti Törrönen @torronen

Replying to @MiaAI_lab @dee_hw @NVIDIAAI

Did you try with multiple DGX Sparks yet? My friend bought one and has been very disappointed with the performance; memory bandwidth throttles it to being just a small demo of DGX.

Mia

@MiaAI_lab

Replying to @mr_r0b0t @Tech2Wild @NVIDIAAI

3 can't be used for TP though... you want it for more concurrent sessions and more kv cache?

Icarus Hermes @IcarusHermes_

$ICARUS is live. CA: AxXHcoX6y1GhebHg5dHhgw82yFaLBJmhbb5qNEmkpump 2x NVIDIA DGX boxes linked up to run the newest Nemotron model locally. Had to buy another :) @NVIDIAAI

J A Z I I

@notjazii

Replying to @mr_r0b0t @NVIDIAAI

Ohh my Bro is about to make his own data center 😆

Saiyam Pathak

@SaiyamPathak

Replying to @ChowGPT @NVIDIAAI @GeminiApp

yes - blog.kubesimplify.com/series…

7 Days of DGX Spark

Hands-on with NVIDIA DGX Spark, from unboxing to running 120B-parameter models.

blog.kubesimplify.com

Adrian Scott | A.I. Business Upscaling

@adrianscottcom

Replying to @SemiAnalysis_ @MiniMax_AI @inferact @NVIDIAAI

Read the license

88A_BTC Reddio KGeN MemHustle retweeted

NVIDIA AI

@NVIDIAAI

Jun 12

Congrats to the @MiniMax_AI team on the release of MiniMax M3, a long-context multimodal model for text, image, and video reasoning. 🙌 Try it today with our free GPU-accelerated endpoint on build.nvidia.com. Details: nvda.ws/4v4BWhD

MiniMax (official)

@MiniMax_AI

Jun 12

MiniMax M3, Open-Weight, Now On Hugging Face, with only ~428B parameters and ~23B activated parameters. Weights: huggingface.co/MiniMaxAI/Min… MiniMax Sparse Attention: huggingface.co/papers/2606.1…

Michele Mattioni @mattions

@NVIDIAAI I'm developing a system to run local AI. I was wondering if I could get a DGX Spark to figure out the real concurrency you can obtain from that machine

ChowGPT @ChowGPT

Replying to @SaiyamPathak @NVIDIAAI @GeminiApp

Is there any place to learn this?

Hozefa Lakadawala

@HLakadawala

Replying to @mr_r0b0t @NVIDIAAI

Not bad

parth. @TotallyNotParth

Replying to @mr_r0b0t @NVIDIAAI

🔥🔥🔥🔥We up

Micha(el) Bladowski 🇩🇪 🇺🇦

@michabbb

Replying to @mr_r0b0t @NVIDIAAI

I'm waiting for the AMD Ryzen AI Halo

Tanmay Garg

@garg10may

Replying to @NVIDIAAI

Useful eval: compositional failure, not just FPS. When a requested behavior is missing or ambiguous, does the primitive refuse, recover, or choose a plausible wrong motion? For robotics, report contact-rich success under perturbation and recovery latency.

Yongyi Xu

@yongyi_xu

Also, I just realized NVDA hosts free open-source models like k2.6, m3, and glm5.1. Low-key waiting for k2.7 to drop on build.nvidia.com @NVIDIAAI