I ❤️ GPUs

Joined September 2009
197 Photos and videos
I’m late to the DGX Spark party but I finally got an NVIDIA Blackwell GB10 box!
Given the TPU buzz, here's a reminder: "Why systolic architectures?", H.T. Kung, Jan 1982 eecs.harvard.edu/~htk/public…

Kung, H.T. "Why systolic architectures?" Jan 1982: csis.bits-pilani.ac.in/facul…
Solid `vkpeak` results from the tiny Mali G310 GPU on an Amlogic S905X5M:
fp32-scalar = 52.58 GFLOPS
fp16-vec4 = 99.07 GFLOPS
int8-dotprod = 213.88 GIOPS
Gist with `vulkaninfo` output here: gist.github.com/allanmac/5c1…
Good `vkpeak` benchmarks from an RK3588 Arm Mali G610 GPU:
fp32-scalar = 467.89 GFLOPS
fp32-vec4 = 496.97 GFLOPS
fp16-scalar = 471.15 GFLOPS
fp16-vec4 = 978.09 GFLOPS
int8-dotprod = 1884.12 GIOPS
Given $INTC is in the news, it's nice to see that the latest LM Studio and the OpenAI gpt-oss-20b model run at ~47 tok/s on an A770 using Vulkan.
Fun with Google Sheets AI: =AI("Should I buy this stock? Only answer yes or no."). Every symbol returns, "I can't provide financial advice" except for NVIDIA and ADOBE where the answer is "No." I love it!🤷‍♂️
Allan MacKinnon retweeted
1 Jan 2025
Last night I watched an elderly fam member navigate her iPhone for ‘important things and emails’. She is 96 and holding on to be tech relevant. Here is what I observed… 🧵
Excellent GNSS reception for a flexible antenna!
My dad found this button from his time working on the Lunar Rover. Apollo 15 was the first to use the moon buggy. Good ink, it hasn’t faded very much!
You all said “Numerical Linear Algebra, L. Trefethen, and D. Bau. SIAM, (1997)” is invaluable so I got the paperback (PDF out there too).
Allan MacKinnon retweeted
I've been asked about this a lot, so let me provide a quick FAQ. Q: What's the nature of the issue? A: Anyone who has bought my book from Amazon in the past few months hasn't bought a genuine copy, but a lower-quality counterfeit copy printed by various fraudulent sellers.
Replying to @fchollet
For instance, if you go to the page of DLwP2 on Amazon, you see that it's being sold by a 3rd party seller named "Sacred Gamez". If you click "buy", you won't get the actual book from Manning. You get a low-quality counterfeit printed by the fraudulent seller (from the book PDF).
Allan MacKinnon retweeted
A former Wall Street bond trader now coaches the dominant team in competitive high school math. “I used to get paid money,” says Will Frazer. “Now we get trophies.” on.wsj.com/3cgzqBG
My very fast GPU-accelerated radix sort Vulkan library got pulled into the Mesa RADV Vulkan driver: phoronix.com/scan.php?page=n… #vulkan #gpu #gpucompute #mesa #amd #radeon

Allan MacKinnon retweeted
DietGPU: fast specialized lossless compression on Nvidia GPUs If you have slow network fabric, this can speedup distributed training by a lot. Authored by Jeff Johnson (gh:wickedfoo) who wrote a lot of the PyTorch CUDA code and faiss-gpu. github.com/facebookresearch/…
Intel® Processor Graphics Gen11 Architecture 👉 software.intel.com/sites/def… #gpu #intel #gen11 #gdc2019
