Account not used anymore. Find me on bsky.app/profile/philmod.bsk…

Joined January 2011
245 Photos and videos
Pinned Tweet
How Kaggle load-balances a gRPC application across multiple GKE clusters. buff.ly/2XHiMBG

1
34
88
New compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss. research.google/blog/turboqu…
1
58
"LLMs optimize for plausibility over correctness. In this case, plausible is about 20,000 times slower than correct." x.com/KatanaLarp/status/2029…

1
1
1,055
Is AI singularity here already? x.com/karpathy/status/203037…

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then: - the human iterates on the prompt (.md) - the AI agent iterates on the training code (.py) The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. github.com/karpathy/autorese… Part code, part sci-fi, and a pinch of psychosis :)
2
157
The venue (@lotto_arena) decided to change my @ElectricCallboy concert ticket from the pit to the seats, unilaterally. Really not cool! What's the point of booking my ticket year in advance if I can't enjoy the show the way I'd prefer?
122
I'm leaving Twitter, find me on bluesky: bsky.app/profile/philmod.bsk…

1
113
Why Chatbots Are Not the Future buff.ly/3VlzD9b

127
Le titre de la webpage de @Keytradebank 💩🤣 keytradebank.be/fr/notre-blo…
170
ELI5: Flash Attention buff.ly/3NCQq4A
127
A Visual Guide to Quantization buff.ly/4fsrR78
144
TIL: Python's tarfile is slower than the tar CLI because it uses a compression level of 9 vs 6 for the CLI.
1
1
307
An Open-Ended Embodied Agent with Large Language Models buff.ly/3BWIDsk

163
Interesting write-up about a path to AGI and Superintelligence, and the risks associated buff.ly/4ea1UIE

1
158