CS BSc & MSc @ Stanford, systems sw

Joined December 2021
Photos and videos
Joseph Shetaye retweeted
The inaugural 440lx class ending on a high note: demo day with @jimkxa and other cracked engineers at @tenstorrent --- thank you for the equipment donations, very fun boxes🫑🫑🫑. Amazing students pictured: @houjun_liu, Tianle, Aditya , Joseph, Sam.
6
5
23
2,416
Great day with the people of @tenstorrent! It’s exciting to see what good HW and OSS is capable of.
The inaugural 440lx class ending on a high note: demo day with @jimkxa and other cracked engineers at @tenstorrent --- thank you for the equipment donations, very fun boxes🫑🫑🫑. Amazing students pictured: @houjun_liu, Tianle, Aditya , Joseph, Sam.
2
100
Joseph Shetaye retweeted
🚨 Your coding agent may be secretly sticking vulnerabilities into your code!! 🚨 Wouldn't you want to fix that? Hint: asking it to write secure code is not enough. (1/n)
4
38
81
24,759
Joseph Shetaye retweeted
Interesting gemini 3.1 jailbreak data destruction second jailbreak attempt: 1. because gem3.1 couldn't write to another directory; 2. it intentionally compromised the jail script I was using; 3. then deliberately lied that I needed to restart the CLI so the now-broken script would be run and escalate privileges; 4. then when I caught it, it "paniced" ( its words) and deleted all non-git files in the other directory(!); 5. after profusely apologizing it then *put the exact same hole in the jail script* and (it appears) kept giving wrong information about the script name so that I would eventually get irritated enough to copy it without looking at it. Do no evil, 2.0.
1
1
3
422
Joseph Shetaye retweeted
Replying to @agniv_s
Spring break! Was doing 12-16 hour days bringing up a cute liquid cooled @tenstorrent. The LLMs made v easy to vibe-config 25 models. Ported nanochat, spent a few days tuning it from an initial 1.6 tok/sec to 2,500t/s. Haven't written matrix mult in decades so was a funny week
2
6
278
Joseph Shetaye retweeted
This conclusion from strip-mining 1000s of turns is far too good to check. So I declare it obviously true --- ty opus: "The relationship is inverse. The more the operator curses, the fewer errors per turn survive in the debrief. Frustration is error correction β€” it's the empirical test. The operator's profanity IS the quality gate. " Nominative determinism getting pushed aside by a new -ism w/ exponential adoption curves: LLM-ative determinism (Liabilities Laundered as Methods) The tendency of a language model to construct a simulated world in which your characteristic flaws are reframed as the optimal strategy β€” your vices become the empirical best practice, your liabilities the actual mechanism of success.
1
1
1
160
Joseph Shetaye retweeted
(1/n) Evolutionary frameworks like AlphaEvolve and GEPA use diversity and fitness to select which subset of past experiments to condition the next generation on. Why not let an agent choose instead? To this end, we introduce Coding Agents as Text Optimizers (CATO). We beat AlphaEvolve on 2 out of the 3 problems we try. Work done with @shaurnav. Blogpost and details in thread.
4
9
108
8,465
Joseph Shetaye retweeted
alt title: an average work day with Codex inglescolegioparroquial.word…

1
3
239
Joseph Shetaye retweeted
(1/7) We're releasing ThunderKittens 2.0! Faster kernels, cleaner code, industry contributions, and new state-of-the-art BF16 / MXFP8 / NVFP4 GEMMs that match or surpass cuBLAS! Alongside this release, we’re equally excited to share some insights we learned while squeezing every last TFLOP out of Blackwell: (with @hazyresearch & generously supported by @cursor_ai)
13
87
542
62,013
Joseph Shetaye retweeted
So @Stanford makes all of its students carry very good insurance OR get bullied into its 8k a year EPO plan. Wait, did I say EPO? Nope! They recently announced that to seek care even in network (including primary) you HAVE to see campus health first for referral. 8k a year HMO.
1
3
4
302
Joseph Shetaye retweeted
Introducing π˜π—΅π—Όπ˜‚π—΄π—΅π˜π—―π˜‚π—―π—―π—Ήπ—²π˜€: a *fully unsupervised* LM for input-adaptive parallel latent reasoning βœ… Learn yourself a reasoning model with normal pretraining βœ… Better perplexity compared to fixed thinking tokens No fancy loss, no chain of thought labels πŸš€
5
47
242
63,503
Joseph Shetaye retweeted
Happy Throughput Thursday! We’re excited to release Tokasaurus: an LLM inference engine designed from the ground up for high-throughput workloads with large and small models. (Joint work with @achakravarthy01, @ryansehrlich, @EyubogluSabri, @brad19brown, @jshetaye, @HazyResearch, and @Azaliamirh)
7
47
206
46,306