vpj

vpj

10 Photos and videos

Tweets

hehehehe retweeted

vpj

@vpj

9 Aug 2025

Wrote an annotated Triton implementation of Flash Attention 2. (Links in reply) This is based on the flash attention implementation by the Triton team. Changed it to support GQA and cleaned up a little bit. Check it out to read the code for forward and backward passes along with the math and derivations. Hope this helps understand transformer attention and flash attention better. There's about 60 more annotated deep learning paper implementations on this website.

ALT The annotated code. Click on math symbols or identifiers to highlight them.

4,359

hehehehe

hehehehe @luck_not_shit

21 Jun 2025

Letting co-pilot comment on my pull request and then replying to those comments and resolving them makes me feel like a Schizophreniac . But honestly, some of the suggestions are legit useful, so I’m just gonna keep doing that.

Janet A. Carr

hehehehe retweeted

Janet A. Carr @janetacarr

6 Jun 2025

docker is supposed to solve the "works on my machine" problem but often I find that it just adds another layer to the "works on my machine" problem esp if you use a mac

178

3,190

184,561

hehehehe

hehehehe @luck_not_shit

2 May 2025

I mean, it’s your app.

Elon Musk

@elonmusk

2 May 2025

Never deleting this app 😂

313

hehehehe

hehehehe @luck_not_shit

2 Mar 2025

Going to save this and reply to chat gpt posts on LinkedIn

charlota

@0xCharlota

1 Mar 2025

me, every time i try to read a crypto project's website:

HSVSphere

hehehehe retweeted

HSVSphere

@HSVSphere

25 Feb 2025

> I made <thing> from scratch in Python! >look inside >import <thing>

@levelsio

25 Feb 2025

Works! Made a WebSocket server from scratch in Python with Grok 3 Now need to make it work properly in the flight sim!

435

10,623

375,703

chris

hehehehe retweeted

chris

@chrislevan24

23 Dec 2024

this.

240

4,053

31,510

1,676,772

hehehehe

hehehehe @luck_not_shit

21 Dec 2024

I need to let a LLM "talk" to swift core data. Need a language both the DB and the LLM talks so the obvious solution is SQL. SQL won't work on a key value store though. I wonder how hard would it be to write a SQL like driver for core data. #DoingdumbShitTillImNotDumbAnymore

hehehehe

hehehehe @luck_not_shit

21 Dec 2024

Being dumb off to a great start. Apparently there's a thing called Predicates on core data. Like a poor man's SQL. Entire chain of thought wasted 👍

terminally onλine εngineer

hehehehe retweeted

terminally onλine εngineer

@tekbog

20 Dec 2024

CS grads on suicide watch

217

3,825

326,824

Deedy

hehehehe retweeted

Deedy

@deedydas

20 Dec 2024

OpenAI o3 is 2727 on Codeforces which is equivalent to the #175 best human competitive coder on the planet. This is an absolutely superhuman result for AI and technology at large.

225

666

6,493

2,246,394

Martin Bauer

hehehehe retweeted

Martin Bauer

@martinmbauer

20 Nov 2024

No it isn’t. That’s the whole point

Тsфdiиg

@tsoding

20 Nov 2024

Did you know that this weird elongated S is just a for-loop?

245

593

21,026

2,005,206

hehehehe

hehehehe @luck_not_shit

5 Nov 2024

How to remove .env from git

✧

hehehehe retweeted

✧@northstardoll

3 Nov 2024

126

33,807

254,440

3,004,624

hehehehe

hehehehe @luck_not_shit

5 Nov 2024

“Oh thanks. Didn’t notice that”

goosewin

@Goosewin

3 Nov 2024

code reviews do be like that sometimes

hehehehe

hehehehe @luck_not_shit

19 Oct 2024

Now Our visualization library Inspectus can visualize values related to tokens in LLM outputs. This demo shows some outputs from using entropyx (by @_xjdr) on Llama 3. Had fun making this. (jk I didn’t) 🔗👇

0:16

1,745

hehehehe

hehehehe @luck_not_shit

19 Oct 2024

Pretty easy to get started so give it a try!

120

hehehehe

hehehehe @luck_not_shit

19 Oct 2024

Pip package: pypi.org/project/inspectus/ Docs: labmlai.github.io/inspectus/ Github: github.com/labmlai/inspectus

inspectus

Analytics for LLMs

pypi.org

labml.ai @labmlai

9 Jun 2024

We’ve open-sourced our LLM attention visualization library. It generates interactive visualizations of attention matrices with just a few lines of Python code in notebooks. @luck_not_shit cleaned up and polished the existing code to make it open source.

0:23

112