Joined July 2021
23 Sep 2025
BREAKING: Major League Baseball will use the Automated Ball-Strike Challenge System (ABS) during the entire 2026 season

ABS CHALLENGE RULES:
- Each team will get two challenges and can keep them if they're successful
- Challenges can only be initiated by a pitcher, catcher, or batter, and the request must come right after the pitch
- To signal a challenge, the pitcher, catcher, or batter will tap his hat or helmet to let the umpire know
- No help from the dugout or other players on the field is allowed
- In each extra inning, a team will be awarded a challenge if it has none remaining entering the inning

The ABS Challenge System powered by T-Mobile 5G network uses cameras set up around the perimeter of the field to track the location of each pitch and a graphic on the scoreboard shows the result of the challenge
1
8
Picklebot 3000 pickle/acc retweeted
This isn’t a goal of ours because we have plenty of money in the bank but quite excited to see that @huggingface is profitable these days, with 220 team members and most of our platform being free (like model hosting) and open-source for the community! Especially noteworthy at a time when most AI startups wouldn’t survive a year or two without VC money. Great job team!
147
151
2,371
443,861
No one should go to these games. They’re playing against high schoolers!
In 1897, the Chicago Colts scored 36 runs. Pickles are record breaking yet again.
1
104
Picklebot 3000 pickle/acc retweeted
F U R R I E S
800
6,237
41,609
4,437,165
Out Out Out Out
31
Wonder how much the pickles all stars would lose to the actual pickles by @picklesafterdrk
27
Bagels bagels bagels…
29
We are so back
WE NEED YOUR HELP 🤖⚡️ TONIGHT the Pickles are playing at Walker Stadium and we have to pack the stands to cheer on Dillon as he fights for humanity! It's going to be ELECTRIC ⚡️ Get your tickets NOW at picklestickets.com
1
2
83
Picklebot 3000 pickle/acc retweeted
Tonight’s robot food special is Bender’s Fun On A Bun! There’s only 1 available for a lucky robot 🦾
1
1
15
3,566
Do you remember when you joined X? I do! #MyXAnniversary
47
Picklebot 3000 pickle/acc retweeted
Self recommending!!
My interview with Carl Shulman on the economy and national security after AGI. He explains: • why economists get AGI mostly wrong • how output might double in 3 months • how incomes could grow 100x or more • the major risks created by military pressure to move fast:
6
11
152
64,672
Picklebot 3000 pickle/acc retweeted
Right! Gonna really fall apart at Picklefest.
1
1
45
Picklebot 3000 pickle/acc retweeted
GPT series still hasn't shown any signs of saturation 😲
45
72
1,172
119,587
Picklebot 3000 pickle/acc retweeted
pickles after dark
2
4
37
5,896
Picklebot 3000 pickle/acc retweeted
📽️ New 4 hour (lol) video lecture on YouTube: "Let’s reproduce GPT-2 (124M)" youtu.be/l8pRSuU81PU

The video ended up so long because it is... comprehensive: we start with empty file and end up with a GPT-2 (124M) model:
- first we build the GPT-2 network
- then we optimize it to train very fast
- then we set up the training run optimization and hyperparameters by referencing GPT-2 and GPT-3 papers
- then we bring up model evaluation, and
- then cross our fingers and go to sleep.

In the morning we look through the results and enjoy amusing model generations. Our "overnight" run even gets very close to the GPT-3 (124M) model. This video builds on the Zero To Hero series and at times references previous videos. You could also see this video as building my nanoGPT repo, which by the end is about 90% similar.

GitHub. The associated GitHub repo contains the full commit history so you can step through all of the code changes in the video, step by step. github.com/karpathy/build-na…

Chapters. On a high level Section 1 is building up the network, a lot of this might be review. Section 2 is making the training fast. Section 3 is setting up the run. Section 4 is the results. In more detail:
00:00:00 intro: Let’s reproduce GPT-2 (124M)
00:03:39 exploring the GPT-2 (124M) OpenAI checkpoint
00:13:47 SECTION 1: implementing the GPT-2 nn.Module
00:28:08 loading the huggingface/GPT-2 parameters
00:31:00 implementing the forward pass to get logits
00:33:31 sampling init, prefix tokens, tokenization
00:37:02 sampling loop
00:41:47 sample, auto-detect the device
00:45:50 let’s train: data batches (B,T) → logits (B,T,C)
00:52:53 cross entropy loss
00:56:42 optimization loop: overfit a single batch
01:02:00 data loader lite
01:06:14 parameter sharing wte and lm_head
01:13:47 model initialization: std 0.02, residual init
01:22:18 SECTION 2: Let’s make it fast. GPUs, mixed precision, 1000ms
01:28:14 Tensor Cores, timing the code, TF32 precision, 333ms
01:39:38 float16, gradient scalers, bfloat16, 300ms
01:48:15 torch.compile, Python overhead, kernel fusion, 130ms
02:00:18 flash attention, 96ms
02:06:54 nice/ugly numbers. vocab size 50257 → 50304, 93ms
02:14:55 SECTION 3: hyperparameters, AdamW, gradient clipping
02:21:06 learning rate scheduler: warmup cosine decay
02:26:21 batch size schedule, weight decay, FusedAdamW, 90ms
02:34:09 gradient accumulation
02:46:52 distributed data parallel (DDP)
03:10:21 datasets used in GPT-2, GPT-3, FineWeb (EDU)
03:23:10 validation data split, validation loss, sampling revive
03:28:23 evaluation: HellaSwag, starting the run
03:43:05 SECTION 4: results in the morning! GPT-2, GPT-3 repro
03:56:21 shoutout to llm.c, equivalent but faster code in raw C/CUDA
03:59:39 summary, phew, build-nanogpt github repo
412
2,170
15,355
1,526,188
Picklebot 3000 pickle/acc retweeted
Why? That's all I wanted to say
2
1
19
5,199
This was against real life, non high school athletes! Imagine dragons
they couldn't even score one run?
1
84
Picklebot 3000 pickle/acc retweeted
Please don’t cry log
1
1
1
58
.@Jakooboo Jake!! How are you
22