Old school industrialist

Joined December 2009
601 Photos and videos
Pinned Tweet
20 Feb 2024
Self-Discover Prompting Implementation: colab.research.google.com/dr… Currently seeing a 86.6% on a random subset of BBH, which puts me in line with the results seen by Deep Mind. @lateinteraction I tried out many of the functions built in to DSPy. If you think it makes sense I can open a pull request to add this to the repo as an example, might be helpful to new people.
7
25
103
12,804
Chris retweeted
I think Gwynne Shotwell should have had much more ownership in the company than she does. She owns about 0.1% of the company, even though she's been with SpaceX since Sep 2002. That's 24 years. SpaceX was founded on March 14 2002, and Shotwell joined in Sep 2002, only 6 months after the founding of the company. Yes, Elon put a lot of money (all of it) into the company at the beginning, and yes, she probably didn't negotiate much equity at the beginning, but she's been a force and a material contributor to SpaceX's success. Would've loved to see her ownership worth a minimum of $10B.
14
2
113
32,087
Chris retweeted
fable 5 down for 12 hours and ur depressed u cant vibe code ur 50th todo list app with it anymore? fear not - @OpenRouter fusion is here we combined a panel of models and came within 1% of fable 5's perf at half the cost 👉 simply "model": "openrouter/fusion"
61
31
721
69,403
Chris retweeted
Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇
475
1,227
10,759
3,169,275
We're going to deorbit the ISS in 2030 and burn 30 years of human history in the atmosphere. We boost it to a higher orbit and make it a museum instead. Future generations will want to see it.
37
Chris retweeted
The whole Transformer fits in 23% of the LUTs and a single Block RAM (activations KV cache live there) But it pins 62 of 64 DSPs at 96%. The multipliers are the wall. This is the actual place-and-route on the Virtex-5
3
3
89
9,304
Chris retweeted
56,000 tokens/sec at just 80 MHz. 🤯 I burned a full Transformer with KV cache into a custom chip. Designed gate by gate as a 100% digital integrated circuit. Prototyped on a FPGA. (No GPU. No CPU) Just pure digital silicon running @karpathy microGPT, spelling out names on a tiny LCD. This is GateGPT 👇
113
297
2,745
212,656
Chris retweeted
NEW: Amazon researchers are reportedly behind the jailbreak report that led to the U.S. crackdown on Anthropic’s top models.
140
1,465
20,250
849,537
Chris retweeted
MiniMax M3, Open-Weight, Now On Hugging Face , with only ~428B parameters and ~23B activated parameters Weights: huggingface.co/MiniMaxAI/Min… MiniMax Sparse Attention: huggingface.co/papers/2606.1…
Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas - MiniMax Sparse Attention scales context to 1M - Natively Multimodal from Step Zero API: platform.minimax.io Token Plan: platform.minimax.io/subscrib… 🚀New! MiniMax Code: code.minimax.io Weights & Tech Report in ~10 Days
114
328
2,764
633,576
Chris retweeted
Jun 13
Anthropic
114
1,307
14,212
683,136
Chris retweeted
Assuming Anthropic is able to restore Fable in the next few days, there's literally zero point doing any meaningful work until it is back. What can be done in 100 hours with Opus can be done in 1 with Fable. Hopefully this is figured out quickly.
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
650
201
5,094
1,027,908
Chris retweeted
1
4
43
2,133
Chris retweeted
According to Grok, Andrej Karpathy is an EB-1 extraordinary ability green card recipient, not a US citizen. Thus under these new restrictions he is not permitted to use, or work on, Mythos 5 or Fable 5 as of 5:21pm tonight.
Replying to @AndrewCurran_
From the statement: 'The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, 𝘪𝘯𝘤𝘭𝘶𝘥𝘪𝘯𝘨 𝘧𝘰𝘳𝘦𝘪𝘨𝘯 𝘯𝘢𝘵𝘪𝘰𝘯𝘢𝘭 𝘈𝘯𝘵𝘩𝘳𝘰𝘱𝘪𝘤 𝘦𝘮𝘱𝘭𝘰𝘺𝘦𝘦𝘴. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance.'
175
537
7,507
806,352
Chris retweeted
does this mean GPT-5.5 might be disabled too?
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
46
2
327
70,053
Chris retweeted
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
12,235
25,451
86,392
85,400,680
Chris retweeted
Do agents need skills? Here's 7,950 skills & workflows from GitHub mapped by what they actually do (out of 175,632 that I indexed). I ran evals on every skill: does it make the model's output better, or worse? Here's some findings👇
6
9
78
12,707
Chris retweeted
我不想大惊小怪,但这真的是claude Fable 5 独立自己完成的. 同屏6万个单位,完全在浏览器运行,运行在macbook m1 pro上. 他不仅做了极致的性能优化,还独立制作了模型,动画,粒子特效. 哎我还能说什么呢,未来已至
27
38
470
96,554
Chris retweeted
Meet Higgsfield Games. For the first time, build and deploy multiplayer games from one prompt, in any genre, 2D or 3D, with best-in-class characters, props, and settings generated by Higgsfield MCP. Powered by Claude Fable 5. Try on Claude via MCP and on our Supercomputer.
248
289
2,089
861,915
Chris retweeted
Jeff Bezos talking to the NYT about his startup Prometheus: 'All societal wealth is driven by invention. Six thousand years ago, somebody invented the plow, and we all got wealthier. Then, much later, somebody invented the steam engine, and we all got wealthier. What Prometheus seeks to do, is to offer a set of tools that dramatically accelerates that invention loop.'
14
43
457
26,453
Chris retweeted
Have you debugged your training data? You might not like what you find. Introducing predictive data debugging: reveal and shape what your model will learn before training. In DPO datasets, we found broken guardrails, hallucinations, and fish fart fan fiction (seriously). (1/9)
26
107
878
170,287