Joined February 2008
429 Photos and videos
Nat Friedman retweeted
Future shock, from here out.
31
41
531
168,025
🥳
Excited to announce that @ManusAI has joined Meta to help us build amazing AI products! The Manus team in Singapore are world class at exploring the capability overhang of today’s models to scaffold powerful agents. Looking forward to working with you, @Red_Xiao_!
34
14
743
213,919
Started work at Meta this week. My job is to make amazing AI products that billions of people love to use. It won't happen overnight, but a few days in, I'm feeling confident that great things are ahead.
330
120
6,067
912,556
Civilization is constant maintenance
55
84
914
149,719
USA: passing 1000 state bills to slow down AI
China: passing American AI on leaderboards
DeepSeek’s R1 leaps over xAI, Meta and Anthropic to be tied as the world’s #2 AI Lab and the undisputed open-weights leader
DeepSeek R1 0528 has jumped from 60 to 68 in the Artificial Analysis Intelligence Index, our index of 7 leading evaluations that we run independently across all leading models. That’s the same magnitude of increase as the difference between OpenAI’s o1 and o3 (62 to 70).
This positions DeepSeek R1 as higher intelligence than xAI’s Grok 3 mini (high), NVIDIA’s Llama Nemotron Ultra, Meta’s Llama 4 Maverick, Alibaba’s Qwen 3 235B and equal to Google’s Gemini 2.5 Pro.
Breakdown of the model’s improvement:
🧠 Intelligence increases across the board: Biggest jumps seen in AIME 2024 (Competition Math, 21 points), LiveCodeBench (Code generation, 15 points), GPQA Diamond (Scientific Reasoning, 10 points) and Humanity’s Last Exam (Reasoning & Knowledge, 6 points)
No change to architecture: R1-0528 is a post-training update with no change to the V3/R1 architecture - it remains a large 671B model with 37B active parameters
🧑‍💻 Significant leap in coding skills: R1 is now matching Gemini 2.5 Pro in the Artificial Analysis Coding Index and is behind only o4-mini (high) and o3
🗯️ Increased token usage: R1-0528 used 99 million tokens to complete the evals in Artificial Analysis Intelligence Index, 40% more than the original R1’s 71 million tokens - i.e. the new R1 thinks for longer than the original R1. This is still not the highest token usage number we have seen: Gemini 2.5 Pro is using 30% more tokens than R1-0528
Takeaways for AI:
The gap between open and closed models is smaller than ever: open weights models have continued to maintain intelligence gains in line with proprietary models. DeepSeek’s R1 release in January was the first time an open-weights model achieved the #2 position and DeepSeek’s R1 update today brings it back to the same position
🇨🇳 China remains neck and neck with the US: models from China-based AI Labs have all but completely caught up to their US counterparts, this release continues the emerging trend. As of today, DeepSeek leads US-based AI labs including Anthropic and Meta in Artificial Analysis Intelligence Index
Improvements driven by reinforcement learning: DeepSeek has shown substantial intelligence improvements with the same architecture and pre-train as their original DeepSeek R1 release. This highlights the continually increasing importance of post-training, particularly for reasoning models trained with reinforcement learning (RL) techniques. OpenAI disclosed a 10x scaling of RL compute between o1 and o3 - DeepSeek have just demonstrated that so far, they can keep up with OpenAI’s RL compute scaling. Scaling RL demands less compute than scaling pre-training and offers an efficient way of achieving intelligence gains, supporting AI Labs with fewer GPUs
See further analysis below 👇
50
112
946
217,454
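The figures quoted in the DeepSeek post above can be sanity-checked with a few lines of arithmetic. This is only a sketch using the numbers stated in the post; the variable names are made up for illustration, and the Gemini 2.5 Pro token count is derived from the stated "30% more", not a figure the post gives directly.

```python
def pct_increase(old: float, new: float) -> float:
    """Relative increase from old to new, in percent."""
    return (new - old) / old * 100.0


# Token usage quoted in the post, in millions of tokens.
r1_original_tokens_m = 71
r1_0528_tokens_m = 99

# The post rounds this to "40% more".
token_increase_pct = pct_increase(r1_original_tokens_m, r1_0528_tokens_m)

# Derived, not stated in the post: "30% more tokens than R1-0528".
gemini_implied_tokens_m = r1_0528_tokens_m * 1.30

# Index jumps quoted in the post: R1 went 60 -> 68, o1 -> o3 went 62 -> 70.
r1_index_jump = 68 - 60
o1_to_o3_index_jump = 70 - 62

print(f"R1-0528 token increase: {token_increase_pct:.1f}%")
print(f"Implied Gemini 2.5 Pro usage: ~{gemini_implied_tokens_m:.0f}M tokens")
print(f"Same index jump: {r1_index_jump == o1_to_o3_index_jump}")
```

The exact token increase comes out slightly under 40%, which is consistent with the rounded figure in the post.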
Anarchotyranny, but at the nation state level
3
8
139
60,584
We found the title of a scroll for the first time! This cylinder of charcoal turns out to be "On Vices, Book 1" by Philodemus
157
578
7,426
663,298

6
6
308
63,285
Amazingly, this looks like it might be solved, thanks to a lot of help from some wonderful people on X. 🤞 Thank you!
Anyone have a Mexico government connection? A project I’m supporting to LIDAR the jungle there just got rejected by some bureaucrat. Please DM if you can help. Thanks!
16
8
540
88,966
Please help us find smart people to work full time on the scrolls!
We’re looking for exceptional people to join our mission to read the Herculaneum Scrolls. Refer a successful hire and earn a $5,000 prize.
We're hiring for the following roles:
Geometry & Computer Vision Applied Researchers
Platform Engineers
Synchrotron Tomography Reconstruction Expert
If you know someone brilliant who can help recover the ancient scrolls of Herculaneum, your referral could change history!
View Open Roles - scrollprize.org/jobs
Check the instructions - scrollprize.org/prizes#refer…
Send referrals to: jobs@scrollprize.org
10
45
304
98,880
We must find the Diliad
29 Apr 2025
There is a lost Ancient Greek parody of the Iliad called the "Diliad"
30
60
1,085
94,092
Hadn't realized how close the ancient Greeks got to calculus. en.wikipedia.org/wiki/Method…
33
20
516
53,741
Scrolls
26
24
512
45,799
Is anyone genetically engineering unicorns? It seems like that company would do numbers
54
15
514
102,207
Anyone have a Mexico government connection? A project I’m supporting to LIDAR the jungle there just got rejected by some bureaucrat. Please DM if you can help. Thanks!
101
55
1,865
421,280
Let's build ships in California!
1/ @CAForever is answering the call to propel American shipbuilding for the next century. With today’s @POTUS Executive Order and the bipartisan SHIPS Act, we’re offering 3 miles of our waterfront to build the Solano Shipyard, the largest shipbuilding complex in America. 🧵
46
35
728
109,488
🥰
4 Apr 2025
Progress on NEO’s AI has been really fast of late. Here are some early clips of a generalist model we’re developing at @1x_tech. The following clips are 100% autonomous, running on a single set of neural network weights. First, a quiet little robot that picks up leaves and puts them in a bag.
9
2
117
34,559
Correct
OpenAI submitted their policy proposal to the US government this morning. They directly link fair use with national security, and said if China continues to have free access to data while 'American companies are left without fair use access, the race for AI is effectively over.'
19
10
343
57,121
This has been helpful
11 Mar 2025
Introducing a new way to use Granola
Our chat feature now works with any person or company - no more digging through old notes 🗂️
Use Granola to chat across multiple meetings and ask things like “What feedback has Jim given about our product?”.
4
70
27,292
Exciting PlasticList update: We were honored to work with @bobaguys to identify and eliminate the sources of BPA contamination, and their teas are now BPA-free! They have fully transitioned to BPA-free receipt paper, which PlasticList confirmed to be BPA-free through independent lab testing. They have also switched to brown sugar in BPA-free packaging. We have been impressed with their commitment to get to the bottom of the issue and move fast to remove BPA from their supply chain. If you want healthy and delicious tea, I highly recommend Boba Guys!
Advice for Food Companies
Since we launched PlasticList, we’ve been heartened to have quite a few food companies reach out and ask for help interpreting their results and tracking down and eliminating their contamination. I’ve had calls with a bunch of these. I am happy to report that no food company wants this stuff in their food and they are all eager to figure out what’s going on and how to remove it. After a while I noticed the advice we were giving was pretty similar for every company, so I thought it would be useful to write it down and share publicly. So, here are some notes:
1. To track down the source of your contamination, don’t just test a few samples of your product with varied production processes. Instead, test every single one of your inputs: every ingredient and input in the form you receive it before any processing steps, including water and any other consumables.
2. Then, test the food before and after every step in your production process. If you boil something in tap water, test before and after boiling. If you chop something on a plastic cutting board (because wood cutting boards are outlawed in commercial kitchens, apparently), test before and after chopping.
3. You may have to go deep into your supply chain to figure out the source of your contamination. One food company founder we spoke to said that some of the fruit they include in their product is picked, put into plastic bags, and then steamed in the bags before the bags are cut open and the fruit is transferred into another plastic bag, while still warm, for shipping. Whoops.
4. Run at least three samples of every test due to sample-to-sample variation. You can see in our report and in our data that sample-to-sample and lot-to-lot variation should be expected: plasticlist.org/report
5. You should also test any intermediate or final packaging that your product ships in, as leaching can also occur post-production.
6. There are a lot of steps that you need to carefully follow to prevent contaminating your samples during collection and transportation. It’s really easy to miss one of these and mess up your data. We describe many of these on our methodology page: plasticlist.org/methodology
7. You should consider running longitudinal tests, maybe quarterly, as we have heard that there can be seasonal variation in contamination from suppliers, due to things like summer heat, suppliers switching their processes, and suppliers switching their own backend suppliers for their inputs.
8. And most importantly: PICK A GOOD LAB. Unfortunately not all labs are good, and we think many ISO-certified commercial labs will not give reliable results. We rejected many certified labs because we weren’t confident in their work; all-in-all, we spent about 10 weeks finding a lab that we trust for our tests. You can see our lab’s internal methodology here: docs.google.com/document/d/1… Our lab has recently permitted us to identify them publicly, and they are IEH: iehinc.com/ We also worked with Light Labs to produce this study and they can be a big help: lightlabs.com And Million Marker is able to work with food companies to debug their supply chains as well: millionmarker.com/
9. You should consider hiring an analytical chemist as a consultant to validate that the testing methodology is accurate and to double-check the lab’s results. We hired John Brock to do this and it was well worth it; we would not have been confident in our choice of lab or our results without John.
10. We couldn’t find a lot of evidence that the phthalate substitutes are bad; if you have high-percentile detections in phthalates or bisphenols, though, it’s probably worth figuring out how those chemicals are getting into your products.
74
114
2,514
363,375
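Points 1, 2, 4 and 5 of the advice above amount to a sampling plan: every input as received, every process step before and after, every packaging item, each in at least three replicates. A minimal sketch of how a company might enumerate that plan follows. The function, the class, and the example ingredients, steps and packaging are all hypothetical and are not part of PlasticList's methodology.

```python
from dataclasses import dataclass

MIN_REPLICATES = 3  # point 4: at least three samples of every test


@dataclass(frozen=True)
class Sample:
    target: str     # ingredient, process step, or packaging item
    stage: str      # "as received", "before", "after", or "packaging"
    replicate: int  # 1-based replicate number


def build_test_plan(inputs, steps, packaging, replicates=MIN_REPLICATES):
    """Enumerate the samples to send to the lab (hypothetical helper)."""
    if replicates < MIN_REPLICATES:
        raise ValueError("run at least three samples of every test")
    plan = []
    reps = range(1, replicates + 1)
    # Point 1: every input in the form it is received, including water.
    for name in inputs:
        plan.extend(Sample(name, "as received", r) for r in reps)
    # Point 2: the food before and after every production step.
    for step in steps:
        for stage in ("before", "after"):
            plan.extend(Sample(step, stage, r) for r in reps)
    # Point 5: intermediate and final packaging.
    for item in packaging:
        plan.extend(Sample(item, "packaging", r) for r in reps)
    return plan


# Hypothetical example values, chosen only for illustration.
plan = build_test_plan(
    inputs=["tea leaves", "tap water", "sugar"],
    steps=["boiling", "chopping"],
    packaging=["cup", "lid"],
)
print(f"{len(plan)} samples to collect")
```

Even this small example produces 27 samples, which gives a sense of why the advice stresses careful collection procedures and a lab you trust.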