Joined January 2012
73 Photos and videos
Philip Monk retweeted
9 Dec 2025
.@essential_ai's rnj-1 model is now on Ollama! ollama run rnj-1 8B parameter, open-weight dense model trained from scratch. The model is optimized for code and STEM with capabilities on par with other state of the art open-weight models. Let's go! 🚀🚀🚀
9
33
246
31,291
6 Dec 2025
It's been a blast to lead the infrastructure effort to train this model. I'm excited to see it out in the world!
We are beyond thrilled to share our first flagship models, Rnj-1 base and instruct 8B parameter models. Rnj-1 is the culmination of 10 months of hard work by a phenomenal team, dedicated to advancing American SOTA OSS AI. Lots of wins with Rnj-1. 1. SWE bench performance close to GPT 4o. 2. Tool use outperforming all comparable open source models. 3. Mathematical reasoning (AIME’25) nearly at par with GPT OSS MoE 20B. ….
2
16
828
6 Dec 2025
It's open weights and a very convenient size to run locally, btw. I get 20 tok/s on an M3 mac with llama.cpp.
8
438
Philip Monk retweeted
Today, we’re excited to introduce Rnj-1, @essential_ai's first open model; a world-class 8B base instruct pair, built with scientific rigor, intentional design, and a belief that the advancement and equitable distribution of AI depend on building in the open. We bring American open-source at par with the best in the world.
35
153
1,049
607,132
24 Nov 2025
You all have it so easy today with your petaflop gpus. In my day we had *floppy disks* that could only handle a few hundred kiloflops/s
1
1
194
28 Oct 2025
It finally happened: I ran into a bug that rust would have caught
14 Aug 2025
The things that are hard about ml infra are not things that rust solves
4
286
Philip Monk retweeted
[1/2] We at Essential are driven by mission to advance fundamental research guided by first principles, rigor and sharing research openly.
1
10
30
5,481
14 Aug 2025
The things that are hard about ml infra are not things that rust solves
The fact that Python is the standard for machine learning is a serious indictment of the field’s engineering standards
3
8
1,342
14 Aug 2025
If you're a PL guy who engages seriously with the problem the answer you come to is Jax and/or torch.compile, not rust
1
6
323
14 Aug 2025
of course, PL guys are often wrong
3
252
23 Jul 2025
Protip: function in the presence of uncertainty in your own mind. Your tweets will be worse but you will be more aligned with reality
23 Jul 2025
Complete certainty is impossible. So for any belief it’s always possible to be wrong. The Sun might not rise tomorrow. Yet at some point, in order to function we must round the chance of error to zero on many beliefs. On what principled basis might we decide to do this?
1
335
Philip Monk retweeted
[1/5] 🚀 Meet Essential-Web v1.0, a 24-trillion-token pre-training dataset with rich metadata built to effortlessly curate high-performing datasets across domains and use cases!
12
51
297
227,003
23 May 2025
Leaving aside the main point of the post, I think people underestimate the degree to which the previous generation was influenced by stories of the one before that. There are more total stories now, but the number that affects any particular person is probably not any higher
22 May 2025
that jony and sam video has me thinking about something - i'll try to explain it the previous generation of silicon valley there were not yet too many stories of silicon valley people did the things they were trying to do, went through twists and turns and then later the stories were written - and they truly are incredible but now our heads are full of these stories - we can't do the thing without thinking "hey this is that thing they talk about" and as soon as that happens it inevitably becomes performative - people moving to sf and posting a pic of their mattress on the floor everyone want to tell the stories in realtime - it feels like a major plot point in the book that openai recruited jony ive! and so content is produced with all the gravitas that this plot point deserves as though we're already looking at it in hindsight the video really feels like two people meeting up again after accomplishing something together and reflecting on how they got there except this isn't the end, it's actually the beginning, we don't know where all of this ends up - it could be a disaster navel gazing has become so rooted in our culture to the point where it becomes the official marketing approach of multi billion dollar companies
1
3
552
23 May 2025
We've always been reaching for the glory of the previous generation
1
2
243
17 May 2025
This was a super interesting project to work on. It showed up when we started using muon at larger scales.
Muon is a serious competitor to AdamW, but it's tricky to scale up. Our infra team has made fundamental advancements in parallelizing Muon on large scale distributed clusters. We're extremely happy with the result and it's now a part of our pretraining pipeline. 🔗Check out our new blog post on this: essential.ai/blog/infra essential.ai/ @ashVaswani @pcmonk @ishaankshah @karlstratos
There's a new version of this post
1
5
788
3 May 2025
Avoiding spoilers is pretty difficult in general, but I was pleasantly surprised this just works
1
4
575
4 May 2025
Update: it doesn't actually know the score
4
333
19 Apr 2025
I don't know who needs to hear it, but for 100s of years there has been a significant tribe that believes basically this. Something in their worldview hits inf or nan and they start believing the end of the world is in the next 10 years. Reject it, don't let it make you impotent.
1
3
22
575
11 Apr 2025
This and the related thread is a good example of precisely where I differ from a lot of trads. I don't actually think the Colossus or Louvre was/is more interesting than what finance and tech has brought us.
I guess my own problem is that I think the most interesting things in the world were a result of *inefficient* allocation of capital. You don't build the Colossus of Rhodes or the Louvre - or almost everything inside of the Louvre - by expecting a return on investment.
1
3
433
11 Apr 2025
I'm more pro-civilization than anything else. But even though the Louvre is a valuable part of our civilization, it's much less valuable than what generally efficient allocation of capital gives us.
1
195