Mikhail Parakhin

Mikhail Parakhin

114 Photos and videos

Tweets

Mikhail Parakhin

@MParakhin

OpenAI has removed 5.2-Pro from ChatGPT. The best model for math/ML is not available anymore. Yes, it was mostly through an unbelievably high reasoning budget, but still. A dark day - and I'm not being facetious :-(

9,797

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

Jun 12

Fable 5 is in the league of its own. Both in quality and price - already on Toloka Arena:

256

16,999

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

Jun 11

Well, Tibo, for a year now I was pleading, arguing for, begging you guys to bring Pro as an advisor model into Codex (really, allow for the LARGE thinking budget)…

Tibo

@thsottiaux

Jun 10

I would like to claim my 1% of royalty fees.

355

68,493

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

Jun 9

I remember, I was there :-) Fun fact, it wasn’t NeurIPS, it was a room they rented in the same hotel, organizing an unrelated “workshop”, because NeurIPS wouldn’t accept them - Neural Networks weren’t cool. The room was packed, though.

Palash

@ABiggerSpalash

Jun 9

Geoff Hinton, before he was Geoff Hinton, once asked NVIDIA for a free GPU for his students Alex Krizhevsky and Ilya Sutskever. Nvidia declined

122

14,222

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

Jun 7

Have been extensively testing Claude Workflows this weekend, with the best model possible. Threw it at my whole code base, combing for bugs. 144 found and fixed! Geez... It is a large code base, for sure, but 144?!! Some are very impactful, some are downright embarrassing...

Mikhail Parakhin

@MParakhin

Apr 3

I keep predicting software quality will improve. I keep being wrong. Models write better-than-average code, yet we use them to write more code - not better code (shoutout to the unmovable, always-on-top Claude Code download and install window).

543

177,687

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

Jun 7

I had exactly the same issue with FedEx and Mackage. $2200 stolen, both agreed it happened, yet refused to engage. Stopped buying from Mackage, of course.

Brandon Avedikian

@bavedikian

Jun 6

Bought a $1,742.80 camera online from BestBuy. The FedEx delivery driver stole it. FedEx admitted it. But BestBuy won’t give a refund. They said we need to “work with local law enforcement.” Thought everyone should know if you buy from @BestBuy and a @FedEx driver steals what you paid for, your money is gone. Neither company will make it right. I’ve spent over $30K at BestBuy and will never spend another penny there.

193

19,898

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

Jun 3

We just published a paper on nitty-gritty technical SimGym details. Come and chat with us at ICML - DeepMind/Cornell/Stanford workshop. arxiv.org/pdf/2605.16116

5,668

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

Jun 3

And I was right! Toloka Arena finished testing - Claude 4.8 did take the first place. And, just as I saw, it did so through higher reasoning budgets - look at the number of tokens used.

Mikhail Parakhin

@MParakhin

May 30

OK, going to call it. Spent a lot of time with Opus 4.8: 1) It is a big step forward. The base model is still inferior to GPT-5.5, but they dramatically upped the thinking budget (for Max) - makes all the difference 2) Instruction following is still worse than GPT-5.5 xhigh 3) Coding, math, reasoning - better! It's not at the Pro level (of course), but the first Anthropic model I can genuinely use for math/ML. Codex app is much better (especially on Windows), but, until 5.6 arrives, I switched to Claude Code as the main system. Hearing great things about 5.6 though!

12,432

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

Jun 2

I feel like I just lost a family member - the new (today's) Codex version is broken (on Windows at least), trying to load the image generation model for some reason. The world stopped :-(

7,880

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

Jun 1

I’ve been watching this technology grow and develop from literally before the very first idea. And I am more and more amazed every day, not less. Codex is almost a family member now.

4,142

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

May 31

Pushed a small, but very useful addition to ml-tidbits today: a well-implemented GPU-friendly ranking loss function with no sorting, full gradient propagation. Nothing new, but so often I saw people using just subpar, nonsensical variants...

Mikhail Parakhin

@MParakhin

Feb 14

For 7 years I’ve been trying to satisfactorily solve an ML problem (deterministic Gaussian Autoencoder). Tried everything. Recently has finally solved it with 5.2 Pro Extended Thinking, planning to add it today to “ML tidbits” :-)

10,249

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

May 31

Greg is unbelievably intense and worked 80hr weeks for decades. Nothing would’ve happened without him. Besides, as Nassim Taleb would teach us, if someone keeps winning the lottery again and again - it’s not random.

Pedro Domingos

@pmddomingos

May 30

Greg Brockman is the biggest lottery winner in history: he contributed nothing to ChatGPT and still made $30B from OpenAI.

1,892

209,347

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

May 30

Model training is a game where GPUs and data are an overwhelming advantage. So when Recraft beats xAI, DeepSeek, Meta, BFL, Microsoft, etc. with a tiny fraction of the resources, the conclusion is: big-company ML talent selection is broken. Very different from "AI experts" :-)

247

168,288

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

May 30

832

104,735

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

May 29

It's like Christmas came early! Except still doesn't work :-(

Codex Changelog @Codex_Changelog

May 29

🚀 Codex app 26.527 is out! 🖥 Computer Use on Windows 📱 Remote Windows control from iOS, Android & Mac 👤 Profile section with usage stats & token activity Changelog: developers.openai.com/codex/…

12,490

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

May 29

After a round of Authorization magic incantations it works now!

2,563

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

May 29

Recraft, my ex-colleagues, keep punching above their weight - #1 independent, #3 after OpenAI and Google.

Recraft

@recraftai

May 29

Two weeks since launch, and the public benchmarks are in. Recraft is officially the #1 independent image generation lab 🚀

4,918

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

May 28

The fact that JAX was even mentioned makes me think of two things: 1) xAI still needs better ML people 2) PyTorch is stagnating and probably is not going to recover :-( It is such an unparalleled achievement, but lost key people, exiled to FAIR now...

Elon Musk

@elonmusk

May 28

SpaceX has almost finished writing V1.0 of an in-house AI training stack in C that exact-maps to 220k GB300s with 800G NICs, making heavy use of pipeline parallelism and getting as close to bare metal as possible. The potential speed improvement vs JAX for large training runs is over an order of magnitude.

632

190,632

Mikhail Parakhin

Mikhail Parakhin

@MParakhin

May 23

IRIX deserves to be reborn - it was SO ahead of its time: a real file system (XFS - still alive!), NUMA done right, kernel support for DMA in graphics (I want that for GPUs on my desktop!) and the coolest of all, the real-time subsystem (REACT/Pro). WSL -> WSI :-)!!!

spaztron64 @ Otomata Labs @spaztron64

May 22

Now I can say that I've RDPed from IRIX to Windows.

5,778