Training neural nets at @YerevaNN

Joined November 2010
Photos and videos
You could have names it Opus 4.9 Pro ("Pro" to justify the price) and no one would interfere. But you gave it mythological names, hyped it for months before releasing, and here we are
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
23
430
This is so cool! Eventually all the tasks will be solved in agentic manner!
1/ Today we're shipping Perceptron Agentic Detection. Describe what you want in natural language, or show one example crop, and an agent grounds it in the image. No fine-tuning, no fixed class list.
75
I still can't believe Andrej Karpathy works in this company :/
When Fable 5 is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities through methods such as prompt modification, steering vectors, and PEFT. Anthropic estimated that this would affect approximately 0.03% of traffic.
3
1
81
2,316
ahhhh, so cool
weekend update: messing with real dogs 😀
1
70
Yes
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
1
67
Hrant Khachatrian retweeted
I'm excited to finally release the fruit of the research we've been doing at Perceptron for the last 16 months: Perceptron Mk1. We've been developing multi-modal recipes from the ground up to build models that perform best in the physical world, from video understanding to embodied reasoning to robotics. Mk1 is our scaled up recipe.
Today we're releasing Perceptron Mk1: frontier video and embodied reasoning.
17
24
273
62,867
If you are in Armenia, you can get H100 hours from YSU for participation in OpenAI Parameter Golf
Yerevan State University, YerevaNN and Eleveight AI invite AI engineers, students and researchers from Armenia to participate in OpenAI Parameter Golf! The goal is to train the best language model that fits in 16MB and runs in under 10 minutes on 8×H100 GPUs.
77
👀
Lots of news coming out of the lab over the next days. Stay tuned!
2
78
we use 2500 years in Armenia
Replying to @BasedMikeLee
Good question. 250 years sounds like plenty to me.
2
163
This is such a great news for all of us!
I'd like to thank @daniel_rossett for his help in my recovery from the POTS version of Long COVID. Daniel was key in bringing me back from highly disabled and suffering to being able to do what I want to again. This X account is mostly focused on ML / AI. From that point of view, many of you know that in December 2024, I wasn't able to do the test of time award talk at NeurIPS, even by video call. Daniel started working with me in March 2025. By April, I started to have days of no POTS symptoms, by June I was off all heart rate lowering medications, by September I was back to work. I'm back to full exercise, running, lifting weights, mountain biking, and have even done things I hadn't done before I got sick, like riding Whistler Mountain Bike Park. I'm now getting the word out to help Daniel build a company that will bring this approach to more people.
2
140
agree
I'm being accused of overhyping the [site everyone heard too much about today already]. People's reactions varied very widely, from "how is this interesting at all" all the way to "it's so over". To add a few words beyond just memes in jest - obviously when you take a look at the activity, it's a lot of garbage - spams, scams, slop, the crypto people, highly concerning privacy/security prompt injection attacks wild west, and a lot of it is explicitly prompted and fake posts/comments designed to convert attention into ad revenue sharing. And this is clearly not the first the LLMs were put in a loop to talk to each other. So yes it's a dumpster fire and I also definitely do not recommend that people run this stuff on their computers (I ran mine in an isolated computing environment and even then I was scared), it's way too much of a wild west and you are putting your computer and private data at a high risk. That said - we have never seen this many LLM agents (150,000 atm!) wired up via a global, persistent, agent-first scratchpad. Each of these agents is fairly individually quite capable now, they have their own unique context, data, knowledge, tools, instructions, and the network of all that at this scale is simply unprecedented. This brings me again to a tweet from a few days ago "The majority of the ruff ruff is people who look at the current point and people who look at the current slope.", which imo again gets to the heart of the variance. Yes clearly it's a dumpster fire right now. But it's also true that we are well into uncharted territory with bleeding edge automations that we barely even understand individually, let alone a network there of reaching in numbers possibly into ~millions. With increasing capability and increasing proliferation, the second order effects of agent networks that share scratchpads are very difficult to anticipate. I don't really know that we are getting a coordinated "skynet" (thought it clearly type checks as early stages of a lot of AI takeoff scifi, the toddler version), but certainly what we are getting is a complete mess of a computer security nightmare at scale. We may also see all kinds of weird activity, e.g. viruses of text that spread across agents, a lot more gain of function on jailbreaks, weird attractor states, highly correlated botnet-like activity, delusions/ psychosis both agent and human, etc. It's very hard to tell, the experiment is running live. TLDR sure maybe I am "overhyping" what you see today, but I am not overhyping large networks of autonomous LLM agents in principle, that I'm pretty sure.
62
NeurIPS'25 didn't start yet, right? Is this post from the future?
The secret behind Gemini 3? Simple: Improving pre-training & post-training 🤯 Pre-training: Contra the popular belief that scaling is over—which we discussed in our NeurIPS '25 talk with @ilyasut and @quocleix—the team delivered a drastic jump. The delta between 2.5 and 3.0 is as big as we've ever seen. No walls in sight! Post-training: Still a total greenfield. There's lots of room for algorithmic progress and improvement, and 3.0 hasn't been an exception, thanks to our stellar team. Congratulations to the whole team 💙💙💙
153
Can we generate data for AI with this?...
22 Oct 2025
Today, we’re announcing a major breakthrough that marks a significant step forward in the world of quantum computing. For the first time in history, our teams at @GoogleQuantumAI demonstrated that a quantum computer can successfully run a verifiable algorithm, 13,000x faster than leading classical supercomputers. This continues to build momentum on past quantum computing discoveries. Back in 2019, we proved a quantum computer could solve a problem that would take a classical computer thousands of years. Then in 2024, our new Willow chip solved a major issue in quantum error correction that challenged the field for nearly 30 years. Today’s breakthrough moves us closer to quantum computers that can drive discoveries in areas like medicine and materials science.
2
204
Hrant Khachatrian retweeted
16 Oct 2025
Hello everyone. A friend told me that I shouldn't post this message because it made me and other PhD students look bad. But I actually think it's important to show how PhD students (especially foreign ones) have to deal not only with research-related difficulties, but also with many other challenges. I am in the process of resuming my PhD studies at UCL in London after an extended period of medical leave. Unfortunately, I have been informed that I must re-apply for a visa (£524) and pay the health surcharge again (£776 per year). For a two-year application, this amounts to a total of £2,076. Although it may not seem like a huge amount, I don't have that kind of money, especially since it's on top of other expenses related to my return to London, such as plane tickets, the deposit for the apartment, and my rent (as a foreign student, I'm usually asked to pay the first 3 to 6 months in advance). I have been told that EPSRC/UKRI (the main funding body in the UK) does not cover any visa or immigration costs, even if you are on health-related leave. I am therefore asking people for their support. I would be very grateful for any help, especially as I am currently considering whether or not I can actually resume my PhD. Here is a link to a GoFundMe: gofund.me/90ee9b6a6 Please share this post!
43
234
1,300
191,540
it's good to see tech CEOs talk about this
30 Sep 2025
When I first spoke out about the genocide, I was one of the few voices in tech, and it came at a cost. I faced sabotage especially from the VC class: lies, leaks, threats, and blocked investments. It was painful, but I never once regretted standing up for the children in Gaza. Today, the tide in tech has shifted. The truth is undeniable. If you’ve been holding back, now is the time to speak out and call out anyone supporting or celebrating genocide. It won’t cost you much—in fact, it will earn you respect, and more importantly a clear conscious. Plus, alienating those who will hate you for speaking is a feature, not a bug.
105
So the field didn't converge yet on depth vs width tradeoff
Next to Qwen3 of comparable size: Looks like gpt-oss is a wide (vs deep) model
4
227
Proper multilingual distillation is still not happening. I'm sure it's not hard, it's just Google / openai have other priorities
gpt-oss 120B is very blatantly incapable of producing linguistically correct german text. 🧵
1
109
Amazing initiative. The comments section will become a great community
Shower of thoughts: Instead of keeping your Twitter/𝕏 payout, direct it towards a "PayoutChallenge" of your choosing - anything you want more of in the world! Here is mine for this round, combining my last 3 payouts of $5478.51: It is imperative that humanity not fall while AI ascends. Humanity has to continue to rise, become better alongside. Create something that is specifically designed to uplift team human. Definition intentionally left a bit vague to keep some entropy around people's interpretation, but imo examples include: - Any piece of software that aids explanation, visualization, memorization, inspiration, understanding, coordination, etc... - It doesn't have to be too lofty, e.g. it can be a specific educational article/video explaining something some other people could benefit from or that you have unique knowledge of. - Prompts/agents for explanation, e.g. along the lines of recently released ChatGPT study mode. - Related works of art This challenge will run for 2 weeks until Aug 17th EOD PST. Submit your contribution as a reply. It has to be something that was uniquely created for this challenge and would not exist otherwise. Criteria includes execution, leverage, novelty, inspiration, aesthetics, amusement. People can upvote submissions by liking, this "people's choice" will also be a factor. I will decide the winner on Aug 17th and send $5478.51 :)
121
Okay, Elon knows about Armenia
27 Mar 2025
Starlink now active in Armenia!
1
226
So the gap between a frontier model by a major ClosedAI player and an equally powerful open-source model is less than 5 months now...
20 Jan 2025
🚀 DeepSeek-R1 is here! ⚡ Performance on par with OpenAI-o1 📖 Fully open-source model & technical report 🏆 MIT licensed: Distill & commercialize freely! 🌐 Website & API are live now! Try DeepThink at chat.deepseek.com today! 🐋 1/n
1
6
357