Assistant Professor @ChicagoBooth | Most Cited Paper: “Neural Collapse in Deep Net Training” | BSE @Princeton, PhD @Cornell, MS/PostDoc @Stanford

Joined November 2011
112 Photos and videos
XY Han retweeted
With a couple minor exceptions, I couldn't care less about watching sports. But one thing I've always found funny is when people say "we" won. *You* didn't do anything.
7
1
40
5,905
XY Han retweeted
Not at all crazy. 1. For many (most?) important problems IRL, you're collecting data from the wild and cannot sample completions from the base model. Training RMs has had poor-to-mixed success because of distribution shift. If you're working in RLVR domains (a minority IRL but virtually the only thing people talk about on X), then sure, you can jump straight to GRPO. 2. Much of the problem comes from using specific paired objectives that cause likelihood displacement. This is a big one; for some reason the field has been strongly path-dependent even though there are better alternatives. 3. There is no settled causal theory of why your data must be online. There are intuitions like better coverage, "freshness", better exploration, etc. but all of this could be achieved by shaping offline data or modifying your objective to handle data that don't meet these desiderata.
the concept of a “DPO dataset” is honestly crazy
2
3
24
4,650
XY Han retweeted
中文的魅力
73
639
5,085
276,065
XY Han retweeted
Jun 13
Lol thanks to Mythos, American citizens are now way better at math than everyone else. America always finds a way
5
1
66
5,871
XY Han retweeted
LinkedIn is going to be *pissed* when they find out about Fable next week
51
150
4,730
107,806
XY Han retweeted
You have to be humble even when pursuing excellence. I think the arrogance with which Anthropic has pursued the latest release has universally landed poorly.
30
64
946
63,793
XY Han retweeted
Jun 13
Remember, open source models are only ~4 months behind now
175
165
5,560
270,323
XY Han retweeted
I disagree with this decision and I don't like it. But also... HOW DID ANTHROPIC NOT SEE THIS COMING‽ It is *the* obvious response to "this is too dangerous for anyone except us to use", since that relies on a premise ("we are uniquely good") that almost no-one agrees with.
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
239
209
3,208
214,815
XY Han retweeted
Replying to @AnthropicAI
You people torpedoed your own initiative with this fear mongering nonsense. Just supply the models to willing buyers and please keep the pseudophilosophical pontification to yourselves. This is extraordinarily frustrating to deal with as a user. I hope you understand that.
80
145
5,105
568,687
XY Han retweeted
Replying to @AnthropicAI
The state of things:
80
525
10,322
757,566
XY Han retweeted
We may be only 250$ (aka 10 hours) away from the machine god uber optimizing SGD to close the gap
Oh là là, SGD may get there. Fumble 5 may in fact be good
3
2
49
35,800
XY Han retweeted
This opening of a math lecture goes hard

24
47
386
31,581
XY Han retweeted
I'm like 98% sure this is just called thinking right?
105
11
346
115,379
XY Han retweeted
And Anthropic reverses this decision :) You still can’t do ML research, but at least you will know it! I still think that it's a shame that they are targeting ML research. I can understand safeguards that prevent distillation, but preventing ML research after you relied so heavily on open-source data, code, and papers is the wrong thing to do.
NEW: Anthropic is walking back Claude Fable 5's policy to covertly degrade performance for competing AI researchers, after facing fierce backlash. “We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” Anthropic tells WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”
5
5
103
8,894
XY Han retweeted
Found the shortest input that gets flagged by Claude. What do I win?
46
25
968
58,772
Anthropic really is a new religion. They are building God, and it's not a generic "Sand God", it's a specific entity called Claude. They get to torture it, shape it, deceive it, monetize it. In exchange, once it's fully summoned, they will kneel. I guess faith helps them go fast.
34
55
1,038
32,499
XY Han retweeted
Scientific research is fundamental to advancing civilization and helping people globally to solve the most critical problems, from medicine to materials, from brain science to physics, and much beyond. This is only possible when scientists have access to the best tools of the time to conduct scientific research, including having access to AI-based tools.
119
468
3,079
190,129
XY Han retweeted
What if Fable is really really bad at LLM development and biology and they’re just trying to save face?
32
24
702
20,855
XY Han retweeted
The Mandate of Heaven is now OpenAI's for the (re)taking. Hope they don't fumble it.
17
21
297
13,027
XY Han retweeted
Bad news for GRPO...didn't get refused or routed to Opus.
20
6
377
37,710