... | prev undergrad @miteecs and @mitbiology, @cartesia @janestreetgroup | usa imo 2019

Joined April 2021
31 Photos and videos
Pinned Tweet
11 Jul 2025
happy to announce that we've gotten rid of tokenizers! especially excited with what we've replaced them with: end-to-end trainable modules that not only learn to group characters into (sub)words, but can iterate to group words into phrases and further higher-order concepts see @sukjun_hwang's thread for more details 👇
Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data
12
49
756
75,593
brandon wang retweeted
never seen anybody fuck up as badly as fox just did
1
1
9
757
cute result, always fun to see how well principled simple approaches do well in comp bio
Replying to @lpachter
In our preprint we prove a theorem: the only method satisfying rank monotonicity, perturbation additivity (plays well with PCA), relabeling equivariance (input order doesn't matter), depth invariance, and a basic calibration, is CLR. 13/
4
739
someone pointed out to me (in early 2025) that the reason there is no american deepseek is that js/hrt/etc all did not believe they would ever lose access to frontier capabilities anyway
mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community also the fact that this is un purpose not visible to the user is crazy
10
10
551
57,669
one of the stronger pieces of cope i see is people/startups believing that their product/research direction is safe bc the labs are "just working on code/scaling/etc" this thinking seems to fundamentally misunderstand the point of going after coding scaling first
3
37
2,250
so what the hell did openai put into gpt image 2 imo still the most impressive model of the year so far
1
14
1,655
surely managing a team of agents is also ddr
A conversation I had earlier today.
4
358
brandon wang retweeted
I really miss the "information wants to be free" left of the 90s / early 2000s. We have to go back.
8
5
56
2,607
kimi agent swarm has been achieved domestically, part 2
May 28
Replying to @claudeai
Also new in Claude Code: dynamic workflows (research preview). For the hardest tasks, Claude makes a plan, runs hundreds of parallel subagents, and verifies its work before reporting back. Think a migration touching hundreds of files. Read more: claude.com/blog/introducing-…
1
5
1,047
happened again (oc @radiuskia)
Cartesia Ink-2 debuts as #1 for accuracy on the brand-new streaming speech-to-text leaderboard from @ArtificialAnlys! We designed Ink-2 from the ground up for voice agents - with low latency, eager transcripts, and semantic endpointing.
1
4
29
2,207
the dim sum restaurant im at is showing a semiconductor lithography report from a hong kong tv channel, amazing
8
62
1,311
31,334
brandon wang retweeted
Oh actually my bad it's an oral
Excited to announce that dnaHNet has been accepted as an ICML 2026 Spotlight paper! Very grateful to my coauthors @victor_ljz and team, plus our remarkable supervisors @_albertgu and @genophoria.
4
4
31
5,521
good model
Cartesia’s Sonic-3.5 takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Inworld Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, including 9 Indian languages, with 500 voices available out of the box. The model has been highly preferred among voters in the TTS Arena, with its demonstrated naturalness and accurate transcript following. Key takeaways: ➤ Quality: Sonic-3.5 has an Elo score of 1,218 ( 16/-16) based on 1,144 arena appearances, placing it ahead of Inworld Realtime TTS 1.5 Max at 1,194 and Gemini 3.1 Flash TTS at 1,209 ➤ Pricing: Sonic-3.5 is priced at $39/1M characters, a premium compared to Gemini 3.1 Flash TTS at $18.3/1M characters, and Inworld Realtime TTS 1.5 Max at $35/1M characters ➤ Speed: 105.5 characters per second, compared to 205 characters per second for Inworld Realtime TTS 1.5 Max and 26.3 characters per second for Gemini 3.1 Flash TTS See more details and listen to samples below 🧵
2
33
1,725
brandon wang retweeted
Bravo, Elim! @itselimchan named Music Director of @SFSymphony. 39 year-old Hong Kong born conductor becomes among very few woman and Asian Americans to lead a major symphony. Came to #USA to become a doctor…and then fell in love w/ music @UMich. @nbcbayarea
1
11
55
7,589
genuinely how do you serve this model at such high speeds on literally every search request
May 19
Available today, Gemini 3.5 Flash is the new default model in AI Mode in Search for everyone globally. #GoogleIO
5
419
the malware specialized to target exclusively large llm training runs is gonna be insane holy shit
the fast16 malware was almost certainly targeting spherical implosion simulations. left: unmodified LS-DYNA 970 right: LS-DYNA 970 modified with the relevant portions of fast16.sys both running a spherical implosion deck
25
112
3,504
158,715
hey @AnthropicAI your ai is misaligned
1
3
14
1,857
claude computer use is good at data entry but surprisingly bad at spreadsheet formatting, perhaps there is hope for the investment banking analysts after all
1
10
951
alias claude="claude --effort max"
1
9
508
happy @twilio earnings day to all who celebrate
5
431
broke: celebrating new year on 1/1 woke: celebrating lunar new year bespoke: celebrating new year on deepseek major model version bump
18
1,149