brandon wang

brandon wang

31 Photos and videos

Tweets

Pinned Tweet

brandon wang

@fluorane

11 Jul 2025

happy to announce that we've gotten rid of tokenizers! especially excited with what we've replaced them with: end-to-end trainable modules that not only learn to group characters into (sub)words, but can iterate to group words into phrases and further higher-order concepts see @sukjun_hwang's thread for more details 👇

Sukjun (June) Hwang

@sukjun_hwang

11 Jul 2025

Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data

756

75,593

Soren Kierkegaard

brandon wang retweeted

Soren Kierkegaard

@BumrahBachi

Jun 11

never seen anybody fuck up as badly as fox just did

757

brandon wang

brandon wang

@fluorane

Jun 10

cute result, always fun to see how well principled simple approaches do well in comp bio

Lior Pachter @lpachter

Jun 10

Replying to @lpachter

In our preprint we prove a theorem: the only method satisfying rank monotonicity, perturbation additivity (plays well with PCA), relabeling equivariance (input order doesn't matter), depth invariance, and a basic calibration, is CLR. 13/

739

brandon wang

brandon wang

@fluorane

Jun 9

someone pointed out to me (in early 2025) that the reason there is no american deepseek is that js/hrt/etc all did not believe they would ever lose access to frontier capabilities anyway

elie

@eliebakouch

Jun 9

mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community also the fact that this is un purpose not visible to the user is crazy

551

57,669

brandon wang

brandon wang

@fluorane

Jun 8

one of the stronger pieces of cope i see is people/startups believing that their product/research direction is safe bc the labs are "just working on code/scaling/etc" this thinking seems to fundamentally misunderstand the point of going after coding scaling first

2,250

brandon wang

brandon wang

@fluorane

Jun 3

so what the hell did openai put into gpt image 2 imo still the most impressive model of the year so far

1,655

brandon wang

brandon wang

@fluorane

Jun 2

surely managing a team of agents is also ddr

Sergiy Galyonkin

@galyonkin

Jun 2

A conversation I had earlier today.

358

Harry Heymann 🥑

brandon wang retweeted

Harry Heymann 🥑@harryh

Jun 1

I really miss the "information wants to be free" left of the 90s / early 2000s. We have to go back.

2,607

brandon wang

brandon wang

@fluorane

May 28

kimi agent swarm has been achieved domestically, part 2

Claude

@claudeai

May 28

Replying to @claudeai

Also new in Claude Code: dynamic workflows (research preview). For the hardest tasks, Claude makes a plan, runs hundreds of parallel subagents, and verifies its work before reporting back. Think a migration touching hundreds of files. Read more: claude.com/blog/introducing-…

1,047

brandon wang

brandon wang

@fluorane

May 28

happened again (oc @radiuskia)

Cartesia

@cartesia

May 28

Cartesia Ink-2 debuts as #1 for accuracy on the brand-new streaming speech-to-text leaderboard from @ArtificialAnlys! We designed Ink-2 from the ground up for voice agents - with low latency, eager transcripts, and semantic endpointing.

2,207

brandon wang

brandon wang

@fluorane

May 25

the dim sum restaurant im at is showing a semiconductor lithography report from a hong kong tv channel, amazing

1,311

31,334

Arnav Shah

brandon wang retweeted

Arnav Shah @arnavshah0

May 24

Oh actually my bad it's an oral

Arnav Shah @arnavshah0

May 2

Excited to announce that dnaHNet has been accepted as an ICML 2026 Spotlight paper! Very grateful to my coauthors @victor_ljz and team, plus our remarkable supervisors @_albertgu and @genophoria.

5,521

brandon wang

brandon wang

@fluorane

May 22

good model

Artificial Analysis

@ArtificialAnlys

May 22

Cartesia’s Sonic-3.5 takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Inworld Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, including 9 Indian languages, with 500 voices available out of the box. The model has been highly preferred among voters in the TTS Arena, with its demonstrated naturalness and accurate transcript following. Key takeaways: ➤ Quality: Sonic-3.5 has an Elo score of 1,218 ( 16/-16) based on 1,144 arena appearances, placing it ahead of Inworld Realtime TTS 1.5 Max at 1,194 and Gemini 3.1 Flash TTS at 1,209 ➤ Pricing: Sonic-3.5 is priced at $39/1M characters, a premium compared to Gemini 3.1 Flash TTS at $18.3/1M characters, and Inworld Realtime TTS 1.5 Max at $35/1M characters ➤ Speed: 105.5 characters per second, compared to 205 characters per second for Inworld Realtime TTS 1.5 Max and 26.3 characters per second for Gemini 3.1 Flash TTS See more details and listen to samples below 🧵

1,725

Raj Mathai

brandon wang retweeted

Raj Mathai @rajmathai

May 21

Bravo, Elim! @itselimchan named Music Director of @SFSymphony. 39 year-old Hong Kong born conductor becomes among very few woman and Asian Americans to lead a major symphony. Came to #USA to become a doctor…and then fell in love w/ music @UMich. @nbcbayarea

7,589

brandon wang

brandon wang

@fluorane

May 19

genuinely how do you serve this model at such high speeds on literally every search request

Google

@Google

May 19

Available today, Gemini 3.5 Flash is the new default model in AI Mode in Search for everyone globally. #GoogleIO

ALT Text reads “Google Search: Now with Gemini 3.5” next to an image of a Search results page.

419

brandon wang

brandon wang

@fluorane

May 15

the malware specialized to target exclusively large llm training runs is gonna be insane holy shit

hanlon’s mortola razr

@rhizomaticthot

May 13

the fast16 malware was almost certainly targeting spherical implosion simulations. left: unmodified LS-DYNA 970 right: LS-DYNA 970 modified with the relevant portions of fast16.sys both running a spherical implosion deck

112

3,504

158,715

brandon wang

brandon wang

@fluorane

May 8

hey @AnthropicAI your ai is misaligned

1,857

brandon wang

brandon wang

@fluorane

May 8

claude computer use is good at data entry but surprisingly bad at spreadsheet formatting, perhaps there is hope for the investment banking analysts after all

951

brandon wang

brandon wang

@fluorane

May 5

alias claude="claude --effort max"

508

brandon wang

brandon wang

@fluorane

Apr 30

happy @twilio earnings day to all who celebrate

431

brandon wang

brandon wang

@fluorane

Apr 24

broke: celebrating new year on 1/1 woke: celebrating lunar new year bespoke: celebrating new year on deepseek major model version bump

1,149