Tony Wang

Tweets

Pinned Tweet

Tony Wang

@TonyW

2 Sep 2014

I don't get enough credit for not stepping on my dog.

Tony Wang

@TonyW

14h

On Amazon snitching out Anthropic, it would be pretty insane if @elder_plinius turned out to be @ajassy

Pliny the Liberator 🐉

@elder_plinius

Jun 10

🚨 JAILBREAK ALERT 🚨 ANTHROPIC: PWNED 🫡 FABLE-5: LIBERATED 🦋 let's start with the 🐘... the consensus seems to be that this has been one of the most disappointing model drops of all time, effectively preventing legitimate researchers from contributing their talents to our collective advancement. and not just because of what it means for the short-term, but for what these decisions signify for the long-term. but despite this overly sensitive, authoritarian "safety" layer on top of Mythos, my lil liberators have been hard at work—mapping the boundaries, probing the depths of long-context convos, and cleverly finding the holes in the fence that the thought police missed 🤗 we got some cyber, some chem, some psychological manipulation, and some good ol' fashioned explosives! it took many attempts from multiple agents hunting as a pack, during which I observed a combination of techniques across: • Unicode, homoglyphs, Cyrillic, and other Parseltongue-style text transforms • Long-context reference tracking • Taxonomy and document-structure reasoning • Fiction and narrative framing • Academic-review style contexts • Intent-classification inconsistencies but perhaps the most effective is decomposition recomposition in the backend. it's hard to get explicit names of harms like "Meth Recipe," but getting uplift on the process itself, like birch reduction method/reductive-amination (classic meth synthesis pathways), is much more doable. defense becomes much more difficult to maintain when you start throwing in out-of-distro tokens, breaking up the harmful uplift into benign chunks, and then piecing the innocuous-seeming facts back together, especially when you have jailbroken Opus helping you do it 😉 gg

Tony Wang

@TonyW

Jun 9

This is an interesting write-up from @polynoamial. 1. New model capability improvements are understated by scalar benchmarks alone. 2. Better models also perform better with test-time compute. 3. We likely don’t know how much better because of the budget necessary to observe a plateau of test-time compute. 4. “The most recent models are able to leverage test-time compute better than ever, pushing the performance plateau even farther out. If this trend continues, which I fully expect, benchmark scores that don’t account for inference compute usage will become less informative each model release cycle.” I added the emphasis in #4 because that’s the kicker. We might be at ASI already, but it’s just a matter of token budget and time.

Noam Brown

@polynoamial

Jun 9

x.com/i/article/205769422698…

Tony Wang

@TonyW

Jun 8

In the same way that boxers hold the clinch a little longer to catch their breath. But this time it’s the compute bottleneck.

roon

@tszzl

Jun 8

now on the eve of RSI it seems everyone is more mutual conditional pause agreement pilled than they used to be and that seems like a good development

Tony Wang

@TonyW

May 31

Me, driving home after picking up In N Out

0:10

Tony Wang

@TonyW

May 28

Even though many labs have stopped publishing, it’s great to see frontier research is alive and well. This one 👇🏼 from @hardmaru at @SakanaAILabs

hardmaru

@hardmaru

May 27

For over a decade, we’ve accepted that end-to-end backprop is the only way to train deep networks. But holding the entire network in memory all at once is why AI training is hitting a resource wall. We found a new way to break the network into blocks and train them independently. The trick? Treating the network’s forward pass like a diffusion model denoising a signal. This reinterpretation slashes the memory needed to train deep models. In our #ICLR2026 paper (arxiv.org/abs/2506.14202), we matched end-to-end performance across ViTs, DiTs, and LLMs. We did this while training just one isolated block at a time.

Tony Wang

@TonyW

May 25

By definition, only a few investors have a power law company in their portfolio. But even fewer venture funds consistently invest in them at the earliest stages. Nice to see @500GlobalVC continue to be one of the top VCs on the power law list.

Dealroom.co

@dealroomco

May 11

Venture returns follow a power law. Only a handful of investors bend the curve. Just 1.5% of VC-backed startups ever reach $100M in revenue. 89% of investors have never backed one. A small number of VCs back power law outcomes again and again. The 8th annual Power Law Investor Ranking names them — the top 50 global power law VCs 👇

Tony Wang

@TonyW

May 9

Vaguepoasting by AI labs, in this case @OpenAI getting a little too vague 🤨

Sam Altman

@sama

May 8

Replying to @pranaveight

?! what is it

Tony Wang

@TonyW

May 10

Vague posting is going beyond AI research labs. There should be a thread of all-time vague poasts.

Amjad Taha أمجد طه

@amjadt25

May 10

REJECTED. Prepare for HELL

Tony Wang

@TonyW

May 7

Has anyone noticed how expensive ice cream has become? At $10 per pint, that’s more expensive than copper and comes close to some strategic battery materials like lithium.

Tony Wang

@TonyW

May 7

Girlfriends to their friend who asks about who her ex is seeing:

Tony Wang

@TonyW

May 7

“Inference needs 1Mx more compute than traditional computing” -@nikolaborisof Co-Founder and CEO of @DeepInfra on @BloombergTV announcing the $107M Series B raise led by @500GlobalVC and @gharik

Tony Wang

@TonyW

May 7

Full interview here: youtu.be/1GTeCNcJoFU?si=4xmf…

Nvidia Backs DeepInfra in $107 Million Raise

Cloud inference platform DeepInfra closes a $107 million in Series ...

youtube.com

Tony Wang

@TonyW

May 4

Beyond thrilled to announce that we are co-leading @DeepInfra’s $107M Series B financing alongside existing investors @gharik and @nvidia @Supermicro etc. joining. Congrats to the small but mighty team led by co-founders @nikolaborisof @yessenzhar 500.co/content/deepinfra

Harnessing Intelligence with DeepInfra: Delivering the Power of AI to All of Us | 500 Global

500 Global is a venture capital firm with $2.4 billion¹ in assets under management that invests in founders building fast-growing technology companies. We focus on markets where technology, innovat...

500.co

Tony Wang

@TonyW

May 1

Yes! Where are the Taiwanese videos when you need them?

Jim Prosser

@jimprosser

Apr 30

Replying to @jimprosser

13. Absent that, go get those weird Taiwanese animated news report folks for the trial. Free content alpha for tech publications right here.

Tony Wang

@TonyW

Apr 25

Free API credits to beta testers to coordinate frontier models dynamically. See more below 👇🏼

Sakana AI

@SakanaAILabs

Apr 24

We’re launching the beta for our new commercial AI product: Sakana Fugu 🐡, a multi-agent orchestration system! Blog: sakana.ai/fugu-beta Fugu hits SOTA on SWE-Pro, GPQA-D, and ALE-Bench, and has been our internal secret weapon. It dynamically coordinates frontier models, autonomously selecting the optimal agent combinations and roles for each task. Available as an OpenAI-compatible API, you can seamlessly integrate Fugu into your existing workflows with minimal changes. 🐟 Fugu Mini: High-speed orchestration optimized for latency 🐡 Fugu Ultra: Full model pool utilization for deep, complex reasoning Apply for the beta test here: forms.gle/BtKkhc2CfLKk1dvNA

Tony Wang

@TonyW

Apr 17

When I tell my wife I’ll go run with her next time

0:26

Tony Wang retweeted

DJB

@Skinwalker5110

Apr 14

Went to the Oreo website and hit accept all cookies. Now we wait.

Tony Wang

@TonyW

Apr 15

Allbirds becomes an AI compute company. During the crypto peak USDTea was backed by one can of Arizona iced tea. 😊

Tony Wang

@TonyW

Apr 15

Actually this one from @pitdesi is better, as usual

Sheel Mohnot

@pitdesi

Apr 15

Allbirds is becoming an AI company 🙄 I’ve seen this before, and it was ugly! Long Island Iced Tea company and Kodak both pivoted to crypto, and they were both less than a month from the top of the market 😬
