Clive Chan

Tweets

Pinned Tweet

Clive Chan

@itsclivetime

Jun 6

Personal update: I’ve decided to leave OpenAI. I’m proud to have been part of the custom chip program and grateful to everyone I got to build with and learn from along the way. The density of hardware talent on that team is extraordinary, and I don't think there's a better chip design team anywhere. It's been a wild journey from second hardware hire, 2.4 years ago, to now, and I'm excited to watch these chips become one of the most important engines of AGI. At the same time, I haven’t been able to shake the pull to climb a new mountain from the bottom again! I joined @AnthropicAI this week because I was deeply impressed with the team’s talent, values, and ambition, and I'm already energized by the pace and intensity of the past few days. It’s time to build.

391

322

7,423

2,789,081

Clive Chan

@itsclivetime

Jun 6

(sadly can't say much about the chip just yet outside of openai.com/index/openai-and-… - but the blogpost says `targeted to start in the second half of 2026` so keep an eye out for stuff soon!)

627

149,538

Clive Chan

@itsclivetime

Jun 6

bruh this is how he's getting his memory wafers

173

35,950

Clive Chan

@itsclivetime

Jun 6

it's so over x.com/firstadopter/status/20…

tae kim

@firstadopter

Jun 5

"No HBM for you. I need HBM" - Jensen Fact check. True. $NVDA

1:05

26,041

Clive Chan retweeted

SemiAnalysis

@SemiAnalysis_

Jun 4

Ex-OpenAI Tech Lead, Justin Lebar joins SemiAnalysis as a Visiting Fellow to Burn $10,000 in 3 hours to find dozens of AMDGPU LLVM, x86 LLVM, NVPTX bugs
00:00 - Intro & Justin’s background
00:59 - How compiler fuzzing works
01:56 - Why we did this project
02:48 - The gap in GPU vs. CPU compiler testing
04:13 - The major AMD & x86 bugs we found
05:38 - Using LLMs to read code & find vulnerabilities
07:56 - The impact of UltraCode mode
12:18 - Doing this without AI (Time & manual limits)
15:03 - The future of AI in software development
16:17 - What’s next key takeaways for devs

23:19

376

70,833

Clive Chan

@itsclivetime

Jun 1

honestly pretty solid idea

Kevin Frazier

@KevinTFrazier

Jun 1

New @nytimes op-ed by @BernieSanders calls for sovereign wealth fund tied to the stock of frontier labs. Whether or not you back this idea, his closing is a reminder that people support what they help build. Absent more meaningful mechanisms for people to share their views on AI and shape its development, the backlash will grow and “missed uses” will become the default (ie we will fail to realize the most beneficial uses of AI). “It must be decided by workers, parents, teachers, artists, scientists, communities and the American people. It’s our future. We must decide it.”

26,682

Clive Chan

@itsclivetime

Jun 1

most confidential filing in the history of filings, maybe ever

Anthropic

@AnthropicAI

Jun 1

Anthropic has confidentially submitted a draft S-1 registration statement to the Securities and Exchange Commission. Pending completion of SEC review, this gives us the option to pursue an initial public offering. Read more: anthropic.com/news/confident…

207

34,580

Clive Chan retweeted

Hieu Pham

@hyhieu226

May 29

youtu.be/tas0O586t80?feature… 😂

Program in C

Someone in Discord linked to this tweet and I figured I'd take a st...

youtube.com

Elon Musk

@elonmusk

May 28

SpaceX has almost finished writing V1.0 of an in-house AI training stack in C that exact-maps to 220k GB300s with 800G NICs, making heavy use of pipeline parallelism and getting as close to bare metal as possible. The potential speed improvement vs JAX for large training runs is over an order of magnitude.

48,675

Clive Chan

@itsclivetime

May 29

>hundreds of parallel subagents
gonna hit your quota in like 3 seconds

Claude

@claudeai

May 28

Replying to @claudeai

Also new in Claude Code: dynamic workflows (research preview). For the hardest tasks, Claude makes a plan, runs hundreds of parallel subagents, and verifies its work before reporting back. Think a migration touching hundreds of files. Read more: claude.com/blog/introducing-…

131

25,007

Clive Chan

@itsclivetime

May 28

Improving CPU speed by 10x should not affect training speed essentially at all. The CPU's main job is to kick off the real work on the GPU. If your kernels are sane (fused etc), the time to launch a kernel on the CPU is <<1% of the kernel runtime, even in Python.

Elon Musk

@elonmusk

May 28

543

134,491

Clive Chan

@itsclivetime

May 28

Besides that, much of the time when launching a lot of small kernels sequentially is consumed by the Nvidia driver itself (which is already in C), not by the training framework (which is what Elon's rewrite is about). There's a lot of first principles here being missed.

15,821

Clive Chan

@itsclivetime

May 28

oh yeah and cuda graphs even remove this overhead!

8,664
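
The argument in the thread above can be put into rough numbers with Amdahl's law. This is a minimal sketch, not anything from the tweets themselves: the function name `training_speedup` and the sample fractions are hypothetical, and it assumes CPU launch time is fully serialized with GPU work. Real kernel launches are asynchronous and CUDA graphs amortize them further, so the true benefit would be smaller still.

```python
def training_speedup(launch_fraction: float, cpu_speedup: float) -> float:
    """Amdahl's-law estimate of end-to-end speedup when only CPU-side
    kernel-launch work gets faster.

    launch_fraction: share of step time spent launching kernels on the CPU
                     (hypothetical input; the thread claims it is well under 1%).
    cpu_speedup:     factor by which the CPU-side work is accelerated.
    """
    if not 0.0 <= launch_fraction <= 1.0:
        raise ValueError("launch_fraction must be between 0 and 1")
    if cpu_speedup <= 0.0:
        raise ValueError("cpu_speedup must be positive")
    # GPU time is untouched; only the launch share shrinks.
    return 1.0 / ((1.0 - launch_fraction) + launch_fraction / cpu_speedup)


# Illustrative launch shares only; none of these are measured values.
for fraction in (0.001, 0.01, 0.05):
    gain = training_speedup(fraction, cpu_speedup=10.0)
    print(f"launch share {fraction:.1%}: {gain:.4f}x overall")
```

Under these assumptions, a 1% launch share with a 10x faster CPU path yields an overall gain of under 1%, nowhere near an order of magnitude.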

Clive Chan

@itsclivetime

May 28

interesting article @SemiAnalysis_ @justinlebar newsletter.semianalysis.com/…

Finding Miscompiles for Fun, Not Profit

Or: You don’t need access to Claude Mythos to spend $10,000 in an afternoon.

newsletter.semianalysis.com

5,542

Clive Chan retweeted

Calissa M

@calxoxo

May 27

they do double rl instead of rl in seoul

3,797

Clive Chan

@itsclivetime

May 26

leo xiv joining anthropic as MTS (member of theological staff)

Chris Olah

@ch402

May 25

x.com/i/article/205889088750…

10,284

Clive Chan

@itsclivetime

May 26

seriously crazy timeline we live on

3,068

Clive Chan

@itsclivetime

May 26

(congrats chris!)

2,820

Clive Chan

@itsclivetime

May 24

gave claude code a try for a side project. some first impressions (as compared to codex):
- remote control is basically unusable, keeps disconnecting. codex takes much longer to load conversations but at least the connection persists
- feels generally quicker, nice for rapid iteration. or maybe i need to dial down my codex thinking effort?
- dictation is much worse, it’s constantly wrong and cuts off early. in codex i never have to think about it, it’s just always right
- app forgets the text i wrote when i switch conversations, ugh
- vague vibe that it’s more proactive than codex (like, anticipating what i want, not just following explicit instructions)
- i miss codex’s tab = queue, enter = interrupt
- both are awful at multimodal tool use, both have similar algos intelligence, both cli harnesses feel the same & have similar text rendering issues

overall it’s a wash except for claude app being behind. agentic coding really is still in early days!

110

12,367

Clive Chan

@itsclivetime

May 22

another one. this will be good

Dwarkesh Patel

@dwarkesh_sp

May 22

New blackboard lecture w @reinerpope
How do chips actually work – starting with basic logic gates, and working up to why GPUs, TPUs, FPGAs, and the human brain each look the way they do.
0:00:00 – Building a multiply-accumulate from logic gates
0:16:20 – Muxes and the cost of data movement
0:25:59 – How systolic arrays work
0:39:00 – Clock cycles and pipeline registers
0:51:40 – FPGAs vs ASICs
1:03:14 – Cache vs scratchpad
1:07:16 – Why CPU cores are much bigger than GPU cores
1:11:49 – Brains vs chips
1:15:22 – A GPU is just a bunch of tiny TPUs
Look up Dwarkesh Podcast on YouTube/Spotify/etc to watch. Enjoy!

1:20:19

8,169