light

light

24 Photos and videos

Tweets

Pinned Tweet

light @reprompting

Jun 9

I’ll be posting threads of chapter summaries (sort of) here. I’m still not sure how accountable I can keep myself, but one can always try.

light @reprompting

Jun 7

Programming Massively Parallel Processors (PMPP) has 22 chapters in total. If I do one chapter a day, it should take me about three weeks? 🤔 I want to do this.

122

light

light @reprompting

Jun 9

I’ll be posting threads of chapter summaries (sort of) here. I’m still not sure how accountable I can keep myself, but one can always try.

light @reprompting

Jun 7

Programming Massively Parallel Processors (PMPP) has 22 chapters in total. If I do one chapter a day, it should take me about three weeks? 🤔 I want to do this.

122

more replies

light

light @reprompting

33m

Chapter 6/22 (day 6) x.com/reprompting/status/206…

light @reprompting

19h

Summarizing chapter 6 of PMPP. This chapter focues on off-chip memory (DRAM) architecture and discusses performance considerations such as memory coalescing, memory latency hiding and thread coarsening.

light

light @reprompting

33m

Chapter 7/22 (day 7) x.com/reprompting/status/206…

light @reprompting

34m

Summarizing chapter 7 of PMPP. This chapter uses convolution as a case study for GPU optimization. It progresses from naive implementation to constant memory and then shared-memory tiling.

light

light @reprompting

34m

Summarizing chapter 7 of PMPP. This chapter uses convolution as a case study for GPU optimization. It progresses from naive implementation to constant memory and then shared-memory tiling.

light

light @reprompting

34m

The main challenge compared to tiled matrix multiplication is handling of halo cells. An output tile required neighboring input elements beyond its boundaries, making the input tile larger than the output file.

light

light @reprompting

16h

CUDA matrix multiplication using shared memory tiling. It really does feel nice when it clicks.

230

13,362

light

light @reprompting

19h

more replies

light

light @reprompting

19h

The chapter then discusses hiding memory latency. To achieve high memory throughput, there must be enough threads issuing memory requests simultaneously. Thread-level parallelism and memory-level parallelism go hand in hand.

light

light @reprompting

19h

Another optimization introduced is thread coarsening. Instead of assigning one unit of work per thread, each thread performs mutiple units of work. This can reduce redundant loads, redundant works and synchronization overhead.

light

light @reprompting

Jun 13

Surely, surely, they must have developed something.

light @reprompting

Jun 13

If anything, the Lazarus Group should have built something like Fable internally. They could do the funniest thing and release it publicly.

light

light @reprompting

Jun 13

If anything, the Lazarus Group should have built something like Fable internally. They could do the funniest thing and release it publicly.

light

light @reprompting

Jun 13

All the fearmongering worked. Anthropic is now the arm of the U.S. government. The next step is Fable/Mythos "escaping" the lab.

Anthropic

@AnthropicAI

Jun 13

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

light

light @reprompting

Jun 12

elon was pretty cool back then, all about high tech stuff. showed up in big bang theory, iron man and generally seems like real life tech genius. then something seemed to change after that thailand cave rescue controversy

Elon Musk

@elonmusk

24 Dec 2011

Kanye stopped by the SpaceX rocket factory today.