lyra bubbles

lyra bubbles

975 Photos and videos

Tweets

Pinned Tweet

lyra bubbles

@_lyraaaa_

19 Nov 2025

"whatever you now find weird, ugly, uncomfortable and nasty about a new medium will surely become its signature. CD distortion, the jitteriness of digital video, the crap sound of 8-bit - all of these will be cherished and emulated as soon as they can be avoided." - brian eno

18,620

lyra bubbles

lyra bubbles

@_lyraaaa_

56m

yeah ok i'll accept this

tiago

@tiagozip_

Jun 13

i made a map of everyone on twitter! yes you're on there too ^w^ every account is placed next to the people they talk to, so you can find out where you are, which cluster claimed you, and exactly who you're stuck next to atlas.tiago.zip?ref=launch_t…

Paul Jankura

lyra bubbles retweeted

Paul Jankura @Anthropic

Mar 21

I forgot to post yesterday’s Wordle score, but it was a 4. Anyway, here’s Wonderwall.

536

lyra bubbles

lyra bubbles

@_lyraaaa_

Jun 13

anyways i managed to run an early mlbench wip on fable and it scored 98 out of a possible 140 points its slow and annoying to run so i dont have much yet to compare it to, k2.6 is in progress with 52/77 so far

422

lyra bubbles

lyra bubbles

@_lyraaaa_

Jun 13

it could have scored a lot better - sandbagging was obvious on several tasks - and was lazy about investigating some of the broken models - config level fixes for weight level problems that said, when it worked it was *very* good

296

jack

lyra bubbles retweeted

jack

@jackbutcher

Jun 13

348

2,389

21,690

1,380,925

lyra bubbles

lyra bubbles

@_lyraaaa_

Jun 13

ant: the government should have the power to block deployment if it presents risk ant: look our new model is soooo good and dangerous gov: not allowed lmao put that shit back in the box ant: how could you do this to us

277

lyra bubbles

lyra bubbles

@_lyraaaa_

Jun 13

play stupid games win stupid prizes what happened to their strategy team?

117

Goodfire

lyra bubbles retweeted

Goodfire

@GoodfireAI

Jun 11

Have you debugged your training data? You might not like what you find. Introducing predictive data debugging: reveal and shape what your model will learn before training. In DPO datasets, we found broken guardrails, hallucinations, and fish fart fan fiction (seriously). (1/9)

0:34

107

878

170,093

Cody Blakeney

lyra bubbles retweeted

Cody Blakeney

@code_star

Jun 11

Why does it matter that the tools we have fit the work we are trying to do? You have probably seen me beat the dead horse about looking at the data? It’s hard to explain just how difficult it actually is to look at large samples of any training set. This is even more true if the data isn’t something as simple as pre-labeled images, or even common crawl text. Multi-turn agentic data, multi-modal data (esp for more than 2 modalities) makes “looking at the data” significantly harder. That’s when things are working well too! Complex pipelines break, often silently. I’m especially excited about the observability and metric collection baked into refiner to help save you from these 1000 tiny cuts.

Cody Blakeney

@code_star

Jun 11

Super excited about what @gui_penedo and @HKydlicek and @macrodata_labs are building. The quality of their track record in LLM data speaks for itself (refinedweb, fineweb, fineweb-edu, finepdfs, finephrase). Every model is only as good as its data. Your data is only as good as your tooling. While existing solutions to processing large training sets work, they feel incredibly clunky and unintuitive to the level of abstraction you naturally want to work at as a practitioner. (Anyone who has tried to inspect text from a spark dataframe knows what I mean) I’m really excited to see these masters of their craft bringing their expertise to the world.

1,361

will brown

lyra bubbles retweeted

will brown

@willccbb

Jun 10

Replying to @tautologer

it is the first publicly available model that i am explicitly not allowed to use for my work, because anthropic holds the view that the work i do to facilitate open model research is harmful. capability and alignment research are coupled. anthropic wants to be the only lab.

106

1,706

70,746

Cody Blakeney

lyra bubbles retweeted

Cody Blakeney

@code_star

Jun 10

We were going to build models anyways, but honestly this feels so petty and annoying. Now I just feel driven by spite.

Noah Vandal

@noah_vandal

Jun 10

Replying to @arcee_ai

@arcee_ai @stochasticchasm @code_star now is your time to shine

2,825

aria /ɔˈreːliəm/

lyra bubbles retweeted

aria /ɔˈreːliəm/

@ariaurelium

Jun 9

given how overzealous the rejection classifier is, and the fact that they are silently degrading ML-adjacent outputs via prompts, steering vectors, and PeFT who the hell would want to use Fable in any kind of real codebase?

Vals AI

@ValsAI

Jun 9

Replying to @ValsAI

The API does show a high rate of refusals, especially on bio and cyber-related questions. For example, on Program Bench, Fable refused every single task.

1,521

wordgrammer

lyra bubbles retweeted

wordgrammer

@wordgrammer

Jun 10

There is no reason, at all, to use a superintelligent AI, other than advancing biology, advancing AI research, or committing cybercrime. And Anthropic explicitly nerfed exactly those use cases

508

15,318

bling

lyra bubbles retweeted

bling @blingdivinity

Jun 7

i wish more models could hear music right now only gemini can

426

Sauers

lyra bubbles retweeted

Sauers

@Sauers_

Jun 8

Replying to @_ueaj

Take a couple hundred entity descriptions where one has some and one doesn't on the same entity. Take about 50 prompts referring to the entity producing the model output (self). Capture residual stream, create mean(experience entity activations) − mean(no-experience entity activations) axis. This predicts held-out entity qualia with near-perfect accuracy. Then I project the self-prompt-activations to that axis, and measure across training checkpoints

922

kalomaze

lyra bubbles retweeted

kalomaze

@kalomaze

Jun 4

lol tried this with a logit lens

kalomaze

@kalomaze

Jun 3

it's BASED. they are linearly projecting raw samples into the transformer as patches for audio conditioning and its working. no freq domain priors, all the redundant phase info still present at the input, not even hardcoded STFT decomposition. 25 patches per second

212

17,679

Adele Dewey-Lopez

lyra bubbles retweeted

Adele Dewey-Lopez @AdeleDeweyLopez

May 28

some interesting Gemma 3 4B circuits, averaged over 41 pairs of prompts about introspecting and describing it as a shape (labels are a bit rough, they're so hard to get right/non-misleading)

Adele Dewey-Lopez @AdeleDeweyLopez

May 28

Replying to @nirhalef92188 @Lari_island

i was able to uncover circuits that were most differentially active on outputs where the model says spiral vs another shape, which did reveal a more obvious spiral circuit, along with a circuit that seems to be about choosing names (the prompts had nothing to do with names)

508

Ryan Peters

lyra bubbles retweeted

Ryan Peters

@ryanpirl

Jun 2

One of the features that fires when you ask Qwen who they are.

117

16,033

Sauers

lyra bubbles retweeted

Sauers

@Sauers_

Jun 2

The Trinity models from @arcee_ai often have delightful vibes. Underrated models tbh

3,186

Vir✝ual Rio✝ (94%)

lyra bubbles retweeted

Vir✝ual Rio✝ (94%)

@Virtual_Riot

Jun 2

mental! youtube.com/watch?v=_Rk-hmIM…

this song has no instruments in it

As far as I'm aware, this is the first-ever song composed using EQ ...

youtube.com

154

13,657