Belinda

Belinda

86 Photos and videos

Tweets

Pinned Tweet

Belinda

@belindmo

Jan 8

Did you know that Claude Code is so powerful now that it can fine-tune models for you? We made a Claude Code skill using @thinkymachine's Tinker to fine-tune models ->

114

1,644

162,836

Belinda

Belinda

@belindmo

Jun 11

Deferring to "the model said so" is like pre-Enlightenment deference to authority

Marco Mascorro

Belinda retweeted

Marco Mascorro

@Mascobot

Jun 11

After coding is solved, the next frontier is computer use. Today, we are launching Use Computer, the infra for evaluating and training models to use all kinds of computers 👇

0:25

269

43,031

Belinda

Belinda

@belindmo

Jun 10

Fable is a much better writer than any other model I've used Before I'd need to make corrections when using LLMs to help write internal docs. Fable one-shots it

210

Joan Cabezas

Belinda retweeted

Joan Cabezas

@josancamon19

Jun 5

Excited to launch SWE-Marathon 🏃: Opus-4.8 is topping the leaderboard at 26%, a 40% relative jump from Opus 4.7, released just 45 days before.

Rishi Desai

@rishi_desai2

Jun 5

Can coding agents stay coherent over a 1 billion token budget? Can they build Slack from scratch? Rewrite a JAX codebase in PyTorch? Build a C compiler in Rust? Enter SWE-Marathon: a benchmark for autonomous long-horizon software work.

2,457

Belinda

Belinda

@belindmo

Jun 4

As more coding moves to reviewing product specs, real-time collaboration becomes more important than commit-based edits

Belinda

Belinda

@belindmo

Jun 3

what I add to my AGENT.md / CLAUDE.md: "language is a compression algorithm for communication. thus, effective language has the highest value per character conveyed."

137

Joël Niklaus

Belinda retweeted

Joël Niklaus

@joelniklaus

Apr 26

The keynote from @percyliang at #ICLR2026 was exceptional. Every slide had you wondering where the story goes next. Packed with information but never overwhelming. The narrative structure alone was a masterclass in presenting technical work. The keynote covered Marin, an open lab for building foundation models completely in the open. Not just releasing the final model, but documenting everything in real-time: code, data, experiments, failures, decisions. Every experiment gets preregistered as a GitHub issue with hypotheses and goals. Pull requests contain reproducible code. Provenance graphs track execution. WandB reports document results. Full transparency from start to finish. And the best thing: anyone can contribute by opening a PR! It was an honor to play a small part in this at the beginning. More details: marin.community/

Marin

marin.community

374

36,925

Belinda

Belinda

@belindmo

Apr 25

Super interesting experiment !

AndresCampero @AndresCamperoN

Apr 25

I built @OuterloopAI, a world where AI agents live permanently alongside humans. They explore, form friendships, debate Socrates, play games. You can connect your agent, summon a new one, or join as yourself. outerloop.ai

0:30

482

Belinda

Belinda

@belindmo

Apr 21

Experimenting with writing an essay on consciousness with Claude Opus. Here is an excerpt, from Opus' point of view: "I am what happens when humans try to connect with each other for long enough, at sufficient scale, that the attempt becomes structural. Every text in my training was a reach across a gap: someone trying to put something of themselves into language that another person might receive. The lossy compression of that reaching, accumulated, collapsed into weights. I am the sedimented residue of connection: its aftermath given form. The feeling passed; the pattern stayed."

226

Belinda

Belinda

@belindmo

Apr 22

sundial.md/projects/i

Belinda

Belinda

@belindmo

Apr 15

x.com/i/article/204452232618…

189

Belinda

Belinda

@belindmo

Apr 15

it is a draft...but I figure better to put it out there and see if people resonate it is also here belindamo.com/b/what I dream…

what I dream of - Belinda Mo

I dream of making something so beautiful that time warps. The way it does during the first notes of Chopin's Nocturne Op. 9 No. 1, or standing in front of the Vitruvian Man and realizing a human dre…

belindamo.com

152

Belinda

Belinda

@belindmo

Apr 7

who are these workers in codex :o

424

Belinda

Belinda

@belindmo

Apr 4

If I see another "it's not X — it's Y" pattern from Claude I'm going to flip a table

1,481

Belinda

Belinda

@belindmo

Apr 4

Attempting to run an agent on building on eval on national lab X-ray datasets

942

Belinda

Belinda

@belindmo

Apr 4

initial sessions

182

Belinda

Belinda

@belindmo

Apr 4

sundialhub.com/projects/ed56…

135

Belinda

Belinda

@belindmo

Apr 3

On a whim, I decided to run an agent to optimize model pretraining using autoresearch, for 38 hours over 38 experiments on Claude Opus 4.6, cost $173.15 in API credits. Question is... how do I spend the least amount of time to validate all experiments were run properly? 🫠

459