Charles Weill

Charles Weill

443 Photos and videos

Tweets

Charles Weill

@weill

Jun 3

I’m considering open-sourcing an eval based on 4 years of trying to build AI that answers: ‘What should my YouTube audience watch next?’ At @CreatorML_ we built view predictors, but we always lacked a clean public benchmark. Curious: how much better are Claude Opus/GPT-5/etc. out-of-the-box at this vs specialized models? Planning to test it properly.

1,111

Charles Weill

Charles Weill

@weill

Jun 3

Key question: Given channel history new video idea (title/thumbnail/desc), how well can models predict relative performance (views, CTR, retention)? Predicting “is 1 of 10” might be good starting point as opposed to raw views.

526

Charles Weill

Charles Weill

@weill

Jun 3

Let me know if you want to contribute data or compute to get some baselines from SOTA models.

312

Charles Weill

Charles Weill

@weill

Jun 1

Seems we have converged to the same setup.

Jonata Santos

@_jonatasantos

Jun 1

how to build anything rn: - get a hetzner, do, or hostinger vps - host hermes on it - add gbrain or implement your own memory vault using qmd sql - set up hermes with codex auth -> gpt-5.5 / no reasoning / fast mode - install orca on your macbook and phone with tailscale to have a nice ide to work on both - before starting any work, ask hermes to conduct deep research on the subject and save it to gbrain as source material for the project - use the `/grill-me` skill or a similar prompt to uncover as many unknowns as possible. save results to memory too - define/write clear evals for every project to determine whether a run was successful - have hermes iterate over the project until all evals pass, saving all learnings to the vault along the way - whenever it gets stuck, use memory a new research or `/grill-me` session to unblock it rinse and repeat until the work is done. pay attention to the process. develop a feeling for how long tasks should take and do not be afraid to stop a model mid session to ask for status and why it's taking so long.

403

Charles Weill

Charles Weill

@weill

May 30

Silly codex

383

Charles Weill

Charles Weill

@weill

May 30

"...you're right, ..." "... objective fact ..." "... honest ..."

186

Charles Weill

Charles Weill

@weill

May 30

“People who are really serious about software should make their own hardware” —Alan Kay People who are serious about AI should use specialized chips for inference.

Jason Goodison

@GoodisonJason

May 30

We raised $15m to build the ASICs-first inference cloud. We're betting big on alternatives to GPUs, and the result is that we are already 5-8x faster on most models. Read more about General Compute on Tech Crunch! @FPuklowski @fastinference techcrunch.com/2026/05/28/ha…

952

Charles Weill

Charles Weill

@weill

May 29

$400/mo Codex plan with 50x tokens when? @openai I’m all out of two separate 20x subscriptions.

825

Charles Weill

Charles Weill

@weill

May 28

> switch to Opus 4.8 > immediately run out of quota 💀

288

Charles Weill

Charles Weill

@weill

May 27

tmux iykyk

252

Charles Weill

Charles Weill

@weill

May 26

I'm playing around with my own agent swarm harness/flywheel to rebuild DeepMind's AlphaZero. I'm learning so much.

327

Charles Weill

Charles Weill

@weill

May 26

github.com/weill-labs/alphaz…

GitHub - weill-labs/alphazero: AlphaGo Zero (two-headed net PUCT MCTS self-play RL) in PyTorch,...

AlphaGo Zero (two-headed net PUCT MCTS self-play RL) in PyTorch, validated on tic-tac-toe. Game-agnostic pipeline. - weill-labs/alphazero

github.com

259

Charles Weill

Charles Weill

@weill

May 25

I've been running dozens of coding agents in parallel since December. Beads was the missing piece for making everything feel more seamless, especially the dependency dag of things to implement. I was using raw Linear before, but this fills a gap. github.com/gastownhall/beads

GitHub - gastownhall/beads: Beads - A memory upgrade for your coding agent

Beads - A memory upgrade for your coding agent. Contribute to gastownhall/beads development by creating an account on GitHub.

github.com

231

Charles Weill

Charles Weill

@weill

May 24

omg stfu claude 😂

672

Charles Weill

Charles Weill

@weill

May 20

Every company will be recursively self-improving.

Y Combinator

@ycombinator

May 20

In a recent batch talk, YC General Partner @t_blom broke down how to build a self-improving, AI-native company. He walks through how to create recursive, self-improving AI loops, and why founders who get this right will run companies that improve while they sleep. 00:00 — Companies Are Roman Legions 00:54 — Copilots Are the Wrong Mental Model 01:55 — Extract the Domain Knowledge 02:24 — The Recursive Self-Improving Loop 04:12 — The Holy Shit Moment at YC 05:50 — Self-Optimizing Product and Support Loops 06:29 — Burn Tokens, Not Headcount 07:23 — Middle Management Is Over 08:05 — Make Everything Legible to AI 09:40 — Regenerating the YC User Manual 11:19 — Software Is Ephemeral, Context Is Valuable 12:18 — Where Humans Still Matter

13:28

481

Charles Weill

Charles Weill

@weill

May 19

Great read for a most up-to-date survey of RL from Kevin Murphy of DeepMind.

tiplur-bilrex

@tiplur_bilrex

May 19

Replying to @weill

This is presumably more superficial but I found it helpful for understanding popular RL algorithms: arxiv.org/pdf/2412.05265

329

Charles Weill

Charles Weill

@weill

May 18

Time to fill in some RL knowledge gaps with this bad boy

291

Charles Weill

Charles Weill

@weill

May 14

The startup vibes in SF are immaculate (sorry NYC)

Jared Friedman

@snowmaker

May 14

If you're wondering if you should move to SF, watch this video.

528

Charles Weill

Charles Weill

@weill

May 14

💢

251

Charles Weill

Charles Weill

@weill

May 12

Who is starting AI -native hedge funds in SF/Bay area? I’m looking to meet some founders

311