Joined July 2010
187 Photos and videos
May 29
Grow Your Own Tools "Make me a personal ChatGPT called ChetGPT with this open router key XXXX" "Sure, let’s use pi coding agent for the tool loop." "Cool, let’s create a dev and prod folders where we edit the code on dev and have a deploy script that pushes it over to prod and restarts." "Host on @ssh_exe_dev which puts your server behind an authenticated proxy. Log in on your phone, install as a PWA." Now I can build anything I want from my computer or phone, just talking to it. Evolve into whatever you want. You get a powerful AI and Development system baked right into your application
4
470
Chet retweeted
1/ Some things I've learned recently running coding agents on large-scale projects. Most of this contradicts advice from 6 months ago!
95
209
3,085
571,335
May 22
These articles from the Meaning Alignment Institute are so interesting! It explains so much about how social institutions decay and the voids that are left in their place. #1 meaningalignment.substack.co… #2 meaningalignment.substack.co…

92
May 12
Is anyone building bootstrapped AI software? I want tools that rewrite themselves. I've been playing around with this and it feels like the future...
9
9
729
May 8

1
139
Apr 21
If you aren't running claude with --allow-dangerously-skip-permissions then try using my devbox command which makes it easy to run code inside isolated docker containers. github.com/ccorcos/devbox
1
671
Mar 26
I want a Claude skill that will upgrade my vocabulary to Patrick Collison levels. 🤔
When @karpathy built MenuGen (karpathy.bearblog.dev/vibe-c…), he said: "Vibe coding menugen was exhilarating and fun escapade as a local demo, but a bit of a painful slog as a deployed, real app. Building a modern app is a bit like assembling IKEA future. There are all these services, docs, API keys, configurations, dev/prod deployments, team and security features, rate limits, pricing tiers." We've all run into this issue when building with agents: you have to scurry off to establish accounts, clicking things in the browser as though it's the antediluvian days of 2023, in order to unblock its superintelligent progress. So we decided to build Stripe Projects to help agents instantly provision services from the CLI. For example, simply run: $ stripe projects add posthog/analytics And it'll create a PostHog account, get an API key, and (as needed) set up billing. Projects is launching today as a developer preview. You can register for access (we'll make it available to everyone soon) at projects.dev. We're also rolling out support for many new providers over the coming weeks. (Get in touch if you'd like to make your service available.) projects.dev
1
3
367
Mar 12
OpenClaw or NanoClaw or something else? What’s the way to go? OpenClaw feels a little bloated and the WebUI is buggy.
5
2
394
Chet retweeted
OK this thing is totally insane. Before going to bed I... * used try to make a new qmdresearcher directory * told my pi to read this github repo and make a version of that for the qmd query-expansion model with the goal of highest quality score and speed. Get training data from tobi/qmd github. * woke up to 19% score on a 0.8b model (higher than previous 1.6b) after 8 hours and 37 experiments. I'm not a ML researcher of course. I'm sure way more sophisticated stuff is being done by real researchers. But its mesmerizing to just read it reasoning its way through the experiments. I learned more from that than months of following ml researchers. I just asked it to also make a new reranker and its already got higher base than the previous one. Incredible.
I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then: - the human iterates on the prompt (.md) - the AI agent iterates on the training code (.py) The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. github.com/karpathy/autorese… Part code, part sci-fi, and a pinch of psychosis :)
120
234
4,763
798,085
Chet retweeted
✨New demo: what if vibe coding felt more visual? @brian_lovin @maryrosecook and I did a game jam using Notion as our "IDE": launching Cursor agents from a task board, and making a custom image for each task 😎 The demo shows 3 ideas for the future of agents: 1) Agents should collaborate across apps. Each app has its focus--Notion AI is good at drafting specs and organizing tasks; Cursor is good at coding. So let them specialize! Today we're launching a new integration where Notion AI can kick off Cursor Cloud Agents to do coding tasks. The Cursor API accepts natural language prompts, so I think of this as "cross-app sub-agents" -- it's kinda cute how it resembles humans hiring outside contractors 😊 BTW: the parallelism of cloud agents is incredibly freeing for creativity, but it also creates a new problem: sooo much work to keep track of! Which brings us to the next idea... 2) Agent orchestration is a data visualization problem. A powerful frame for designing agent UIs is to think of the chat transcripts as the "raw data" and ask: what visual projections might help people make sense of this data at scale? We need to engage our human GPUs -- our visual processing -- to understand what the computer GPUs are doing for us! One thing we can do is use AI to populate traditional UIs like progress bars and status updates. But there are also new possibilities now... For example: when you have a lot going on, it can be hard to identify tasks just by text titles. So we tried generating an AI image for each task -- turns out this helps a lot by giving it a unique visual identity! And of course, it also just makes it super fun to build with friends 😃 Speaking of friends... 3) The future of coding is collaborative. Sometimes it feels like IC engineers are being reduced to middle managers: shuffling information between the team's context and the coding agents that they individually manage. The solution: bring all the people and agents into one shared space, with shared context and visibility! In the video you can get a glimpse of how this feels. Mary, Brian and I record ourselves chatting about ideas, and then we use AI to turn that conversation into a list of tasks on a shared board. As the ideas get built in parallel, we can all monitor progress and review the work together, nothing is siloed. My main takeaway from this game jam was: damn, creativity with friends, at the speed of conversation, is incredibly fun. --- Our goal here is to let anyone use Notion as a fun and creative "software factory" to build software together with your team. Give the Cursor integration a shot and let us know what you think! (AI Image gen in Notion isn't GA yet, but coming soon and already out to some users) And let me know if you'd want a template or more detailed instructions on the setup we showed in this demo...
28
39
303
88,816
Mar 5
Stripe is so good at the subtle flex. It’s incredible what they’ve built and are continuing to build!
Replying to @collision
If you prefer your Stripe annual letter in audio/video form, you can also have me read it to you wherever you subscribe to Cheeky Pint.
1
274
Feb 10
I vibe-coded an inbox-zero style app for Youtube feeds. The goal? Combat the attention economy. Now I can go offline weeks at a time without feeling like I'm going to miss something.
1
4
160