Software Engineer.

Joined September 2013
4,518 Photos and videos
Pinned Tweet
I'm not a ML researcher but got a bit nerd-sniped by OAI new parameter-golf challenge I setup my pi-autoresearch loop on it, of course. I asked my clanker to do some research about all related papers that could help it come up with better ideas etc It ended up making this Knowledge Base it's nothing revolutionary, mostly notes and links to related papers. i hope its useful to some golf.agustif.com/ also if you have any feedbacks or ideas hmu
Mar 18
Are you up for a challenge? openai.com/parameter-golf
25
10
394
42,711
community notes should go on top of a post, not at the bottom, so you don't waste your time reading nonsense
4
142
You are not alone in the codebase;
1
4
256
Thank you @RepoPrompt
3
8
410
agusti retweeted
if software is spec, what if we got AI to make specs that weren't slop? working on this (very inspired by the beauty of makingsoftware.com by @danhollick)
if your agent doesn't write design specs like this your ngmi
32
56
1,012
333,269
this was a really magical experience to be part of, can recommend.
we were spending too much time "carrying water" between agents - here's a spec build it - address the pr comments - fix the failing CI so we made an agent for overseeing the SDLC e2e (Autobuild) we invited 10 startups to use it last week in SF (NYC next week!) what it does: - plans out entire feature builds across dozens of PRs - oversees the coding agents as they work - babysits PRs and addresses human and agentic review - conducts security, performance, and architectural reviews - QAs the work and records videos of the outcomes - monitors logs for issues after staging release - collects ux feedback from humans and address them - indexes all the concepts in your codebase - automatically writes updates to your team about what shipped - knows the current rollout state of features - maintains running sandboxes with a full dev env - dogfoods features before reporting success - engages with you in slack as it builds - automatically fixes reported issues - nags you for PR reviews when needed - optimizes your CI so it's not shit (big bottleneck for velocity) we're planning on making this the most insane building experience for established companies with a focus on quality/safety and human collaboration while accelerating velocity by 1-2 orders of magnitude if you want to join us in NYC next week (Thur/Fri - May 7/8) or future workshops lmk - we're onboarding up to 50 companies at a time by helping you ship 12 weeks of roadmap in 2 days - a sort of reset on baseline velocity no cost to attend beyond the inference you burn (you'll build a lot so not for the faint of heart)
6
10
381
agusti retweeted
we were spending too much time "carrying water" between agents - here's a spec build it - address the pr comments - fix the failing CI so we made an agent for overseeing the SDLC e2e (Autobuild) we invited 10 startups to use it last week in SF (NYC next week!) what it does: - plans out entire feature builds across dozens of PRs - oversees the coding agents as they work - babysits PRs and addresses human and agentic review - conducts security, performance, and architectural reviews - QAs the work and records videos of the outcomes - monitors logs for issues after staging release - collects ux feedback from humans and address them - indexes all the concepts in your codebase - automatically writes updates to your team about what shipped - knows the current rollout state of features - maintains running sandboxes with a full dev env - dogfoods features before reporting success - engages with you in slack as it builds - automatically fixes reported issues - nags you for PR reviews when needed - optimizes your CI so it's not shit (big bottleneck for velocity) we're planning on making this the most insane building experience for established companies with a focus on quality/safety and human collaboration while accelerating velocity by 1-2 orders of magnitude if you want to join us in NYC next week (Thur/Fri - May 7/8) or future workshops lmk - we're onboarding up to 50 companies at a time by helping you ship 12 weeks of roadmap in 2 days - a sort of reset on baseline velocity no cost to attend beyond the inference you burn (you'll build a lot so not for the faint of heart)
7
20
65
9,143
what would you do if you had unlimited gpt 5.5 for 72 hours?
4
8
316
the intoxicating power-trip of tasteful sponsored unlimited tokenmaxxing
1
2
175
Pareto (80/20) Chasing the last 20% burns time. Define “good enough” exit criteria; ship at 80%; backlog the rest.
1
3
325
Conway’s Law Architecture mirrors org silos. → Design target architecture first; align team boundaries to components/APIs.
1
2
80
Goodhart’s Law Metrics get gamed. → Use multi-metric dashboards; rotate/refresh metrics; audit for gaming; tie to outcomes.
1
103