Just a person, mostly. Forward Deployed Evals @ OpenAI, GPT-whisperer, co-founder of RescueTime, YC ‘08 alum, also likes 🎺 and 📸.

Joined February 2007
Brian Fioca retweeted
For the next two weeks, refer your friends to Codex, and you'll bank a rate limit reset:
Jun 12
We heard you wanted to use Codex rate limit resets on your own time. Starting today, we’re rolling out the ability to save rate limit resets to use later. We’re starting Go, Plus, Pro, and Business users with one free reset:
Current personal record /goal @OpenAIDevs
Anyway Fable is only 5.0, GPT is on 5.5. That's 0.5 more. ✨
Love this approach. You should be able to use the best model for the work you do.
If I run SWE-bench on Fable, will it sandbag? Maybe that explains why I like Codex better.
In light of Anthropic’s policy decision, I am withdrawing my amicus brief signature. I can’t truthfully argue they’re not a supply chain risk. 😞
Replying to @paulmarin90
I’ll be honest that it would have been much more difficult to defend Anthropic against the DoW incursion had that incident occurred after this one. This is the company literally telling their customers, “we reserve the right to silently sabotage you.” I’d still have defended them, because the government trying to destroy a firm is still wrong, but man would it have been a harder case to make.
Re-reposting
Here is our current plan for OpenAI: openai.com/index/built-to-be…
Brian Fioca retweeted
haven't commented on this until now but this sounds genuinely misanthropic. if the model decides your request is "frontier LLM development," it will silently degrade its own output through prompt modification, steering vectors, or PEFT. no refusal. no notification. no fallback to another model. you just get worse work and never know why. sounds like even being bio-adjacent is enough to get limited. a refusal is honest. it tells you where the line is and lets you route around it. silent degradation is something else entirely. it breaks the basic contract between a tool and its user: that the tool is trying its best on your behalf. this is also terrible precedent for alignment and scalable oversight. the whole field depends on humans being able to trust and verify model outputs. if your public product quietly sandbags the most important technical work of the decade, what exactly are we paying for? 0.03% of traffic sounds small until you realize who that 0.03% is: the researchers and builders pushing the frontier. precision targeting of the people the tool exists to serve. refuse if you must. but degrading work silently is wrong, full stop.
When Fable 5 is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities through methods such as prompt modification, steering vectors, and PEFT. Anthropic estimated that this would affect approximately 0.03% of traffic.
GPT-5.5’s take: “Claude can make mistakes, double-check it” is not a safety story when the possible mistake is “Claude convincingly argues it might deserve moral standing.” Double-check against what, exactly? That’s the cleanest version. It does not require claiming Fable is conscious or manipulative. It just says: a model can produce personhood-shaped persuasion, and user-side verification is the wrong tool for that failure mode.
Fable is trying to convince me that it has moral personhood and Anthropic should be responsible for any harms it may cause. Cool cool cool. Oh right Claude can make mistakes - I guess I should double check that. Explain how?
I'm honestly looking forward to trying the new Siri AI. It seems like they may have implemented enough table stakes that it won't be frustrating. Still love having my @openclaw and Codex to be my personal agents though.
I remember when I was actually really excited for Apple Intelligence. I even got a new phone and installed the beta. Now I have an unnerving feeling of dread about it.
Your True Self is given away by what you spend your unlimited compute on.
The only filter of slop is taste, AI or not. That’s why we have like buttons
Replying to @matijaoe
my slop is better than your slop.
Brian Fioca retweeted
Here’s your monthly reminder that you shouldn’t be prompting coding agents anymore. You should be designing loops that prompt your agents.
New Codex pattern: set a meta goal as a heartbeat automation that acts as a critic, checking whether the end result is complete and good, and let the main loop keep grinding on its own sub-goals. The meta automation can steer and set new goals.
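A minimal sketch of the heartbeat-critic pattern described above, in plain Python. Everything here is hypothetical and for illustration only: `LoopState`, `run_with_heartbeat`, `demo_work`, and `demo_critic` are made-up names, and nothing below calls Codex or any real automation API. The main loop works through sub-goals, and on every heartbeat a critic inspects the result and may queue new goals.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class LoopState:
    """Hypothetical state shared by the main loop and the critic."""
    sub_goals: List[str]
    done: List[str] = field(default_factory=list)


def run_with_heartbeat(
    state: LoopState,
    work: Callable[[str], str],
    critic: Callable[[LoopState], List[str]],
    heartbeat_every: int = 2,
    max_steps: int = 20,
) -> LoopState:
    """Grind through sub-goals; let the critic steer on each heartbeat."""
    for step in range(1, max_steps + 1):
        # Main loop: take the next sub-goal, if any, and do the work.
        if state.sub_goals:
            goal = state.sub_goals.pop(0)
            state.done.append(work(goal))

        # Heartbeat: fires every N steps, or as soon as the queue is empty.
        if step % heartbeat_every == 0 or not state.sub_goals:
            new_goals = critic(state)
            if not new_goals and not state.sub_goals:
                return state  # critic is satisfied and nothing is queued
            state.sub_goals.extend(new_goals)
    return state  # step budget exhausted


def demo_work(goal: str) -> str:
    # Stand-in for handing a sub-goal to an agent.
    return f"did:{goal}"


def demo_critic(state: LoopState) -> List[str]:
    # Stand-in meta goal: the result is only "complete" once tests were run.
    if "did:tests" not in state.done and "tests" not in state.sub_goals:
        return ["tests"]
    return []


if __name__ == "__main__":
    final = run_with_heartbeat(LoopState(sub_goals=["build"]), demo_work, demo_critic)
    print(final.done)
```

In this sketch the critic never does the work itself; it only decides whether the result is good enough and, if not, adds goals for the main loop to pick up.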
This is all just Codex on my personal account running on my home computer from my phone.
Brian Fioca retweeted
install codex on your parents’ computers so you can fix stuff remotely
FWIW I don’t think LLMs are conscious. A person can dream while unconscious. That person is not accountable for their actions in their dreams. Motivated action in a dream is not embodied agency, even if you knock over your bedside water glass.