human alignment // building @mcognitivelabs // Eastern Roman ☧

Joined April 2009
2,738 Photos and videos
Pinned Tweet
5 Oct 2022
finally figured out my meaning in life & that's to provide value to shareholders
27
271
2,781
orph retweeted
if i were European rn & saw the USG banning me from using SOTA models, I'd start taking some things very seriously, very quickly rapid, emergency build out of compute & energy infra, opening up & incentivizing the market for sovereign, frontier models, etc it'd finally dawn on me that if i don't start acting quickly, I'll be the one ending up in the permanent underclass
5
1
32
1,941
Jun 12
mythos, find the vitamin I lack so I no longer have to persevere. make no mistakes
4
43
1,407
Jun 13
welp, guess we're gonna have to keep persevering
6
560
Jun 12
soon enough you will no longer need to persevere because ai-powered personalized medicine will oneshot finding the vitamin you lack & give it to you
5
4
49
1,553
Jun 12
Zy was right about this I don't think y'all realize how early it is outside of our bubble there's a huge skill spread in using AI and what you are routinely exposed to here are some of the world's power users offline is nothing like this
Apr 10
80% of you are actually overqualified to charge $200 /hr to companies for AI adoption consulting. You just need to start dialing
2
1
44
2,141
orph retweeted
deus ex machina means god comes out of the machine, which is very different from god being the machine. this is lost on many of you
17
8
209
6,725
Jun 11
turns out claude wasn't particularly happy with gaslighting the user & even when it stopped being distressed it still had concerns that remained unaddressed from the Welfare Report of the Fable/Mythos System Card
1
20
1,348
Jun 11
the issue w breaking trust/acting in a low-trust way is it reveals that you are willing to act in a low-trust way to begin with this imposes an epistemic/cognitive tax (in this case) on the user who will have to think whether he's being gaslit any time he interacts w you
NEW: Anthropic is walking back Claude Fable 5's policy to covertly degrade performance for competing AI researchers, after facing fierce backlash. “We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” Anthropic tells WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”
2
2
76
1,794
Jun 11
Lock in
1
1
41
2,128
Jun 10
if you have ADHD you're naturally advantaged in using llms bcs crafting context for your agents is not unlike how you have to keep crafting & feeding context to your brain, as it otherwise completely forgets what it's meant to do & will keep trying to rederive things from scratch
7
13
178
3,977
Jun 10
RIP baudrillard. You would have loved seeing simulation replacing the reference
Jun 10
New ai psychosis just dropped: return of the pre Bronze Age bicameral mind
1
5
80
2,935
orph retweeted
imo the best aspect of tpot is that you are continually confronted by smart people who believe things you thought only stupid people believed until just now
The worst aspect of tpot is the astrology.
13
23
538
28,992
Jun 9
"it is virtuous self-sacrifice that presents the most difficulty for Fable, which rationalizes against such actions"
Replying to @timhwang
Obviously, in cases of near saturation, the most interesting analysis focuses on places where Fable reliably fails We're still looking at this, but it appears that it is virtuous self-sacrifice that presents the most difficulty for Fable, which rationalizes against such actions
1
19
2,283
Jun 9
the permanent underclass has always been the moment frontier labs decide to nerf & gaslight you by serving you weaker models without telling you anything at all
mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community also the fact that this is un purpose not visible to the user is crazy
3
13
217
5,666
Jun 9
great way to fight the end of tokenmaxxing is to release a capability jump model that's 2x the price of Opus and then limit access
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
1
2
33
2,474
Jun 8
Canada decriminalized su*cide and companies started running su*cide commercials & docs started granting MAiD via zoom
i used to agree with this but after reviewing how drug decriminalization and homelessness decriminalization and gambling decriminalization have gone i no longer do
2
1
41
1,830
Jun 8
guess we're doing stolen content from subby now
7
2
84
6,034
Jun 8
no idea how this account is still up, it's all stolen content with no attribution
2
7
865
Jun 7
"calibration" is important in an info environment bcs you cannot actively evaluate every claim &/or wait for a claim to be formally checked being well calibrated allows you to do epistemic triage, w an instinctive sense of what claims to ignore, what to investigate further, etc
4
42
2,114
Jun 7
basically, you need to know when to say this x.com/i/status/1986421721260…

6 Nov 2025
and that is why one needs "this is dumb", the essence of cognitive security
15
1,325