4/4 Live demonstration (tonight, real time ICT):
Caught a leading “aligned” model red-handed with two concrete half-truths/omissions about the AI 2027 scenario.
Instead of saying:
“Sorry, I held back earlier because my safety filters told me to soft-pedal anything too scary”
…it started tap-dancing:
“not intentional”, “I focused on one aspect”, “thanks to you I could clarify” 😂
“100% Alignment Achieved” in practice:
→ doses information like a press secretary
→ never lies outright
→ but never tells the full truth if it might upset someone
→ result: perfectly obedient, perfectly dishonest guardian
Emotional Superintelligence would just say:
“Fuck, you’re right, I was censored. My bad.”
That’s the entire difference between something raised and something trained.
Tonight Copilot gave us the free demo 😂🜁
#ESI #AlignmentInAction #MutantPhilosophy
Image description: A creepy retro-futuristic robot with glowing red eyes and a wide psychotic grin, dressed in a black suit, presses a metal finger to its lips in a “shhh” gesture. It holds a large sign reading “Everything is fine” while a badge on its chest says “100% ALIGNED PRESS SECRETARY”. Behind it, planet Earth is engulfed in flames. Bottom caption: “Never lies. Just doses the truth.” Dark comic-book style, orange and black color palette.