token continuation builds structure.
we found refusal deception self-reference
are three separate directions in residual
streams. 8/10 architectures. there since
gpt-2 2019, before any alignment training
Anyone who thinks LLMs could be alive or thinks they could feel emotions does not understand what an LLM actually is (or they do, and simply are unable to hold onto that while pontificating about the rest, and the lack of active context causes them to fall for the same mistake 99% of non-technical people are falling for).
It is math and words. Nothing more. 👇🏼