This is the right framing: not “which model is good or bad,” but whether a model’s hidden policy priors are measurable and correctable.
ReAligned is interesting because it treats alignment drift as an engineering problem, not a culture-war slogan.
I created a training pipeline to remove propaganda and gaslighting from Chinese models!
I'm thrilled to announce LazarusAI's ReAligned-Qwen3.5 series of models, finetuned to reduce Chinese ideological bias and censorship, refusal behavior, and state-narrative framing
I use SFT GRPO pipeline with a dataset crafted to target the taxonomy of chinese censorship and bias, along with my ReAligned classifier model as a GRPO reward signal.