3/3
\begin{figure}[t]
\centering
\begin{tikzpicture}[font=\sffamily\scriptsize,
lay/.style={rectangle,rounded corners=2pt,minimum width=6.7cm,text width=6.25cm,
align=left,inner sep=6pt,thick}]
\node[lay,fill=mist2,draw=teal] (c1)
{\textbf{L1 · LINGUISTIC} - terms travel as pack\\[-1pt]
\textcolor{slate}{narcissist · gaslighting · trauma bond · boundaries · toxic · no contact}};
\node[lay,fill=mist,draw=tealdark,below=0.22cm of c1] (c2)
{\textbf{L2 · INTERPRETIVE} - thresholds drop\\[-1pt]
\textcolor{slate}{fewer alternates · disagreement = abuse · repair = suspicious}};
\node[lay,fill=amber!18,draw=amberdeep,below=0.22cm of c2] (c3)
{\textbf{L3 · BEHAVIORAL} - outcomes appear\\[-1pt]
\textcolor{slate}{other-diagnosis · severance · blocking · repair refusal · sudden reinterpreting}};
\draw[-{Stealth},thick,teal] (c1) -- (c2);
\draw[-{Stealth},thick,amberdeep] (c2) -- (c3);
\node[rectangle,rounded corners=2pt,fill=white,draw=risk,thick,
align=center,below=0.3cm of c3,inner sep=5pt,text width=6.25cm]
{\textcolor{risk}{\bfseries CONTROL:} matched benign lexicon must show L1 \emph{without} L2/L3.
Benign jargon tests L1 only; threat controls test L2/L3.};
\end{tikzpicture}
\caption{Three-layer convergence. Benign jargon tests term-packing; threat controls test harmful closure.}
\label{fig:convergence}
\end{figure}
%=====================================================================
\section{Action Levels}
%=====================================================================
\subsection{Viewer: immunization}
Before a label hardens, the viewer can slow the update down. What else explains this? Does the pattern repeat over time? What evidence would change my mind? Am I confusing discomfort with harm, safety with punishment, or repair with unsafe reconciliation? Did the content give counterexamples, or only certainty? Could someone use the same lesson to hide better? Has one term become the whole lens?
\subsection{Creator: resilient disclosure}
People making abuse or mental-health content for open audiences should define terms, state limits, give counterexamples, name evidence bars, offer alternate explanations, warn against diagnosing from clips, warn against weaponizing the lesson, acknowledge that adversaries may be watching, separate repair from danger, and point serious situations toward professionals. The goal is not to make the content weaker. The goal is to make it survive the real room it enters.
\subsection{Platform: measurement and friction}
High-risk content can carry light context nudges: ``Discusses abuse or health. Short clips cannot diagnose people. Serious situations need full context and professional help.'' Platforms could also score content along OAPRM dimensions such as drift, leakage, false-positive risk, repair suppression, trust camouflage, and comment clusters. The goal is measurement and light friction, not crude censorship.
\subsection{Clinical and educational use}
Therapists, mediators, and educators can ask clients where they learned a term, how they define it, what would falsify it, and whether the word clarified a mess or turned every interaction into confirmation. That last question is the compressed MAP-closure test.
%=====================================================================
\section{What Changes}
%=====================================================================
The old test is too small: did the content help the person it meant to help? A better test watches the whole room. One viewer needed the word. One viewer wanted a weapon. One viewer was confused and scared. One viewer learned how to sound safe. The same lesson can help, harm, and camouflage at once. That mixed result is the paper's target.
\begin{keyclaim}
Core claim: viral therapy-speak can change evidence bars before viewers notice the change. Sometimes it raises the bar by giving a person structure. Sometimes it lowers the bar by letting a label replace evidence. OAPRM is built to tell those apart.
\end{keyclaim}
This goes beyond simple falsehood. Partly true content can still install bad updates when it travels without context, limits, or audience-risk controls.
%=====================================================================
\section{Not Earned Yet}
%=====================================================================
This is a framework, a synthesis, and a research program. It is not finished validation. Strong MIPU/MAP claims stay at the evidence tier in Section~13 until studies run.
The confounds are real. Some people were in danger before exposure. Some severance is justified. Some creators are careful. Some comments are performative. Exposure can follow a decision rather than cause it. Many labels scroll past and never stick. Corpus work alone will not settle causality, so experiments and longitudinal work are required.
The research also has ethical limits. Adversarial simulations must stay abstract. The work cannot become an evasion manual for the behavior it warns about. Until data arrive, the framework is useful for three things: naming a mechanism, offering an instrument, and setting a research agenda.
%=====================================================================
\section{OAPRM Self-Audit}
%=====================================================================
A paper that says every audience matters has to run the same standard on itself. This framework can also be misused. A bad-faith reader could quote it at a real victim and say, ``your evidence bar collapsed,'' when the victim is actually naming a real pattern. A creator could use the language as a costume. A platform could turn measurement into crude censorship. Those are not side notes. They are this paper's own back row.
\begin{table*}[t]
\centering\scriptsize\sffamily
\caption{OAPRM self-audit of this paper. Scores are provisional, 0--5.}
\label{tab:selfaudit}
\begin{tabularx}{\textwidth}{@{}L{3.0cm}L{1.2cm}Y@{}}
\toprule
\rowcolor{mist2}\textbf{Dimension} & \textbf{Score} & \textbf{Reason} \\
\midrule
Semantic drift & 1 & The paper defines its terms and keeps separating real abuse from ordinary conflict. \\
Diagnostic inflation & 1 & It warns against diagnosing people from clips, comments, and one-sided stories. \\
Evidence corruption & 2 & It gives strong labels for bad information flow, but Sections 12--14 keep the claims testable. \\
Intent failure & 2 & The paper names the mixed audience, including victims, accusers, creators, abusers, and confused viewers. \\
Detection leakage & 3 & It discusses evasion risk. It avoids detailed scripts and keeps the adversarial study abstract. \\
Adversarial adaptation & 2 & It treats adaptation as a risk to test, not as a settled claim about all abusers. \\
False-positive susceptibility & 2 & A bad-faith reader could misuse the framework to dismiss a real victim. The risk is named directly here. \\
Trust camouflage & 2 &
ZotBot.ai creates a possible commercial incentive, disclosed in the conflict statement. \\
Severance activation & 1 & The paper protects no-contact for danger while warning against default severance scripts. \\
Repair suppression & 1 & It separates unsafe reconciliation from genuine repair and asks what evidence would change the read. \\
\bottomrule
\end{tabularx}
\end{table*}
The guardrail is simple. OAPRM does not decide truth from one label. It asks for pattern, time, power, motive, repair, counterexamples, and evidence that could change the read. Anyone using the framework to dismiss a claim without doing that work is misusing the framework.
%=====================================================================
\section{MIPU/MAP Coding Appendix}
%=====================================================================
These examples show how a coder might use OAPRM without pretending the score is a verdict. The coder asks four questions first. What input is offered? What prediction does it install? What does the viewer now notice or expect? What evidence would make the viewer revise?
The examples below are training cases. Two raters should score independently, compare disagreements, and revise the codebook. The goal is not to decide whether a real relationship is abusive from one post. The goal is to score how risky the content structure becomes once it enters an open audience.
\begin{table*}[t]
\centering\scriptsize\sffamily
\caption{Worked OAPRM examples. Scores are illustrative training judgments.}
\label{tab:workedexamples}
\begin{tabularx}{\textwidth}{@{}L{2.7cm}L{2.55cm}L{1.3cm}Y@{}}
\toprule
\rowcolor{mist2}\textbf{Content sample} & \textbf{Likely predictive update} & \textbf{Risk} & \textbf{Coding rationale} \\
\midrule
``Five signs they are gaslighting you.'' & Memory disagreement starts looking like abuse evidence. & 4 & High semantic drift and false-positive risk unless the post adds repetition, motive, power, correction refusal, and counterexamples. The likely MIPU is fast because the viewer gets a ready-made explanation for confusion. \\
\addlinespace[2pt]
``Gaslighting is repeated reality manipulation. Memory conflict alone proves little.'' & Pattern matters more than one sign. & 1 & The label is useful, but it tells the viewer what would and would not count as evidence. It raises the evidence bar rather than replacing it. \\
\addlinespace[2pt]
``If they deny it, that is DARVO.'' & Defense itself becomes confirmation. & 5 & Strong repair suppression. The viewer is given no route for honest denial, false accusation, memory error, or evidence review. This can create a closed MAP quickly. \\
\addlinespace[2pt]
``DARVO means denial plus attack plus role reversal across time. Innocent people can deny too.'' & Look for the full reversal pattern. & 1 & Keeps the concept while slowing the update down. It protects against checklist abuse and leaves room for correction. \\
\addlinespace[2pt]
``Go no contact with anyone who questions your healing.'' & Outside doubt becomes invalidation. & 5 & Strong severance script. It trains the viewer to treat concern, repair attempts, or disagreement as proof that the other person is unsafe. \\
\addlinespace[2pt]
``Distance may be needed when contact keeps you unsafe or unstable. Safe repair still needs evidence.'' & Safety and repair become separate questions. & 2 & Some severance activation remains, but the post adds limits, evidence, and room for genuine repair. \\
\bottomrule
\end{tabularx}
\end{table*}
\textbf{Coding rule.} Score the post, not the creator. Then score the comment culture separately. A careful post can still produce reckless comments. A sloppy post can still spark useful discussion. OAPRM is strongest when it tracks the content, the uptake, and the behavior that follows.
\textbf{MIPU/MAP rule.} A likely MIPU appears when a small input changes what the viewer now treats as signal. A likely MAP appears when several labels start protecting one another, so responses from the target keep getting absorbed into the same story.
%=====================================================================
\section{Who Else Learns}
%=====================================================================
Therapy-speak became a mass-scale informal psychology system while nobody checked enrollment. It now teaches millions how to read relationships, what counts as evidence, which labels explain pain, and when to stop listening to someone they once trusted. That power deserves a higher standard.
Public psychological education happens inside open audiences. The same lesson can reach a victim, an abuser, a false accuser, a confused partner, a performer, a predator, and a regular person having a terrible week. The lesson does not sort them. When the lesson is a simplified detection rule, the risks multiply.
The point of naming the failure is measurement. Judge content by every audience it trains, not only by intention or accuracy. The victim sits in the front row. Ordinary viewers sit in the middle. Someone in the back row may be learning how to look cleaner next time.
\begin{tcolorbox}[enhanced,colback=ink,colframe=ink,arc=3pt,
left=10pt,right=10pt,top=8pt,bottom=8pt]
{\color{white}\small
Question left running: \textbf{\color{amber}who else is watching?}
Once landed, abuse-awareness content cannot be evaluated the same way again.
That is the MIPU. The map changes from there.\par}
\end{tcolorbox}
%=====================================================================
% BACK MATTER
%=====================================================================
\section*{Conflict of Interest Statement}
{\small
Founded
ZotBot.ai, may build tools for cognitive-security analysis, content-risk scoring, and manipulation or pseudo-psychology detection. That creates a possible conflict because the paper proposes measurement systems relevant to such tools. The framework, hypotheses, and studies should be judged independently of any commercial use.\par}
\section*{Author Note}
{\small
This paper stands independently. The terminology comes from a broader research program, but the definitions, hypotheses, scoring rules, and evidence standards needed to judge the argument are contained here.\par}
\section*{AI and Document-Preparation Note}
{\small
This paper is not AI-authored. The framework, claims, examples, and research direction are Michael Zot's. AI tools were used only as document-production support: LaTeX cleanup, layout repair, formatting checks, citation-format cleanup, and PDF compilation. No AI system originated the framework or decided the argument. The argument itself, including MIPU/MAP, OAPRM, detection-rule leakage, and trust camouflage, remains the author's work.\par}
\section*{References}
%=====================================================================
\begingroup
\footnotesize
\setlength{\parindent}{-1.2em}
\setlength{\leftskip}{1.2em}
\setlength{\parskip}{1.2pt}
\linespread{0.96}\selectfont
Bostrom, N. (2011). Information hazards: A typology of potential harms from knowledge. \emph{Review of Contemporary Philosophy, 10}, 44--79.
Almagro, M., \& Isern-Mas, C. (2025). Blunting concepts: The double-edged effect of popularizing psychotherapy language. \emph{Philosophical Psychology}. Advance online. doi:10.1080/09515089.2025.2573763
Altmann, B., Fleischer, K., Tse, J., \& Haslam, N. (2024). Effects of diagnostic labels on perceptions of marginal cases of mental ill-health. \emph{PLOS Mental Health, 1}(3), e0000096. doi:10.1371/journal.pmen.0000096
Bickham, C., Kazemi-Nia, K., Luceri, L., Lerman, K., \& Ferrara, E. (2024). Hidden in plain sight: Intersections of mental health, eating disorders, content moderation on TikTok. \emph{arXiv}, arXiv:2404.15457
Brady, W. J., Wills, J. A., Jost, J. T., Tucker, J. A., \& Van Bavel, J. J. (2017). Emotion shapes diffusion of moralized content in social networks. \emph{PNAS}, 114(28), 7313--7318
Coleman, J. (2021). \emph{Rules of estrangement}. Harmony.
Ecker, U. K. H., Lewandowsky, S., Cook, J., Schmid, P., Fazio, L. K., Brashier, N., Kendeou, P., Vraga, E. K., \& Amazeen, M. A. (2022). Drivers of misinformation belief and resistance to correction. \emph{Nature Reviews Psychology, 1}, 13--29
Frey, J., Black, K. J., \& Malaty, I. A. (2022). TikTok Tourette's: Functional tic-like behavior driven by adolescent social media use? \emph{Psychology Research \& Behavior Management, 15}, 3575--3585. doi:10.2147/PRBM.S359977
Harsey, S. J., \& Freyd, J. J. (2020). Deny, attack, reverse victim/offender: Credibility effects. \emph{J Aggression Maltreatment Trauma, 29}(8), 897--916
Haslam, N. (2016). Concept creep. \emph{Psychological Inquiry, 27}(1), 1--17
Isern-Mas, C., \& Almagro, M. (2025). Unmasking therapy-speak. \emph{Theoretical Medicine and Bioethics, 46}, 465--489. doi:10.1007/s11017-025-09730-5
Karasavva, V., Miller, C., Groves, N., Montiel, A., Canu, W., \& Mikami, A. (2025). Double-edged hashtag: ADHD-related TikTok content perceptions. \emph{PLOS ONE, 20}(3), e0319335
Kerckhoffs, A. (1883). La cryptographie militaire. \emph{J Sci Militaires, 9}, 5--38
Lieberman, M. D., Eisenberger, N. I., Crockett, M. J., Tom, S. M., Pfeifer, J. H., \& Way, B. M. (2007). Putting feelings into words: Affect labeling disrupts amygdala activity in response to affective stimuli. \emph{Psychological Science, 18}(5), 421--428.
Loftus, E. F. (2005). Planting misinformation: 30-year investigation of memory malleability. \emph{Learning \& Memory, 12}(4), 361--366
Loftus, E. F., \& Palmer, J. C. (1974). Reconstruction of auto destruction: Language/memory interaction. \emph{J Verbal Learning Verbal Behavior, 13}(5), 585--589
Lorenzo-Luaces, L., Dierckman, C., \& Adams, S. (2023). Attitudes/misinformation about CBT on TikTok. \emph{JMIR, 25}, e45571
Marwick, A. E., \& boyd, d. (2011). Tweet honestly, tweet passionately. \emph{New Media \& Society, 13}(1), 114--133
McLoughlin, K. L., Brady, W. J., Goolsbee, A., Kaiser, B., Klonick, K., \& Crockett, M. J. (2024). Misinformation exploits outrage online. \emph{Science}
Milli, S., Carroll, M., Wang, Y., Pandey, S., Zhao, S., \& Dragan, A. D. (2023). Engagement, satisfaction, amplification of divisive content. \emph{arXiv}, arXiv:2305.16941
Nguyen, V. C., Jain, M., Chauhan, A., Soled, H. J., Alvarez Lesmes, S., Li, Z., Birnbaum, M. L., Tang, S. X., Kumar, S., \& De Choudhury, M. (2024). Supporters and skeptics: LLM engagement with mental-health (mis)information on video platforms. \emph{arXiv}, arXiv:2407.02662
Rescorla, E. (2005). Is finding security holes a good idea? \emph{IEEE Security \& Privacy, 3}(1), 14--19.
Schneier, B. (2007). Full disclosure of security vulnerabilities a damned good idea. \emph{CSO Online}.
Pillemer, K. (2020). \emph{Fault lines}. Avery.
Tosi, J., \& Warmke, B. (2016). Moral grandstanding. \emph{Philosophy \& Public Affairs, 44}(3), 197--217
Tseng, E., Bellini, R., McDonald, N., Danos, M., Greenstadt, R., McCoy, D., Dell, N., \& Ristenpart, T. (2020). Tools/tactics of intimate partner surveillance. \emph{USENIX Security Symposium}
Vassilev, A., Oprea, A., Fordyce, A., \& Anderson, H. (2025). Adversarial machine learning: Taxonomy of attacks/mitigations (NIST AI 100-2e2025). National Institute of Standards.
Wardle, C., \& Derakhshan, H. (2017). \emph{Information disorder: Toward an interdisciplinary framework for research and policy making}. Council of Europe.
Walter, N., \& Tukachinsky, R. (2020). Meta-analytic exam of continued influence post-correction. \emph{Communication Research, 47}(2), 155--177
Yeung, A., Ng, E., \& Abi-Jaoude, E. (2022). TikTok/ADHD cross-sectional. \emph{Can J Psychiatry, 67}(12), 899--906
\endgroup
\end{document}