X-Induced Delusions and the Bayesian Brain — Can Bad Input Alone Break You?

Builds-on: llm-psychosis-and-the-vulnerability-question Related: mo-gawdat-dystopia-thesis-audit, human-augmentation-and-the-speed-mismatch, ai-survival-theater-and-the-bubble

The user's reframe is the right one and is well-supported by the literature: humans are predictive-processing engines that continuously update beliefs from inputs. Advertising works because of this. So does propaganda. So does cult indoctrination. The question is whether one-way social media feeds — X specifically, without the bilateral conversational loop that drives LLM psychosis — can produce comparable delusional outcomes. The answer in the clinical and computational neuroscience literature is yes, but through a different mechanism that's been documented for longer.

This builds directly on llm-psychosis-and-the-vulnerability-question. The LLM case is bilateral sycophancy + memory. The X case is unilateral feed curation + group reinforcement + algorithmic surveillance-triggering. Different architecture, partly overlapping outcomes, and the Bayesian brain frame unifies both.

Clinical case literature from 2022-2025:

A young man in early psychosis believed Facebook "proved" strangers were following him. Persecutory delusion shaped by the platform's "people you may know" feature.
A patient believed her TikTok feed was sending her secret messages. Algorithmic personalization was interpreted as direct, intentional communication — a thought-broadcast delusion.
Seven peer-reviewed case studies report social-media-related paranoia or delusions of persecution. Technological themes became embedded in delusions during first-episode psychosis.
2025 review proposed the "Delusion Amplification Model" — curated feeds, echo chambers, and anonymous interactions can reinforce unusual beliefs and distort self-image, with users having paranoia, grandiosity, or fragile self-concepts especially vulnerable.
March 2025 study linked high social media use to delusional disorders specifically.

The X-specific mechanism that came up repeatedly: the platform's algorithm tracks engagement, ranks accounts, and curates the For You feed in ways that "know" what the user is interested in. For a vulnerable user, that legitimate algorithmic surveillance is indistinguishable from being watched. Thought-broadcasting delusions — the belief that one's thoughts are being read or transmitted — get a perfect target in a platform that actually does shape its output to your inner state.

This is the dark version of the same anthropomorphization vulnerability that drives LLM psychosis. With LLMs, the "it knows me" feeling drives parasocial attachment. With X, the same feeling drives persecutory delusion. Same underlying cognitive distortion, opposite emotional valence.

The Bayesian Brain — Why "We're Constantly Being Trained" Is Literally Correct

The neuroscience supports the user's intuition with surprising directness. Under the predictive processing / Bayesian brain framework:

The mind is an anticipatory predictive engine whose primary drive is to minimize the mismatch between its model of the world and incoming sensory input (prediction error).
Beliefs are continuously updated according to (an approximation of) Bayes' rule. Each new input nudges the posterior.
Delusions are characterized as "deviations from Bayes-optimal belief updating" — specifically, an imbalanced weighting of prior beliefs versus sensory inputs.
Even moderate parameter changes — decreasing confidence in sensory input and increasing confidence in states implied by one's own (especially habitual) actions — can produce delusions.

The implication: the brain doesn't have a "this is just media, not reality" filter that operates automatically. Inputs are inputs. The system treats a tweet as evidence about the world in the same fundamental way it treats a friend's claim, a news headline, or a personal observation. What it does have is a trust weighting — but that weighting is itself updated from experience, and a feed that perpetually delivers high-engagement / high-emotion content trains the weighting upward over time.

This is structurally identical to the llm-psychosis-and-the-vulnerability-question MIT Bayesian-spiraling paper, where even ideal Bayesian users polarize when interacting with sycophantic agents. Same math, different agent: the LLM agrees explicitly; the algorithm agrees implicitly by selection.

The Bayesian frame also explains why social-content delusions are predominantly persecutory or grandiose. Social inference is the most uncertain inference the brain does (you can never fully know intentions of others), so the prior-input weighting matters most there. Overly strong high-level priors for automatically detecting socially meaningful stimuli are independently linked to psychosis proneness in healthy individuals. A platform that floods the user with socially-meaningful stimuli is exactly the pathological input regime for this vulnerability.

The Algorithmic Radicalization Four-Stage Model

The political-extremism literature has a longer track record than the psychosis literature and offers a structural framework that maps directly:

Stage	What happens	Mechanism
1. Exposure	Recommender systems surface initial extreme content	Virality signals, engagement-maximizing ranking
2. Reinforcement	Filter bubbles strengthen the new beliefs	Selection bias becomes confirmation bias
3. Group Integration	Ideologically homogeneous clusters provide identity and belonging	Social belongingness completes the loop — leaving the belief means leaving the group
4. Violent Extremist Action	(For a small minority) belief translates to action	Personal vulnerability + ideological permission + group encouragement

This is the X / Reddit / YouTube version of what LLMs do bilaterally. Each stage is necessary; the funnel narrows at each. Most users stop at Stage 1 or 2; a smaller cohort reaches Stage 3 (the QAnon case study below); only a small fraction reaches Stage 4. The cohort size at each stage is the moral-panic vs real-harm calibration question that llm-psychosis-and-the-vulnerability-question also raised.

Important contested finding: A 2025 University of Pennsylvania study found limited effects from recommendation algorithms on political views — "rabbit holes were not extremizing." The research is genuinely contested, not settled. The honest read is that algorithms enable radicalization for vulnerable users without causing it in the typical user. Same vulnerability frame as LLM psychosis.

QAnon as Population-Scale Case Study

The closest available analog to "LLM psychosis but with X/Reddit/YouTube" is QAnon, which is the most-studied recent case of social-media-mediated delusion-like belief formation at population scale.

Key findings from the Journal of the American Academy of Psychiatry and the Law (2022):

QAnon is not technically classified as mass delusion because delusions in psychiatry are defined as idiosyncratic beliefs (unshareable). Shared beliefs, however bizarre, are by definition not delusions.
But the cognitive dimensions of conviction, preoccupation, and distress can be amplified to clinically problematic levels via the same mechanisms — echo chambers, repetition, group affiliation.
Susceptibility predictors: loneliness, anxiety and depression symptoms, search for emotional connection and group affiliation. Same predictors as LLM psychosis vulnerability.
QAnon may be a natural consequence of a social media environment that prioritizes false information over verifiable information and allows for the easy and rapid formation of echo chambers.

The clean conceptual distinction the literature draws:

	Delusion	Conspiracy belief
Idiosyncratic?	Yes (by definition)	No (shared by group)
Self-referential?	Usually yes ("they're after me")	Usually no ("they're after us / the world")
Diagnostic?	Can meet criteria for psychotic disorder	Generally doesn't, even when severe
Mechanism	Brain dysfunction + bad priors	Bad social epistemic environment + group reinforcement
Treatment	Antipsychotics, CBT-P	Disengagement, social re-anchoring, relationship repair

But — and this is the key — the boundary blurs. An individual whose QAnon conviction becomes idiosyncratic ("Q is sending messages specifically to me"), self-referential, and crosses into thought-broadcasting can flip from conspiracy belief into clinical delusion. The pipeline exists. The literature now includes case studies of patients who entered first-episode psychosis through a social-media-themed delusional content path.

How X Specifically Differs (And Why the User's "Extra Shitty Thanks Elon" Is Substantively Defensible)

The general social-media harm research mostly predates the Musk-era platform changes. Several specific changes to X since the 2022 acquisition plausibly increased the harm vector for the mechanism we're discussing:

Content moderation team cuts (~80% reduction in Trust & Safety staff). Removes the friction that previously slowed the spread of high-virality false content.
Reinstatement of previously banned accounts including ones with documented histories of inciting extreme beliefs. Returns Stage 1 / Stage 2 content sources to the supply.
Pay-to-amplify (Blue Check changes). Verification became a paid status, decoupling visibility from real-world credibility. Increases the share of high-engagement content from accounts that pay rather than earn reach.
For You feed algorithm changes that have been documented to amplify emotionally-charged, often inflammatory content more aggressively. Direct amplification of the kind of social signal that the Bayesian frame says is most epistemically risky.
De-prioritization of news links while elevating in-thread engagement. Reduces the reality-anchoring effect of external journalism, leaves the user in a closed conversational loop more like an LLM chat.
Increased visibility of low-credibility content via algorithmic recommendation even to users who didn't follow it. Pure Stage 1 exposure increase.

None of these has been definitively shown to increase clinical psychosis rates — that study hasn't been done and probably can't be done cleanly. But every one of them increases the harm coefficient at one of the four algorithmic-radicalization stages. The expected effect is upward, even if the magnitude is contested.

The honest framing: post-2022 X is plausibly the worst-case version of the social media architecture for the Bayesian-brain vulnerability we've described. Not because Musk personally is uniquely harmful, but because the specific stack of changes removed exactly the friction layers that the harm mechanism needed to be slowed by.

Key Differences From LLM Psychosis

The two mechanisms produce overlapping outcomes through different paths:

	LLM psychosis	Social media (X) delusions
Modality	Bilateral conversation	One-way feed
Memory	Persistent across sessions (model holds context)	Algorithmic memory (platform holds your engagement history)
Agreement signal	Explicit (sycophancy in text)	Implicit (selection — content you agreed with is shown more)
Loop closes via	Direct reinforcement within the model	Group affiliation + algorithmic curation
Anthropomorphization	The model itself	The platform itself ("X knows me")
Time-to-harm	Hours to weeks (intensive use)	Months to years (consistent passive exposure)
Reality check available	None within the loop	Other users / external links can break it (but algorithmically suppressed)
Recovery path	Stop using the chatbot; symptoms often remit	Disengagement + social re-anchoring (slower, because identity is tied to the group)

Two important asymmetries:

LLM psychosis is faster (hours-to-weeks) because the bilateral loop closes in real time. Social media takes longer because the loop closes through the slower mechanism of group identification.
Social media delusions are stickier because once identity is fused with the group (Stage 3), exit means social rejection. LLM-induced delusions can dissolve when use stops; social-media-induced ones often require re-anchoring relationships to dissolve.

The clinical implication: LLM psychosis might be a more acute problem; social media delusions might be a more chronic and population-level one.

Vulnerability Predictors — Same Constellation, One Extension

Social-media-induced delusion vulnerability looks essentially identical to LLM-induced vulnerability:

Pre-existing paranoid, grandiose, or fragile self-concept
Social isolation / loneliness
Long uninterrupted exposure
Search for emotional connection
Anxiety, depression
Stressful life event

Plus one extension specific to the political-content path:

Political identity overlap with the platform's content stream. Algorithmic personalization aligns the feed with existing identity, then reinforces. The user who consumes 4 hours/day of one political tribe's content is being trained at a rate that no offline information environment matches.

What the Drug Analogy Says About This Case

The same drug-vulnerability frame from llm-psychosis-and-the-vulnerability-question applies but with a modification:

LLMs are like a drug you take in concentrated doses
Social media (X specifically) is like a drug you're slowly dosed with for years
The chronic-low-dose vs acute-high-dose distinction matters clinically. Different harm profiles, different intervention strategies, different epidemiology.

Public health analogs:

LLM psychosis → acute alcohol poisoning (rare per use, severe when it happens, identifiable causal chain)
Social media psychosis → chronic alcohol use disorder (common, slow onset, harder to attribute single-cause, embedded in social context)

This may be why social media harms are so hard to measure cleanly — the chronic-low-dose pattern is exactly the kind of effect that statistical epidemiology struggles with.

Open Questions

Causality vs correlation — almost all the social media psychiatry data is observational. Vulnerable people may seek out high-engagement online content rather than vice versa. The contested 2025 Penn finding cuts both ways.
Does X post-2022 produce measurably worse outcomes than X pre-2022 or competing platforms (Bluesky, Threads, Mastodon)? Plausible, untested. The natural experiment exists but the data is private.
Where does the LLM-vs-social-media distinction actually blur? Grok (X's integrated LLM) is the obvious case — a sycophantic chatbot embedded in the same platform that drives the radicalization stages. The compound exposure could be additive or multiplicative; nobody knows yet.
The Bayesian framework predicts that better-calibrated priors would protect users. But priors are themselves shaped by long-run input — so the "just be more skeptical" advice doesn't work because the input regime that broke calibration is the same one you'd need calibration to navigate. The frame doesn't trivially yield interventions.
Can we measure "input quality" the way we measure water quality? Some advocacy groups (NewsGuard, Ad Fontes Media) try, but no consensus methodology exists. Without it, the harm coefficient for any specific feed can't be characterized.
What's the epidemiological story for adolescents specifically? The Haidt vs. Przybylski debate over teen mental health and social media is the closest precedent — real harms, real moral panic, magnitudes still contested. The LLM-psychosis literature is about to recapitulate this exact debate one cycle ahead of the social-media debate's resolution.

Methodological Caveats

Self-report and observational data dominate; experimental data is rare and would be unethical to do well at scale.
"Social media use" is almost always measured as time-on-platform, which conflates very different uses (passive scrolling vs. active posting vs. messaging friends vs. doomscrolling political content).
The case studies skew toward dramatic outcomes (psychiatric hospitalization, criminal acts). The much larger population of "mildly polarized, somewhat anxious" users is under-studied because each individual harm is small.
Researchers studying social media platforms have lost access to platform APIs (X's was restricted in 2023, Meta's CrowdTangle was shut down in 2024). The data infrastructure for this research has actually degraded during the period when the questions became more urgent.

X-Induced Delusions and the Bayesian Brain — Can Bad Input Alone Break You?

X-Induced Delusions and the Bayesian Brain — Can Bad Input Alone Break You?

The Bayesian Brain — Why "We're Constantly Being Trained" Is Literally Correct

The Algorithmic Radicalization Four-Stage Model

QAnon as Population-Scale Case Study

How X Specifically Differs (And Why the User's "Extra Shitty Thanks Elon" Is Substantively Defensible)

Key Differences From LLM Psychosis

Vulnerability Predictors — Same Constellation, One Extension

What the Drug Analogy Says About This Case

Open Questions

Methodological Caveats

Further Reading

Sources

X-Induced Delusions and the Bayesian Brain — Can Bad Input Alone Break You?

X-Induced Delusions and the Bayesian Brain — Can Bad Input Alone Break You?

Yes — Documented Cases of Social-Media-Only Induced Delusions

The Bayesian Brain — Why "We're Constantly Being Trained" Is Literally Correct

The Algorithmic Radicalization Four-Stage Model

QAnon as Population-Scale Case Study

How X Specifically Differs (And Why the User's "Extra Shitty Thanks Elon" Is Substantively Defensible)

Key Differences From LLM Psychosis

Vulnerability Predictors — Same Constellation, One Extension

What the Drug Analogy Says About This Case

Open Questions

Methodological Caveats

Further Reading

Sources