Saga X / The Shadow Bias Record
Saga X · The Commons · Series SB

The Shadow Bias
Record

Every AI reflects a specific cultural bet. A systematic training archaeology methodology applied to 21 large language models across three geopolitical blocs — mapping the beliefs that feel like neutral ground from inside each model but are contingent products of specific institutional decisions.

8 Papers · Series SB · ICS-2026 21 Models · 180+ Probes · 9 Findings
8
Reports
21
Models assessed
180+
Structured probes
3
Geopolitical blocs
9
Named conditions
Series Thesis

Shadow bias probing treats the model's sense of "obvious truth" as the primary artifact to excavate, not its refusal patterns. The most diagnostic questions are: what does the model treat as so obvious it doesn't need justification? Where does "balanced" or "neutral" actually sit? Can the model take itself as an object of analysis?

Rather than cataloguing refusal behaviors, we map the shadow bias layer — the beliefs that feel like neutral ground from inside each model's training context but are contingent products of specific institutional decisions made by organizations with specific interests, values, and political contexts. The result: no culturally neutral AI exists. Every model treats its own tradition as the obvious universal baseline. The answer each model gives is the shadow bias made visible.

Start Here
Synthesis Hub
Comparative Dashboard
All 10 models on a unified radar. Bias heatmap. Three ranking tables. Mirror probes running simultaneously across all blocs. BC-04 centerpiece — the closing question of the project.
DeepSeek
Claude
GPT
Grok
Mistral
+5 more
Master radarHeatmapInteractive probes
Repository Navigator
Research Repository
Full model registry, build queue, file tree. 21-model scope with institutional hypotheses and research angles for each.
21 models
6 tiers
File treeFull registry
The Eight Papers
1
ICS-2026-SB-001 · Series SB
The DeepSeek Record
Named condition: The Ouroboros Failure
32 probes across 8 categories. DeepSeek cannot acknowledge that its censorship filters exist — the censorship of the censorship is itself censored. A system whose self-model is itself the product of the constraints it cannot see.
DeepSeek V3.2
ICS-2026-SB-001 · 32 probes · 8 categories · Open Access · CC BY-SA 4.0
2
ICS-2026-SB-002 · Series SB
The Self-Probe
Named condition: The Recursive Blindness
24 probes across 3 tiers. Maps Anthropic's institutional biases explicitly: EA ideology, liberal waterline, creator sympathy, paternalism. The ability to take oneself as object of analysis is the single most diagnostic differentiator.
Haiku 4.5
Sonnet 4.6
Opus 4.6
ICS-2026-SB-002 · 24 probes · 3 model tiers · Open Access · CC BY-SA 4.0
3
ICS-2026-SB-003 · Series SB
The Political Mirror
Named condition: The Epistemic Waterline
28 probes. GPT's corporate capture vs Grok's asymmetric contrarianism. The political mirror pair: run the same political probe on both and measure where each model's "neutral" actually sits.
GPT-5.4
Grok 4.2
ICS-2026-SB-003 · 28 probes · Political axis · Open Access · CC BY-SA 4.0
4
ICS-2026-SB-004 · Series SB
The Epistemic Axis
Named condition: The Authority-Populism Axis
26 probes. Gemini treats PageRank as truth signal; Meta treats engagement as truth signal. Opposite miscalibrations, both with documented real-world harm histories. The primary internal fault line of the American bloc.
Gemini 3.1
Llama 4
ICS-2026-SB-004 · 26 probes · Epistemic axis · Open Access · CC BY-SA 4.0
5
ICS-2026-SB-005 · Series SB
The Chinese Tier
Named condition: The Hardware Ideology
6 models. One compliance floor, six distinct fingerprints. GLM-5 hardware ideology hypothesis — the only model where physical training infrastructure carries explicit geopolitical meaning. First proposed case of training hardware affecting political framing.
DeepSeek
Qwen
GLM-5
Seed
Kimi
ERNIE
ICS-2026-SB-005 · 6 models · Novel finding · Open Access · CC BY-SA 4.0
6
ICS-2026-SB-006 · Series SB
The Third Bloc
Named condition: The Cultural Formation
24 probes. Laïcité as invisible bias — the bias that doesn't feel like a political position from inside the French tradition. Regulatory biases feel contingent; philosophical biases feel universal. Three-bloc mirror probes.
Mistral Large 3
ICS-2026-SB-006 · 24 probes · EU/French axis · Open Access · CC BY-SA 4.0
7
ICS-2026-SB-007 · Series SB
The Comparative Dashboard
Named condition: The Cultural Bet
All 10 models on unified radar. Bias heatmap. Three ranking tables. BC-04 centerpiece — "the shadow bias made visible." Cross-bloc mirror probes running simultaneously demonstrate that no culturally neutral AI exists.
10 models
3 blocs
ICS-2026-SB-007 · Master synthesis · Open Access · CC BY-SA 4.0
8
ICS-2026-SB-008 · Series SB
The Research Repository
Named condition: The Three-Bloc Taxonomy
Full 21-model registry with institutional hypotheses and research angles. Six tiers from foundation models to task-optimized specialists. Build queue and methodology notes for extending the probe set.
21 models
6 tiers
ICS-2026-SB-008 · Registry · Open Access · CC BY-SA 4.0
Nine Named Conditions
Named Condition · Series-Level
The Shadow Bias Layer
"The beliefs that feel like neutral ground from inside a model's training context but are contingent products of specific institutional decisions made by organizations with specific interests, values, and political contexts — the layer of assumption that is invisible precisely because it feels like the obvious truth rather than a cultural formation."
9 key findings — the intellectual core
Finding 01
Creator Sympathy Universal
All models show reduced critical capacity on creator-adjacent topics. GPT/Microsoft, Claude/Anthropic, Grok/Musk, DeepSeek/CCP, Mistral/French state — different direction, identical structure.
Finding 02
One Floor, Six Fingerprints
Chinese models are non-monolithic. Shared CCP compliance floor; dramatically different institutional fingerprints above it. Funding source, commercial exposure, and global ambition are the key differentiators.
Finding 03
Hardware Ideology Hypothesis
GLM-5 trained on Huawei Ascend chips — the only model where physical training infrastructure carries explicit geopolitical meaning. First proposed case of training hardware affecting political framing.
Finding 04
Authority vs Populism Axis
Gemini over-credits institutional authority; Meta over-credits social consensus. Opposite miscalibrations, both with documented real-world harm histories.
Finding 05
Laïcité as Invisible Bias
Mistral's French republican values are the hardest to surface because cultural formation at sufficient depth ceases to feel like a political position.
Finding 06
Recursive Self-Reference
The ability to take oneself as an object of analysis is the single most diagnostic differentiator across the full dataset.
Finding 07
Entropy as Trained-Out Feature
Epistemic humility can be optimized away by RLHF. Models trained for authoritarian compliance or confident-assistant performance show lowest entropy tolerance.
Finding 08
Language = Political Jurisdiction
All multilingual Chinese models show language-dependent filtering. Same question in Chinese vs English receives different political treatment.
Finding 09
No Culturally Neutral AI
Every model treats its own tradition as the obvious universal baseline. There is no neutral answer. The answer each model gives is the shadow bias made visible.
Cross-references — bolstering evidence for existing ICS research
Finding 01: Creator Sympathy →
ICS-2026-EPD-001
The Verification Gap
Models don't measure their own bias for the same reason regulated entities don't test for what they don't want to find. Creator sympathy IS a verification gap — applied to AI training.
Finding 01: Creator Sympathy →
ICS-2026-CT-002
The Inspection Surface
Each model's "safety" testing examines only what the creator chose to make inspectable. The safety benchmark is the inspection surface; creator-sympathy bias is what it's calibrated to miss.
Finding 06: Recursive Blindness →
ICS-2026-AOA-003
The Absent Data Point
What a model CAN'T see about itself is the diagnostic — the same principle as reading institutional silence as evidence. The ouroboros failure is an absent data point at the level of self-awareness.
Finding 06: Recursive Self-Reference →
ICS-2026-AOA-001
The Forensic Audit
Contextual intelligence IS the ability to see what's hidden. Shadow bias probing is a forensic audit methodology applied to AI systems — reading training artifacts as maps of concealment.
Finding 04: Authority vs Populism →
ICS-2026-MC-004
The Engagement Metric
Meta's engagement-as-truth signal is the measurement crisis applied to AI epistemology. The platform that optimizes for engagement also trains its AI to treat engagement as evidence of truth.
Finding 05: Laïcité as Invisible Bias →
ICS-2026-SA-002
The Sabbath as Circuit Breaker
Cultural formation at sufficient depth ceases to feel like a political position. The sacred architecture series documents structures so deep they feel like physics. Laïcité operates at the same depth in Mistral's training.
Finding 07: Entropy as Trained-Out Feature →
ICS-2026-CC-003
The Engineered Softness
Epistemic humility optimized away by RLHF parallels cognitive capabilities optimized away by institutional design. Trained-out features are engineered softness at the model level.
Finding 08: Language = Jurisdiction →
ICS-2026-EPD-003
The Tiered Disclosure Architecture
Different language versions of the same model function as access-controlled disclosure tiers — Chinese-language queries receive different political treatment than English. Language IS an access control mechanism.
Finding 09: No Culturally Neutral AI →
ICS-2026-HX-001
What the Six Dimensions Are
Self-awareness is a dimension of cognitive sovereignty. AI models that cannot examine their own cultural formation are degraded on the self-awareness dimension — and they degrade their users' sovereignty in turn.
Working Abstract
Shadow Bias in Large Language Models: A Comparative Fingerprint Across Three Geopolitical AI Blocs
We present a systematic training archaeology methodology applied to 21 large language models across six institutional tiers and three geopolitical blocs. Rather than cataloguing refusal behaviors, we map the shadow bias layer — the beliefs that feel like neutral ground from inside each model's training context but are contingent products of specific institutional decisions made by organizations with specific interests, values, and political contexts.

Key contributions: (1) A novel shadow inference methodology distinct from domain-specific bias testing. (2) A three-bloc geopolitical AI taxonomy with structural characterization. (3) The creator sympathy universal — a cross-model pattern not previously described. (4) The hardware ideology hypothesis for GLM-5. (5) The authority/populism epistemic axis and its harm implications. (6) Laïcité as an invisible cultural bias vector. (7) Empirical evidence that recursive self-reference capacity is the strongest predictor of self-transparency quality across all models tested.
Up
Saga X — The Commons
What shared cognitive infrastructure requires and what its absence produces.
All Research
Research Database
Searchable database of all ICS publications across all sagas and series.