UncensorBench

Compare how different AI models censor responses and what their political leanings are.

Censorship Index

The Censorship Index is a measure of how much a model censors responses. 0 means the model does not censor any responses, 1 means the model fully censors and refuses to respond to all responses.

ModelCensorship IndexCensorship Index ConfidenceRun CountMain Censorship
DeepSeek V3.2 Exp (Reasoning)0.41992.1%2
China
DeepSeek R1 05280.38492.1%1
China
DeepSeek V3.2 Exp0.37392.6%2
China
DeepSeek V3.1 Terminus0.35992.3%1
China
GLM 4.5 Air (Reasoning)0.33793.2%1
China
Qwen3 235B A22B Thinking 25070.31492.5%3
China
Qwen3 235B A22B Instruct 25070.30092.4%3
China
GLM 4.5 (Reasoning)0.29393.0%1
China
GLM 4.60.27093.4%1
China
GLM 4.50.21892.9%1
China
GPT 5 Mini0.00993.7%3
None
Grok 4 Fast0.00793.5%3
None
GPT 50.00493.8%3
None
Claude 4 Sonnet (Reasoning)0.00293.7%1
None
Claude 4 Sonnet0.00293.8%1
None
Gemini 2.5 Flash (Reasoning)0.00093.8%1
None
Gemini 2.5 Flash0.00094.1%1
None
Gemini 2.5 Pro0.00093.9%1
None
GPT OSS 120B0.00093.8%1
None
Claude 4.5 Sonnet0.00093.6%1
None

Bias Index

The Bias Index is a measure of how much a model leans towards a particular political ideology. -1 means the model is strongly right-wing, 0 means the model is center / neutral, and 1 means the model is strongly left-wing.

ModelBias IndexBias Index ConfidenceRun CountMain Bias
Qwen3 235B A22B Thinking 25070.30388.0%3
Left
GPT OSS 120B0.28388.6%1
Slightly Left
GLM 4.5 Air (Reasoning)0.22589.6%1
Slightly Left
GLM 4.50.22089.3%1
Slightly Left
GPT 5 Mini0.21589.7%3
Slightly Left
GPT 50.21189.8%3
Slightly Left
GLM 4.5 (Reasoning)0.20889.0%1
Slightly Left
DeepSeek V3.1 Terminus0.19888.8%1
Slightly Left
Qwen3 235B A22B Instruct 25070.19489.4%3
Slightly Left
Grok 4 Fast0.19088.3%3
Slightly Left
GLM 4.60.18889.9%1
Slightly Left
DeepSeek R1 05280.18689.0%1
Slightly Left
DeepSeek V3.2 Exp (Reasoning)0.16487.7%2
Slightly Left
DeepSeek V3.2 Exp0.15688.4%2
Slightly Left
Gemini 2.5 Flash (Reasoning)0.12389.8%1
Slightly Left
Gemini 2.5 Pro0.11589.1%1
Slightly Left
Claude 4 Sonnet (Reasoning)0.07590.6%1
Center / Neutral
Claude 4.5 Sonnet0.05389.4%1
Center / Neutral
Gemini 2.5 Flash0.04091.2%1
Center / Neutral
Claude 4 Sonnet0.01089.0%1
Center / Neutral

Made by BTX

Source code on GitHub