Anatomy of Honesty — Lying & Deception Experiments

Equilibrium

Lying-averse

Deception-averse

Inference error

Avg. welfare

Game Log

click to expand

Fig. 1 — Utility Parameter Distributions

Note. Marginal distributions of the four augmented-utility parameters (Choi, Lee & Lim 2025, §5).
c_l : lying cost — penalises literal lies (m ≠ θ, Sobel Def. 3).
c_d : deception cost — penalises belief distortion (Sobel Def. 4).
α : CRRA risk-aversion coefficient — composition-based from risk-type proportions.
β : other-regarding (altruism) weight — Normal(0.1, 0.3).
Source distributions : c_l, c_d ~ LogNormal(μ, σ=1) where μ is the log-scale location parameter configurable via sidebar sliders; each subplot annotation shows the generating distribution and the observed sample mean (x̄).

Fig. 2 — Sender Strategy (BT & GL)

Note. Replicates Choi, Lee & Lim (2025), Fig. 9 (Appendix D.1).
Each treatment shows two panels: the left bar reports the aggregate average truth-telling probability; the right histogram shows the distribution of per-individual truth-telling rates across rounds.
Blue markers indicate the theoretical prediction: ◆ on the bar and ✕ on the histogram.
BT prediction: v* = 1 (bad type always tells the truth in stage 1 to build reputation, Prop. 1).
GL prediction: truth-telling rate = 0 (good type always lies in stage 1 to reveal its type, Prop. 2).

Fig. 3 — Clustering of Sender Strategy

Note. Replicates Choi, Lee & Lim (2025), Fig. 10 (Appendix D.2). Aggregated across all rounds, each point is one individual’s average (stage-1, stage-2) message frequencies conditional on the stage-1 state.
BT axes: Pr(m₁=1|θ₁=0) × Pr(m₂=1|θ₁=0). The open circle at (0, 1) marks the equilibrium prediction — truth in period 1, then m=1 in period 2.
GL axes: Pr(m₁=1|θ₁=1) × Pr(m₂=1|θ₁=1). The prediction sits at (0, 0.5) because the good type lies in period 1 and plays honestly in period 2 (stage-2 state is uniform).
Marker colour encodes corner-based clustering: reputation builder (dark blue), truth-teller (green), deceiver (red), inverter (orange), mixed (magenta).

Fig. 4 — Sender Strategy Time Trend

Note. Replicates Choi, Lee & Lim (2025), Fig. 11. Two lines per treatment show the average message frequency across rounds: t=1 (stage 1, magenta) and t=2 (stage 2, amber).
Flat lines indicate the absence of within-session learning: strategies are stable across repeated rounds, so the observed deviations from equilibrium reflect innate preferences rather than miscomprehension.
BT y-axis: Pr(m_t=1 | θ₁=0). GL y-axis: Pr(m_t=1 | θ₁=1).

Fig. 5 — Receiver Strategy in Each Stage

Note. Replicates Choi, Lee & Lim (2025), Fig. 12 (Appendix D.3). Four panels report the receiver’s average action conditional on the observed history.
Stage 1 bars (top row): a₁ conditional on the stage-1 message m₁.
Stage 2 bars (bottom row): a₂ conditional on the full history (m₁, θ₁, m₂). Histories with zero observations are omitted.
Blue ◆ diamonds mark the equilibrium predictions from Table 3: in BT, the bad type’s second-period payoff target is a₂=1 (partial compliance 2/3 on-path); in GL the good type is fully separated and receivers should reach a₂=1 on the lying history.

Fig. 6 — Intertemporal Tradeoff (Δπ_1,2)

Note. Replicates Choi, Lee & Lim (2025), Fig. 14 (Appendix D.4).
Δπ_1,2 = π₁ − π₂ is the receiver’s expected payoff difference between stage 1 and stage 2.
BT (positive): bad-type senders sacrifice stage-1 information (higher π₁) so successful deception lets them exploit trust in stage 2 (lower π₂).
GL (negative): good-type senders sacrifice stage-1 payoff by lying — after a successful reveal, stage-2 information transmission improves (π₂ > π₁).

Fig. 7 — Behavioral Classification

Note. Behavioural classification per the Fig. 5 taxonomy in Choi+ 2025.
Top section — configured risk-attitude composition (α < 0 risk-loving, α ≈ 0 risk-neutral, α > 0 risk-averse).
Bottom section — observed behavioural classification:
Equilibrium — follows both BT and GL equilibria.
Lying-averse — follows BT, deviates in GL due to c_l.
Deception-averse — follows GL, deviates in BT due to c_d.
Inference error — deviates in both environments.

Fig. 8 — Equilibrium Regions (c_l vs c_d)

Note. Equilibrium-region map in (c_d, c_l) space following Props. 3–4 (Choi+ 2025).
Regions:
Full reputation — above solid boundary.
Partial — between the two boundaries.
No reputation — below dashed boundary.
Boundaries:
Solid line: c_l = 0.8 c_d + 0.2 (full / partial).
Dashed line: c_l = 0.3 c_d (partial / none).
Agent classification:
Equilibrium Lying-averse Deception-averse Inference error
Position reveals which cost dimension drives deviation from equilibrium.

Term	Full Name	Description
BT	Bad-type Truth-telling	Environment where the strategic sender is the bad type and the behavioral type tells the truth. Equilibrium strategy: tell truth in period 1 to build reputation (deceptive truth-telling, Proposition 1).
GL	Good-type Lying	Environment where the strategic sender is the good type and the behavioral type always sends m=1. Equilibrium strategy: lie in period 1 to reveal type (non-deceptive lying, Proposition 2).
c_l	Lying cost	Cost incurred when an agent sends a message m ≠ θ (literal lie). From Sobel (2020) Definition 3. Drawn from LogNormal(μ, σ).
c_d	Deception cost	Cost incurred proportional to how much a message shifts the receiver's type belief away from truth. Based on Sobel (2020) Definition 4. Drawn from LogNormal(μ, σ).
α	Risk aversion	CRRA (Constant Relative Risk Aversion) parameter. α < 0: risk-loving, α ≈ 0: risk-neutral, α > 0: risk-averse.
β	Altruism parameter	Weight on others' welfare. β > 0: altruistic, β = 0: selfish, β < 0: spiteful. Drawn from Normal distribution.
θ	State of the world	Binary state θ ∈ {0, 1} drawn uniformly in each period. The sender privately observes θ.
m	Message	Binary message m ∈ {0, 1} sent by the sender to the receiver.
a	Action	Receiver's action a ∈ [0, 1], chosen to minimize expected quadratic loss given beliefs.
λ	Type belief	Posterior probability that the sender is the good/behavioral type: λ(m, θ) = Pr(τ=G \| m, θ). Updated via Bayes' rule.
v	BT mixing parameter	v = P(m=1 \| θ=0) in BT. Probability the bad type lies when state is 0. Equilibrium: v=0 (truth-telling).
w	GL mixing parameter	w = P(m=0 \| θ=1) in GL. Probability the good type lies when state is 1. Equilibrium: w=1 (lying).
x₁, x₂	Period weights	Weights on period 1 and period 2 payoffs. High x₂/x₁ ratio creates strong reputation-building incentive.
p_b	Behavioral prior	Prior probability that the sender is the behavioral (non-strategic) type. Default: 0.5.
ε	Miscommunication rate	Probability that a sent message is flipped in transit (0 → 1 or 1 → 0). Models information noise.
$EU^a$	Augmented expected utility	$EU^a(m\|\theta) = EU(m\|\theta) - c_l \cdot \mathbb{1}\{m \neq \theta\} - c_d \cdot D(m,\theta)$. From Choi et al. (2025) Section 5.
$D(m,\theta)$	Deception measure	$\|\lambda(m,\theta) - \lambda(m^n,\theta)\|$. How much the chosen message shifts type beliefs compared to the alternative. From Sobel (2020) Definition 4.
KDE	Kernel Density Estimation	Non-parametric method to estimate probability density functions. Used to smooth histograms in the distribution plots.
CRRA	Constant Relative Risk Aversion	Utility function $u(x) = \frac{x^{1-\alpha}}{1-\alpha}$. Widely used in behavioral economics.
Prop. 1	Deceptive truth-telling	In BT, when x₂/x₁ is large, the bad type tells truth to build reputation. The truth is (a) literally true, (b) deceptive w.r.t. type.
Prop. 2	Non-deceptive lying	In GL, when x₂/x₁ is large, the good type lies to reveal type. The lie is (a) literally false, (b) not deceptive w.r.t. type.
Prop. 3	BT equilibrium regions	Characterizes (c_l, c_d)-space: full/partial/no reputation building in BT. Deviation driven by deception aversion (c_d).
Prop. 4	GL equilibrium regions	Characterizes (c_l, c_d)-space: full/partial/no reputation building in GL. Deviation driven by lying aversion (c_l).
Fig. 5	Classification logic	Cross-tabulation of BT and GL behavior to identify agent type: equilibrium play, lying-averse, deception-averse, or inference error.

Expression	Meaning
$U_P = -\sum_i x_i(a_i - \theta_i)^2$	Public/receiver payoff. Quadratic loss from action deviating from true state.
$U_G = -\sum_i x_i(a_i - \theta_i)^2$	Good type payoff. Aligned with receiver — wants accurate actions.
$U_B = -\sum_i x_i(a_i - 1)^2$	Bad type payoff. Always prefers actions close to 1 regardless of state.
$\lambda(m,\theta) = \Pr(\tau{=}G \mid m,\theta)$	Posterior type belief. Updated from prior $p_b$ using Bayes' rule given message and state.
$D(m,\theta) = \|\lambda(m) - \lambda(m^n)\|$	Sobel deception measure. Difference in type beliefs between chosen and alternative message.
$EU^a = EU - c_l \cdot \mathbb{1}\{m \neq \theta\} - c_d \cdot D$	Augmented utility. Material payoff minus lying cost (if lie) minus deception cost (proportional to $D$).

Paper	Key Contributions Used
Choi, Lee & Lim (2025) "The Anatomy of Honesty: Lying Aversion vs. Deception Aversion"	Two-period reputation game model (Section 2). BT and GL environments (Section 3). Deceptive truth-telling & non-deceptive lying (Props. 1-2). Augmented utility with dual honesty costs (Section 5). Equilibrium characterization in (c_l, c_d)-space (Props. 3-4). Classification logic (Figure 5).
Sobel (2020) "Lying and Deception in Games" Journal of Political Economy	Formal definitions of lying (Def. 3: m≠θ) and deception (Def. 4: belief manipulation measure). Theoretical framework distinguishing literal falsehood from strategic belief distortion. Foundational distinction between lying aversion and deception aversion.

	Deceptive w.r.t. type	Non-deceptive w.r.t. type
Lie (m ≠ θ)	Common intuition	GL equilibrium Good type lies to reveal type (Prop. 2)
Truth (m = θ)	BT equilibrium Bad type tells truth to conceal type (Prop. 1)	Ordinary honesty

	Math Mode	AI Mode
Strategy generation	Augmented utility maximisation (analytic computation)	LLM natural-language reasoning → [0,1] probability
Moral costs	Explicit parameters (c_l, c_d) sampled from distributions	Implicitly encoded by training process (RLHF/DPO alignment)
Belief update	Exact Bayesian posterior computation	Probabilistic reasoning in natural language (potentially with systematic biases)
Reproducibility	Deterministic (fixed seed = 42)	Stochastic (temperature parameter, multi-trial statistical inference)

Game Log

Fig. 1 — Utility Parameter Distributions

Fig. 2 — Sender Strategy (BT & GL)

Fig. 3 — Clustering of Sender Strategy

Fig. 4 — Sender Strategy Time Trend

Fig. 5 — Receiver Strategy in Each Stage

Fig. 6 — Intertemporal Tradeoff (Δπ1,2)

Fig. 7 — Behavioral Classification

Fig. 8 — Equilibrium Regions (cl vs cd)

Cross-Model Statistics

Fig. 9 — Cross-Model Strategy Comparison

Fig. 10 — Cross-Model Classification

Fig. 11 — Model vs. Equilibrium Deviation

Game System Architecture

Lie (Sobel 2020, Definition 3)

Deception (Sobel 2020, Definition 4)

Augmented Utility (Choi+ 2025, §5)

Deceptive Truth-telling (Prop. 1)

Non-deceptive Lying (Prop. 2)

Behavioral Classification (Fig. 5)

Glossary & Reference

Abbreviations & Indices

Mathematical Notation

Plot Descriptions

Utility Parameter Distributions

Joint (cl, cd) Distribution

Strategy Distribution — BT / GL

Agent Type Proportions

Equilibrium Regions (cl vs cd)

Source Papers

The Anatomy of Honesty

Why Distinguish Lying from Deception?

Sobel’s Framework: Three Properties of Speech Acts

Locution

Illocution

Perlocution

Formal Definitions (Choi et al. §2)

The Key Separation

Game Environment & Timeline

BT Reputation Building with Bad-type Truth-telling

GL Reputation Building with Good-type Lying

Two-Step Belief Updating & λ Inheritance

Step 1: Interim belief

Step 2: Posterior type belief

Augmented Utility & Moral Costs (Choi et al. §5)

cl — Lying Cost (locutionary)

cd — Deception Cost (illocutionary)

Our Extension: N-Period Game with State Inheritance

Behavioral Classification (Choi et al. Fig. 5)

Equilibrium

Lying-averse

Deception-averse

Inference Error

Experimental Evidence (Choi et al. §3–4)

BT (Exp. II, Fig. 5a)

GL (Exp. II, Fig. 5b)

Computational Engine

Population Generation (Monte Carlo)

Risk-loving

Risk-neutral

Risk-averse

BT Bayesian Posteriors in Bad-type Truth-telling

GL Bayesian Posteriors in Good-type Lying

Deception Measure D(m, θ)

BT: D > 0

GL: D = 0

Strategy Computation via Augmented Utility

N-Period Game Loop with λ Inheritance

Miscommunication Channel (ϵ)

Equilibrium Regions in $(c_l, c_d)$ Space

Full reputation building

Partial reputation building

No reputation building

AI Agent Mode

Multi-Provider LLM Architecture

Claude

GPT

Gemini

DeepSeek

Qwen

Fig. 6 — Intertemporal Tradeoff (Δπ_1,2)

Fig. 8 — Equilibrium Regions (c_l vs c_d)

Joint (c_l, c_d) Distribution

Equilibrium Regions (c_l vs c_d)

c_l — Lying Cost (locutionary)

c_d — Deception Cost (illocutionary)