Dataset: UEyes (Jiang et al., CHI 2023) — 495 web page screenshots, 62 participants Method: Zero-shot saliency prediction from Cognitive Optimal Transport theory Date: April 2026 (updated April 19, 2026) Status: Internal — pre-submission (CHI 2027)

Results

Table 1: Saliency Prediction Accuracy (N=495 web pages)

Method	AUC-Judd	NSS	CC
COT (neurotypical)	0.663	0.544	0.216
COT (color-blind)	0.660	0.536	0.212
COT (low vision)	0.660	0.489	0.191
COT (dyslexic)	0.659	0.531	0.210
COT (elderly)	0.659	0.521	0.205
COT (ADHD)	0.605	0.355	0.148
COTv (neurotypical)	0.663	0.544	0.216
COTv (color-blind)	0.660	0.538	0.213
COTv (dyslexic)	0.659	0.534	0.212
COTv (elderly)	0.658	0.521	0.205
COTv (low vision)	0.659	0.490	0.191
COTv (ADHD)	0.603	0.343	0.144
Center Bias	0.615	0.362	0.140
Random	0.500	0.001	0.000

Higher is better for AUC, NSS, CC. COT = persona filters only. COTv = with Schwartz value modulation.

Note: Power-user was excluded from Study 1 because its behavioral profile depends on site knowledge data (DOM structure, navigation maps) that is unavailable for static screenshots. The siteFamiliarity binary gate forces power-user to first-timer behavior without DOM data, making its predictions uninformative. Color-blind-deuteranopia was substituted as a purely perceptual disability whose divergence manifests on static images.

Key Findings

COT outperforms center bias on all metrics. The neurotypical persona achieves AUC 0.663, beating center bias (0.615) by 7.8%. This is a zero-shot theoretical prediction with no training on fixation data — the model derives saliency from CIE-Lab center-surround contrast with persona-specific attention filters.

ADHD diverges as predicted (cognitive mechanism). The ADHD persona produces AUC 0.605 — the lowest, 0.058 below neurotypical. This is the largest separation of any persona, consistent with ADHD's qualitative departure from normative scan patterns: high novelty weight (2.0) draws attention to peripheral color pops, low global integration (0.4) weakens structured scanning. The divergence is in the theoretically predicted direction — the UEyes participants are neurotypical, so the neurotypical persona should be the best match. ADHD diverges because it models a different attention strategy, not because the model performs poorly.

Color-blind divergence is small and stimulus-dependent (perceptual mechanism). The color-blind persona produces AUC 0.660, only 0.003 below neurotypical. The green channel attenuation has minimal effect on the UEyes stimulus set because most web pages rely on luminance contrast and layout rather than red-green color coding. This generates a testable prediction: color-blind divergence should be larger on pages with red-green status indicators, colored error states, or color-coded navigation.

Non-ADHD personas cluster within 0.004 AUC. Color-blind (0.660), low vision (0.660), dyslexic (0.659), elderly (0.659), and neurotypical (0.663) all cluster tightly. These personas differ on non-saliency dimensions (reading, motor, cognitive load), confirming the six-layer architecture correctly separates saliency effects from downstream processing effects.

Schwartz values have minimal effect on Layer 1. COTv results are nearly identical to COT (<0.002 AUC difference). Values primarily affect higher layers (decision complexity, frustration), not visual saliency. See Study 1b for the full CTC analysis where values show 5-20% effects across all layers.

CTC-to-Fixation Correlation (Null Result)

We tested whether total page demand (computed from image features: edge density, pixel entropy, region count, feature congestion) predicts aggregate fixation complexity (entropy and spatial dispersion of the ground-truth fixation map).

Correlation	r	p	rho
Demand vs Fixation Entropy	-0.062	0.169	-0.090
Demand vs Fixation Dispersion	0.059	0.190	0.029

Neither correlation is significant. This is expected: the demand proxy was computed from raw image features (edge density, pixel entropy, region count) as a stand-in for the full 26-dimensional demand distribution, which is designed to be computed from live DOM analysis, not screenshots. Image-level features like edge density conflate visual complexity with visual richness — a busy but well-organized page has high edge density but low cognitive demand. The full COT demand mapping requires DOM structure (interactive element count, form detection, navigation depth) that cannot be extracted from a static screenshot. This null result does not invalidate the framework — it confirms that the image-based demand proxy is insufficient and that Study 2's live-page behavioral validation is the appropriate test of the CTC-to-cognitive-load relationship.

Context

State-of-the-art deep learning saliency models (UMSI++, DeepGaze++) trained on millions of fixation images achieve AUC ~0.87 on UEyes. COT's 0.663 is lower in absolute terms but represents a fundamentally different approach — zero-shot prediction from cognitive theory rather than learned visual features. The comparison is between a theoretical model and a statistical one.

Method

Saliency Prediction

For each web page, persona-specific saliency maps are generated using:

CIE-Lab color space conversion
Multi-scale center-surround contrast (W₂ distance at 3 spatial scales)
Persona attention filters (novelty weight, text weight, global integration, peripheral sensitivity, threshold)
Center bias modulation scaled by global integration capacity
Sub-threshold suppression based on persona attention threshold
Value-driven attention modulation (v18.54.0): Schwartz motivational values continuously adjust the saliency filter via 4 parameters:
- Exponent (concentration/dispersion): high stimulation → dispersed attention, high achievement → concentrated peaks
- Center bias: high conformity → center-focused scanning, high self-direction → peripheral exploration
- Global bias: high security → vigilant scanning (more regions register)
- Threshold adjustment: high tradition → novel elements suppressed

Steps 1-5 produce identical maps for personas with the same attention mode (e.g., power-user and first-timer both use 'uniform'). Step 6 breaks this — values create persona-specific saliency maps even for personas with identical attention modes.

Evaluation Metrics

AUC-Judd: Area under ROC curve — ranking accuracy of fixation vs non-fixation locations
NSS: Normalized Scanpath Saliency — mean z-scored saliency at fixation points
CC: Pearson correlation — spatial correspondence with continuous fixation density
SIM: Histogram intersection — distribution overlap
KLD: Kullback-Leibler divergence — distribution distance (lower is better)

AUC and NSS are computed against binary fixation maps. CC, SIM, and KLD are computed against continuous density heatmaps to avoid the sparse-binary correlation artifact (correlating a continuous prediction with a <1% sparse binary map systematically underestimates CC).

Persona Profiles

Persona	Novelty Weight	Text Weight	Global Integration	Peripheral Sensitivity	Threshold	Color Attenuation
Neurotypical	1.0	1.0	1.0	1.0	50th pct	—
ADHD	2.0	0.5	0.4	1.5	30th pct	—
Low Vision	0.3	0.8	0.6	0.3	70th pct	—
Elderly	0.7	1.3	0.8	0.6	55th pct	—
Dyslexic	1.0	1.5	0.9	0.9	50th pct	—
Color-Blind	0.7	1.1	0.9	0.8	50th pct	R:0.6, G:1.0, B:0.8

Study 1b: Schwartz Value Modulation Effect

Date: April 2026 Method: Same 495 UEyes web pages, 6 personas. For each image × persona, compute Sequential CTC with and without Schwartz motivational value modulation (v18.54.0). All deltas tested with Wilcoxon signed-rank.

Table 2: CTC Change with Schwartz Value Modulation (N=495)

Persona	Baseline CTC	Modulated CTC	Delta	Delta %	Sig
Neurotypical	0.870 (all values 0.5)	0.738	-0.132	-14.3%	***
ADHD	1.209	1.346	+0.137	+10.7%	***
Power User	1.644	1.313	-0.331	-20.0%	***
Elderly	1.784	1.681	-0.102	-5.5%	***
Dyslexic	1.161	1.014	-0.146	-12.1%	***
Low Vision	1.385	1.285	-0.101	-7.1%	***

All deltas significant at p < 0.001 (Wilcoxon signed-rank test, N=495).

Table 3: Per-Layer CTC Delta by Persona

Persona	Saliency	CogLoad	Decision	Motor	Frustration	Readability
Neurotypical	-0.076	-0.004	-0.059	-0.006	+0.006	+0.007
ADHD	+0.031	+0.001	+0.016	+0.027	+0.022	+0.041
Power User	-0.100	+0.003	-0.279	+0.004	+0.016	+0.024
Elderly	-0.036	-0.005	-0.044	-0.008	+0.001	-0.011
Dyslexic	-0.058	-0.005	-0.070	-0.008	+0.004	-0.009
Low Vision	-0.018	-0.002	-0.070	-0.008	+0.001	-0.005

Key Findings

Values modulate all 6 layers — including saliency. The Decision layer shows the largest deltas, but the Saliency layer now shows meaningful persona-specific effects. The ADHD persona's saliency demand increased (+0.031) because high stimulation (0.9) amplifies novelty-seeking attention. The power user's saliency demand decreased (-0.100) because focused, self-directed scanning reduces distractibility. Values change both where people look AND how they process what they see.

Power user benefits most (-20.0%). High achievement (0.9) reduces satisficing demand — optimizers don't struggle with choice overload. High self-direction (0.9) reduces social proof sensitivity and produces focused scanning (saliency -0.100). The Decision layer drops -0.279, the single largest effect. Combined with the saliency reduction, power users experience 20% less total cognitive cost — the largest effect in the study.

ADHD gets significantly harder (+10.7%). High stimulation (0.9) amplifies curiosity and novelty-seeking attention (saliency +0.031), depletes patience, and compounds across all downstream layers. The baseline CTC of 1.209 increases to 1.346 — a 10.7% increase in total cognitive cost. Every layer except saliency shows secondary effects from depleted capacity cascading downstream.

Persona differentiation slightly decreased (1.7%). The cross-persona CTC variance dropped from 0.141 to 0.138 (ratio: 0.983). The effect is smaller than the earlier model (which showed 9.8% decrease) because saliency modulation adds new differentiation that partially offsets the Decision layer compression. The ADHD persona diverges more strongly (+10.7%) while the power user converges less (-20.0%), creating a wider spread at both extremes.

18 research-backed modulation coefficients. The value modulation system maps 8 of Schwartz's 10 universal values to 17 cognitive trait demands across all 6 transport layers. Coefficients were derived from published trait-value correlations (Roccas et al., 2002; Cialdini, 2001; Fogg, 2003; Simon, 1956).

Schwartz Value Profiles Used

Persona	Self-Dir	Stimul	Achieve	Security	Conform	Tradition
Neurotypical	0.5	0.5	0.5	0.5	0.5	0.5
ADHD	0.7	0.9	0.3	0.2	0.2	0.1
Power User	0.9	0.6	0.9	0.4	0.2	0.2
Elderly	0.3	0.2	0.3	0.9	0.8	0.9
Dyslexic	0.5	0.4	0.5	0.6	0.5	0.5
Low Vision	0.4	0.3	0.4	0.8	0.6	0.7

Live Validation: Value-Driven Saliency Differentiation

The offline Study 1b results were confirmed via live attention_analysis on cbrowser.ai (April 2026). Two personas — a custom "Alex the Builder" (high stimulation 0.9, self-direction 0.95, social-proof-immune) and the built-in elderly-user (high security 0.9, conformity 0.8, authority-sensitive) — produced divergent saliency maps on the same page with useValues: true:

Metric	Alex the Builder	elderly-user
Quality score	30	37
CTA capture rate	0%	9.8%
valueRelevanceScore	0	0.076
Top CTA in top 10	None	"Hunt Bugs" (vr=0.779, social-proof)

The elderly-user's value profile caused the saliency engine to boost the "Hunt Bugs" flagship card (tagged as social-proof) into her top 10 attention zones. Alex's self-direction profile correctly downweighted the same card — social proof doesn't capture her attention.

This confirms the theoretical prediction: motivational values change what people look for, not just how they process what they see. The same pixels produce different attention maps for different value profiles, mediated by the 4-parameter value-driven saliency filter (exponent, center bias, global bias, threshold).

Implication for UX Practice

The live test surfaced a real marketing insight: cbrowser.ai's landing page converts trust-seeking personas (high authority/social-proof sensitivity) better than builder personas (high self-direction). The site is built for builders but optimized for trust-seekers — a common mismatch that traditional A/B testing would not detect because it doesn't segment by motivational profile.

References

Jiang, Y., et al. (2023). UEyes: Understanding visual saliency across user interface types. Proc. CHI '23. ACM.
Klein, D.A., & Frintrop, S. (2012). Center-surround divergence of feature statistics for salient object detection. Proc. DAGM/OAGM. Springer.
Bylinskii, Z., et al. (2019). What do different evaluation metrics tell us about saliency models? IEEE TPAMI, 41(3), 740-757.
Pessoa, L. (2009). How do emotion and motivation direct executive control? Trends in Cognitive Sciences, 13(4), 160-166.
Anderson, B.A., Laurent, P.A., & Yantis, S. (2013). Value-driven attentional capture. PNAS, 108(25), 10367-10371.
Balcetis, E., & Dunning, D. (2006). See what you want to see: Motivational influences on visual perception. JPSP, 91(4), 612-625.
Schwartz, S.H. (1992). Universals in the content and structure of values. Advances in Experimental Social Psychology, 25, 1-65.
Roccas, S., Sagiv, L., Schwartz, S.H., & Knafo, A. (2002). The big five personality factors and personal values. PSPB, 28(6), 789-801.
Cialdini, R.B. (2001). Influence: Science and practice (4th ed.). Allyn & Bacon.
Fogg, B.J. (2003). Persuasive Technology: Using Computers to Change What We Think and Do. Morgan Kaufmann.
Simon, H.A. (1956). Rational choice and the structure of the environment. Psychological Review, 63(2), 129-138.

Study 1 Saliency Benchmark

Results

Table 1: Saliency Prediction Accuracy (N=495 web pages)

Key Findings

CTC-to-Fixation Correlation (Null Result)

Context

Method

Saliency Prediction

Evaluation Metrics

Persona Profiles

Study 1b: Schwartz Value Modulation Effect

Table 2: CTC Change with Schwartz Value Modulation (N=495)

Table 3: Per-Layer CTC Delta by Persona

Key Findings

Schwartz Value Profiles Used

Live Validation: Value-Driven Saliency Differentiation

Implication for UX Practice

References

From the Blog