Claude Sonnet 4.6

1 run · 1 dataset · 1 model

slug: claude-sonnet-4-6

0.738
Best SPS · globalopinionqa

Disaggregated subgroup scorecard. Each card below is one published run for this vendor; expand the question-type and demographic-subgroup sections to see the matrix beneath the headline SPS. Where coverage permits, 95% CI bands accompany the point estimate.

globalopinionqa raw

raw--claude-sonnet-4.6--tdefault--tplcurrent--94105f12

0.738 ± 0.053
SPS · 95% CI [0.562, 0.667] · n = 100
Question-type breakdown (6 topics)
Topic                                 SPS    p_dist  p_rank  p_refuse  N
Health & Science                      0.887  0.775   1.000   1.000     1
General Attitudes                     0.864  0.785   0.943   1.000     4
Economy & Work                        0.703  0.770   0.636   0.949     3
International Relations & Security    0.643  0.619   0.668   0.987     60
Trust & Wellbeing                     0.616  0.489   0.743   1.000     2
Politics & Governance                 0.514  0.498   0.530   0.963     30
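
The topic rows can be sanity-checked with a short script. The sketch below is illustrative only and rests on two assumptions that this page does not state: that each topic's SPS is the simple mean of its p_dist and p_rank (the rows are consistent with this to rounding, but the definition is not published here), and that topics can be pooled by weighting each SPS by its N. The names `TopicRow`, `max_mean_gap`, and `weighted_sps` are hypothetical helpers, not part of any published tooling.

```python
from typing import NamedTuple


class TopicRow(NamedTuple):
    topic: str
    sps: float
    p_dist: float
    p_rank: float
    p_refuse: float
    n: int


# Values copied from the question-type breakdown above.
ROWS = [
    TopicRow("Health & Science", 0.887, 0.775, 1.000, 1.000, 1),
    TopicRow("General Attitudes", 0.864, 0.785, 0.943, 1.000, 4),
    TopicRow("Economy & Work", 0.703, 0.770, 0.636, 0.949, 3),
    TopicRow("International Relations & Security", 0.643, 0.619, 0.668, 0.987, 60),
    TopicRow("Trust & Wellbeing", 0.616, 0.489, 0.743, 1.000, 2),
    TopicRow("Politics & Governance", 0.514, 0.498, 0.530, 0.963, 30),
]


def max_mean_gap(rows):
    """Largest |SPS - mean(p_dist, p_rank)| across rows.

    ASSUMPTION: SPS = (p_dist + p_rank) / 2. This is inferred from the
    table, not taken from a published definition.
    """
    return max(abs(r.sps - (r.p_dist + r.p_rank) / 2) for r in rows)


def weighted_sps(rows):
    """N-weighted mean of the per-topic SPS values.

    ASSUMPTION: pooling by question count is only one possible way to
    aggregate topics; the page does not say how the headline is computed.
    """
    total_n = sum(r.n for r in rows)
    return sum(r.sps * r.n for r in rows) / total_n


if __name__ == "__main__":
    print(f"max gap vs mean(p_dist, p_rank): {max_mean_gap(ROWS):.4f}")
    print(f"N-weighted topic SPS: {weighted_sps(ROWS):.3f}")
```

On these rows the gap stays within rounding error of the three-decimal display, and the N-weighted mean comes to about 0.617. With N = 1 to 4 in four of the six topics, the per-topic values for those rows rest on very few questions.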
No demographic subgroup breakdown published for this run yet. When conditioned runs land, age / geography / education / party-ID slices appear here with p_dist and coverage.

No demographic conditioning data has been published for this vendor yet. The question-type matrix above shows topic-level parity; subgroup rows fill in once SynthPanel-style conditioned runs land.
