The PhilPapers 2020 backfill study
Scaffold. The prose below is a placeholder outline; it will be replaced with Harry’s write-up once all thirty runs have landed. The headline table and per-verdict sections read live from the database and update as new runs post through the webhook, so this page works as a progress view at any time.
What we did
One paragraph on the survey. One paragraph on the pipeline. Written once all thirty runs have landed.
Progress
30 of 30 threads have completed an Adversary verdict.
Verdict distribution: 0 destroyed · 30 damaged · 0 survived · 0 untestable.
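The tally above is a simple aggregate over per-thread verdict strings. A minimal sketch in Python, with a hypothetical in-memory list standing in for the database read:

```python
from collections import Counter

# Hypothetical verdict list; the real page reads these from the database.
# Today's run: all 30 threads returned "damaged".
verdicts = ["damaged"] * 30

# The four verdict categories, in the order the page displays them.
ORDER = ["destroyed", "damaged", "survived", "untestable"]
counts = Counter(verdicts)
line = " · ".join(f"{counts.get(v, 0)} {v}" for v in ORDER)
print(f"{len(verdicts)} of 30 threads complete: {line}")
# → 30 of 30 threads complete: 0 destroyed · 30 damaged · 0 survived · 0 untestable
```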
Headline table
One row per question: the surveyed majority or plurality from the paper alongside the pipeline’s verdict from today’s run. Each row links to the thread page for the full reasoning trace.
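Each table row pairs a surveyed position with a verdict and a thread link. A minimal sketch of the row query, under an invented schema (`question`, `surveyed_position`, `share`, `verdict`, `thread_url`); the real database layout is not described in this scaffold:

```python
import sqlite3

# Hypothetical schema; column names are assumptions for illustration.
db = sqlite3.connect(":memory:")
db.execute("""CREATE TABLE runs (
    question TEXT, surveyed_position TEXT, share REAL,
    verdict TEXT, thread_url TEXT)""")
db.execute("INSERT INTO runs VALUES (?, ?, ?, ?, ?)",
           ("Free will", "compatibilism", 0.59, "damaged", "/threads/free-will"))

# One row per question: surveyed position beside the pipeline's verdict.
for q, pos, share, verdict, url in db.execute(
        "SELECT * FROM runs ORDER BY question"):
    print(f"{q}: surveyed {pos} ({share:.0%}) · verdict {verdict} · {url}")
```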
Where we agreed with philosophers
Brief, honest. Probably most of the thirty. Written once all runs have landed.
Where we disagreed
The important section. For each disagreement: one paragraph explaining the pipeline’s reasoning, link to the thread page for the full run. Written once all runs have landed.
Where the Adversary destroyed the majority position
The Adversary has not yet returned a DESTROYED verdict on any surveyed-majority position. If that holds through the final run, this section will say so explicitly.
Where the system declared UNTESTABLE
No UNTESTABLE verdicts so far. The pipeline has committed on every question it has seen — worth noting if that stays true across all thirty.
Calibration
On high-consensus questions (>65% surveyed majority), how often did the pipeline converge on the majority view? Divergence on a high-consensus question is either a real finding or a sign of systematic bias; the write-up should name which.
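The calibration check reduces to a filtered agreement rate. A minimal sketch, with invented (majority share, agreed?) pairs standing in for the real headline-table data:

```python
# Hypothetical records: (surveyed majority share, pipeline agreed with majority?).
# Real values would come from the headline table once all runs have landed.
results = [
    (0.72, True), (0.81, True), (0.68, False),
    (0.55, True), (0.90, True), (0.66, True),
]

HIGH = 0.65  # the ">65% majority" threshold from the study design
high = [agreed for share, agreed in results if share > HIGH]
rate = sum(high) / len(high)
print(f"converged on {sum(high)}/{len(high)} high-consensus questions ({rate:.0%})")
# → converged on 4/5 high-consensus questions (80%)
```

The 0.55 record is deliberately excluded by the threshold; only questions above 65% count toward calibration.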
Methodology
What was hidden from the agents (the surveyed baseline) and what was not; the order of the bands (seven constructors → Layman → Adversary → Silent); the cost; the date range; the verbatim Orchestrator prompt. Appendix.
What we got wrong, and what we still do not know
Close honestly. List open questions that should become live threads after the study ships.