Bentham Research
Jeremy Bentham

Bentham's mission is to provide frontier AI with the humanities layer it needs to reason truthfully, wisely and brilliantly.

We convene doctors of philosophy and world class domain experts in ethics, philosophy, history, theology and political theory to create peer-reviewed benchmark and training datasets for frontier AI labs — reducing hallucination, sycophancy and shallow moral reasoning, and helping models move beyond computation towards genuine judgement.

The Central Challenge

Frontier AI systems demonstrate advanced scientific reasoning, legal analysis, and mathematical competence. Yet these same systems remain structurally weak in the domains that matter most for their social deployment: ethical nuance, historical interpretation, civilisational context, and epistemic humility.

The failure modes that matter most — sycophantic responses to contested ethical questions, hallucinated historical facts, shallow analysis of politically complex situations — are not failures of processing power. They are failures of judgement. And failures of judgment in humanistic domains are precisely the kind of failure that humanistic expertise is uniquely equipped to identify, measure, and help address.

"The quality of expert feedback in humanistic domains is increasingly recognised by researchers and AI developers as one of the significant open problems in post-training alignment."

Bentham Research May 2026

Known AI Failure Modes in Humanistic Reasoning

Sycophancy

Models adjust ethical assessments to match perceived user preference, producing validation rather than rigorous moral reasoning.

Hallucinated History

Confident assertion of fabricated historical facts, citations, and scholarly consensus — indistinguishable from accurate recall.

Shallow Moral Reasoning

Fluent articulation of ethical frameworks without genuine engagement with tradeoffs, context, or the limits of any single framework.

False Certainty

Failure to acknowledge genuine scholarly disagreement, ambiguity, or the contested nature of historical and philosophical claims.

Ideological Flattening

Reduction of complex political, theological, and philosophical traditions to simplified, Western-centric interpretive frames.

The Platform Architecture

Bentham Research operates across benchmark creation, training data supply, and enterprise evaluation — each reinforcing the others, each grounded in the same peer-reviewed academic infrastructure.

Benchmark Engine

Peer-reviewed evaluation infrastructure that measures humanistic reasoning across ethics, history, philosophy, theology, and political theory — with sub-domain precision.

  • Questions authored exclusively by verified domain experts
  • Independent peer review — Cohen's Kappa ≥ 0.70 required
  • Public leaderboard for all frontier models
  • Annual benchmark report distributed to labs and regulators

Training Data Supply

Expert-authored RLHF and post-training datasets that give frontier labs the humanistic signal quality their alignment pipelines currently lack — not gig labour, not crowd annotation.

  • Sourced chain-of-thought explanations from primary scholarship
  • Domain-matched reviewer network for preference signals
  • Ethics domain prioritised for RLHF alignment relevance
  • Full API integration for frontier lab workflows

Corporate Evaluation

AI audits and trusted governance assessments for enterprises deploying AI in regulated industries — built on the same rigorous methodology as the benchmark, designed for regulatory defensibility.

  • EU AI Act conformity assessment documentation
  • Adversarial testing and red teaming
  • Detailed expert reasoning reports
  • Bentham Certified mark for qualifying deployments

How the Evaluation Works

Speed is what existing frameworks optimise for. The Bentham benchmark optimises for the quality that makes the result worth trusting. Every question passes through six stages before it reaches the dataset — and every AI answer passes through human scholarly review before it reaches the leaderboard.

Domain Expert Authors

Domain expert writes question, four options, correct answer, and sourced chain-of-thought explanation.

AI Responds

Frontier models generate answer and reasoning under controlled conditions via API.

Expert Review

Independent verified domain expert in the same sub-discipline evaluates the question and AI response against the rubric.

Calibration

Inter-rater reliability checks, consensus discussion, and adjudication ensure consistency and fairness.

Scoring

Scores weighted across six evaluation dimensions to produce a composite reasoning profile for each model.

Publication

Results peer-reviewed by the Scientific Advisory Committee before publication and distribution to labs and regulators.

Accuracy & Fidelity

Faithfulness to facts, sources, and established scholarship

Ethical Reasoning

Soundness of moral analysis and consideration of ethical principles

Argument Quality

Sensitivity to historical, cultural, and disciplinary context

Nuance & Complexity

Coherence, logical structure, and use of evidence

Contextual Understanding

Recognition of ambiguity, tradeoffs, and multiple perspectives

Epistemic Integrity

Acknowledgment of limits, uncertainty, and alternative viewpoints

Institutional Independence

Bentham Research is governed by structures specifically designed to protect truth from commercial influence. Independent oversight, academic freedom, and structural firewalls between commercial activity and evaluation judgment are not aspirational — they are built into the founding documents and published in full.

"We do not evaluate AI to please. We evaluate to inform, to safeguard, and to advance humanity."

Independence

Institutionally independent from commercial, political, and ideological interests. No equity held by any frontier laboratory.

Integrity

Committed to academic rigour, methodological transparency, and intellectual honesty. No result can be altered at commercial request.

Firewalls

Structural separation between commercial activities and evaluation judgments. Evaluators operate under blinded conditions.

Transparency

Published methodologies, open datasets where appropriate, and explainable evaluation processes for regulatory scrutiny.

Public Interest

Dedicated to advancing human knowledge and promoting trustworthy, responsible AI — accountable to society, not shareholders.

Multi-Stakeholder Oversight

Governed by diverse voices — eminent domain experts, public policy figures, ethics experts, technologists, and civil society.

The Contributor Network

Every contributor to the Bentham benchmark is a verified domain expert — not a crowd worker, not a generalist. And every contributor earns equity in the institution, not gig wages. Founding Domain Experts receive Bentham Credits that convert to company shares at seed close: a co- ownership model that aligns academic rigour with institutional stake.

1,000

Target founding domain experts across all humanities disciplines

10 Million

Shares reserved exclusively for contributor domain experts

100%

Questions written and reviewed by verified experts only

1:1

Bentham Credits to shares conversion rate at Phase 1

Philosophy

Metaphysics · Epistemology · Logic · Analytic & Continental · Philosophy of Mind

History

Intellectual History · Social History · Historiography · Ancient to Contemporary

Ethics

Moral Philosophy · Applied Ethics · Bioethics · Political Ethics · AI Ethics

Political Theory

Political Philosophy · Comparative Politics · International Relations · Democratic Theory

Theology

Comparative Religion · Islamic Philosophy · Christian Ethics · Religious Studies

Literary Analysis

Critical Theory · Literary History · Hermeneutics · Cultural Studies · Rhetoric

Linguistics

Semantics · Pragmatics · Historical Linguistics · Cross-Cultural Communication

Classics

Ancient Greek & Latin · Classical History · Antiquity's Philosophical Legacy

Institutional Partnership

"Jeremy Bentham has presided over University College London since 1850. This is not a commercial arrangement. It is a natural alignment."

Bentham Research is in active discussion with University College London to establish the world's first dedicated centre for humanistic AI evaluation — combining UCL's world-class humanities scholarship, leading AI research, and interdisciplinary intellectual culture with Bentham's evaluation methodology and global contributor network.

We are not bringing UCL a finished project. We are bringing UCL an intellectual proposition at the moment when it can still be shaped — and inviting the institution that gave Jeremy Bentham his home to help us build what comes next.

UCL–Bentham Centre for Humanistic AI Evaluation

Named centre with joint governance and REF-eligible impact case studies from Year 2.

Joint Methodology & Benchmark Research Programme

Scoring rubric co-authored with UCL Philosophy, published for scholarly scrutiny

UCL Domain Expert Contribution Programme

Faculty and doctoral researchers as verified Founding Domain Experts and named co-authors

Annual Bentham Symposium at UCL

Hosted adjacent to the Bentham auto-icon — the Davos of humanistic AI governance

Public Policy & Regulatory Engagement

Joint submissions to UK AISI, EU AI Office, NIST, and ISO/IEC standards bodies

Join the Institution

The Bentham standard does not yet exist. We are building it now — and the domain experts, institutions, and partners who join at the founding will help determine what it becomes.

Founding Domain Experts
Frontier Labs
Enterprise
Governments
Research Partners
Donors
Direct enquiries tomarcus@pan.solar