Loading...
Independent, expert-reviewed assessments of frontier AI systems on the Bentham Humanities Benchmark. All questions authored and adjudicated by verified domain experts.
Latest evaluations of frontier AI models on the Bentham Humanities Benchmark