OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models

Hainiu Xu, Yulan He*, Lixing Zhu, Runcong Zhao, Jinhua Du

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Fingerprint

Dive into the research topics of 'OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models'. Together they form a unique fingerprint.

Computer Science

Psychology