Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning

Hanqi Yan; Qinglin Zhu; Xinyu Wang; Lin Gui; Yulan He

Informatics

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

While large language models (LLMs) have the capability to iteratively reflect on their own outputs, recent studies have observed their struggles with knowledge-rich problems without access to external resources. In addition to the inefficiency of LLMs in self-assessment, we also observe that LLMs struggle to revisit their predictions despite receiving explicit negative feedback. Therefore, we propose Mirror, a Multiple-perspective self-reflection method for knowledge-rich reasoning, to avoid getting stuck at a particular reflection iteration. Mirror enables LLMs to reflect from multiple-perspective clues, achieved through a heuristic interaction between a Navigator and a Reasoner. It guides agents toward diverse yet plausibly reliable reasoning trajectories without access to ground truth by encouraging (1) diversity of directions generated by the Navigator and (2) agreement among strategically induced perturbations in responses generated by the Reasoner. Experiments on five reasoning datasets demonstrate Mirror's superiority over several contemporary self-reflection approaches. Additionally, the ablation studies clearly indicate that our strategies alleviate the aforementioned challenges. The code is released at https://github.com/hanqi-qi/Mirror.git.

Original language	English
Title of host publication	Long Papers
Editors	Lun-Wei Ku, Andre F. T. Martins, Vivek Srikumar
Publisher	Association for Computational Linguistics (ACL)
Pages	7086-7103
Number of pages	18
ISBN (Electronic)	9798891760943
Publication status	Published - 2024
Event	62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Bangkok, Thailand; Duration: 11 Aug 2024 → 16 Aug 2024

Publication series

Name	Proceedings of the Annual Meeting of the Association for Computational Linguistics
Volume	1
ISSN (Print)	0736-587X

Conference

Conference	62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024
Country/Territory	Thailand
City	Bangkok
Period	11/08/2024 → 16/08/2024

Cite this

@inproceedings{f85c5be900514dafa2e2234556d4805e,
title = "Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning",
abstract = "While large language models (LLMs) have the capability to iteratively reflect on their own outputs, recent studies have observed their struggles with knowledge-rich problems without access to external resources. In addition to the inefficiency of LLMs in self-assessment, we also observe that LLMs struggle to revisit their predictions despite receiving explicit negative feedback. Therefore, we propose Mirror, a Multiple-perspective self-reflection method for knowledge-rich reasoning, to avoid getting stuck at a particular reflection iteration. Mirror enables LLMs to reflect from multiple-perspective clues, achieved through a heuristic interaction between a Navigator and a Reasoner. It guides agents toward diverse yet plausibly reliable reasoning trajectories without access to ground truth by encouraging (1) diversity of directions generated by the Navigator and (2) agreement among strategically induced perturbations in responses generated by the Reasoner. Experiments on five reasoning datasets demonstrate Mirror's superiority over several contemporary self-reflection approaches. Additionally, the ablation studies clearly indicate that our strategies alleviate the aforementioned challenges. The code is released at https://github.com/hanqi-qi/Mirror.git.",
author = "Hanqi Yan and Qinglin Zhu and Xinyu Wang and Lin Gui and Yulan He",
note = "Publisher Copyright: {\textcopyright} 2024 Association for Computational Linguistics.; 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 ; Conference date: 11-08-2024 Through 16-08-2024",
year = "2024",
language = "English",
series = "Proceedings of the Annual Meeting of the Association for Computational Linguistics",
publisher = "Association for Computational Linguistics (ACL)",
pages = "7086--7103",
editor = "Lun-Wei Ku and Martins, {Andre F. T.} and Vivek Srikumar",
booktitle = "Long Papers",
}

Yan, H, Zhu, Q, Wang, X, Gui, L & He, Y 2024, Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning. in L-W Ku, AFT Martins & V Srikumar (eds), Long Papers. Proceedings of the Annual Meeting of the Association for Computational Linguistics, vol. 1, Association for Computational Linguistics (ACL), pp. 7086-7103, 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand, 11/08/2024.

Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning. / Yan, Hanqi; Zhu, Qinglin; Wang, Xinyu et al.
Long Papers. ed. / Lun-Wei Ku; Andre F. T. Martins; Vivek Srikumar. Association for Computational Linguistics (ACL), 2024. p. 7086-7103 (Proceedings of the Annual Meeting of the Association for Computational Linguistics; Vol. 1).

TY  - CHAP
T1  - Mirror
T2  - 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024
AU  - Yan, Hanqi
AU  - Zhu, Qinglin
AU  - Wang, Xinyu
AU  - Gui, Lin
AU  - He, Yulan
PY  - 2024
Y1  - 2024
N2  - While large language models (LLMs) have the capability to iteratively reflect on their own outputs, recent studies have observed their struggles with knowledge-rich problems without access to external resources. In addition to the inefficiency of LLMs in self-assessment, we also observe that LLMs struggle to revisit their predictions despite receiving explicit negative feedback. Therefore, we propose Mirror, a Multiple-perspective self-reflection method for knowledge-rich reasoning, to avoid getting stuck at a particular reflection iteration. Mirror enables LLMs to reflect from multiple-perspective clues, achieved through a heuristic interaction between a Navigator and a Reasoner. It guides agents toward diverse yet plausibly reliable reasoning trajectories without access to ground truth by encouraging (1) diversity of directions generated by the Navigator and (2) agreement among strategically induced perturbations in responses generated by the Reasoner. Experiments on five reasoning datasets demonstrate Mirror's superiority over several contemporary self-reflection approaches. Additionally, the ablation studies clearly indicate that our strategies alleviate the aforementioned challenges. The code is released at https://github.com/hanqi-qi/Mirror.git.
AB  - While large language models (LLMs) have the capability to iteratively reflect on their own outputs, recent studies have observed their struggles with knowledge-rich problems without access to external resources. In addition to the inefficiency of LLMs in self-assessment, we also observe that LLMs struggle to revisit their predictions despite receiving explicit negative feedback. Therefore, we propose Mirror, a Multiple-perspective self-reflection method for knowledge-rich reasoning, to avoid getting stuck at a particular reflection iteration. Mirror enables LLMs to reflect from multiple-perspective clues, achieved through a heuristic interaction between a Navigator and a Reasoner. It guides agents toward diverse yet plausibly reliable reasoning trajectories without access to ground truth by encouraging (1) diversity of directions generated by the Navigator and (2) agreement among strategically induced perturbations in responses generated by the Reasoner. Experiments on five reasoning datasets demonstrate Mirror's superiority over several contemporary self-reflection approaches. Additionally, the ablation studies clearly indicate that our strategies alleviate the aforementioned challenges. The code is released at https://github.com/hanqi-qi/Mirror.git.
UR  - http://www.scopus.com/inward/record.url?scp=85198942543&partnerID=8YFLogxK
M3  - Conference paper
AN  - SCOPUS:85198942543
T3  - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP  - 7086
EP  - 7103
BT  - Long Papers
A2  - Ku, Lun-Wei
A2  - Martins, Andre F. T.
A2  - Srikumar, Vivek
PB  - Association for Computational Linguistics (ACL)
Y2  - 11 August 2024 through 16 August 2024
ER  -
