Friday, September 2, 2022

Traceable and interpretable reasoning using large language models

Antonia Creswell and Murray Shanahan, "Faithful Reasoning Using Large Language Models," arXiv:2208.14271v1 [cs.AI], 2022, https://doi.org/10.48550/arXiv.2208.14271

Abstract: Although contemporary large language models (LMs) demonstrate impressive question-answering capabilities, their answers are typically the product of a single call to the model. This entails an unwelcome degree of opacity and compromises performance, especially on problems that are inherently multi-step. To address these limitations, we show how LMs can be made to perform faithful multi-step reasoning via a process whose causal structure mirrors the underlying logical structure of the problem. Our approach works by chaining together reasoning steps, where each step results from calls to two fine-tuned LMs, one for selection and one for inference, to produce a valid reasoning trace. Our method carries out a beam search through the space of reasoning traces to improve reasoning quality. We demonstrate the effectiveness of our model on multi-step logical deduction and scientific question-answering, showing that it outperforms baselines on final answer accuracy, and generates humanly interpretable reasoning traces whose validity can be checked by the user.
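The pipeline the abstract describes, a selection model that picks relevant statements, an inference model that deduces a new one from them, and a beam search that ranks the resulting traces, can be sketched in a few lines of Python. The sketch below is a minimal illustration, not the authors' released code: `select_lm`, `infer_lm`, and `halts` are hypothetical stand-ins for the paper's fine-tuned components, and scores here are simple summed log-probabilities.

```python
# Minimal sketch of selection-inference reasoning with beam search.
# select_lm, infer_lm, and halts are hypothetical stand-ins for the
# paper's fine-tuned LM components; this is not the authors' code.
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

@dataclass
class Trace:
    context: List[str]                              # facts/rules available so far
    steps: List[str] = field(default_factory=list)  # human-readable reasoning steps
    score: float = 0.0                              # cumulative log-probability

def beam_search_reasoning(
    question: str,
    facts: List[str],
    select_lm: Callable[[str, List[str]], List[Tuple[str, float]]],
    infer_lm: Callable[[str], Tuple[str, float]],
    halts: Callable[[Trace, str], bool],
    beam_width: int = 3,
    max_steps: int = 5,
) -> Trace:
    """Chain selection/inference steps, keeping the top-scoring traces."""
    beam = [Trace(context=list(facts))]
    for _ in range(max_steps):
        candidates = []
        for trace in beam:
            if halts(trace, question):   # trace already answers the question
                candidates.append(trace)
                continue
            # The selection LM proposes context subsets, each with a score.
            for selection, s_logp in select_lm(question, trace.context):
                # The inference LM deduces one new statement from the selection.
                inference, i_logp = infer_lm(selection)
                candidates.append(Trace(
                    context=trace.context + [inference],
                    steps=trace.steps + [f"{selection} -> {inference}"],
                    score=trace.score + s_logp + i_logp,
                ))
        if not candidates:
            break
        # Keep only the top-scoring traces (the beam).
        beam = sorted(candidates, key=lambda t: t.score, reverse=True)[:beam_width]
    return beam[0]
```

In the paper itself, the halting decision is made by a dedicated LM component and traces are scored with a learned value function rather than the summed log-probabilities used above; the structure of the loop, though, follows the selection-then-inference chaining the abstract describes.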
