Measuring Faithfulness in Chain-of-Thought Reasoning

Jul 22, 2023

Does a model's chain-of-thought faithfully reflect the reasoning behind its answer? Perhaps it does...

But then again, perhaps it does not.

Recent work from Anthropic studies this question empirically.

An overview of the technical report 👇

Link to the paper: https://www.anthropic.com/index/measuring-faithfulness-in-chain-of-thought-reasoning