Chain of thought (CoT) seems to work as an extraction algorithm for information that is buried too deep to retrieve directly.
Models hold more information than they can extract in a single step, but CoT can find a key to look that information up, or synthesise it by applying learned generalisations.
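As a loose analogy only, the "find a key, then look it up" idea can be sketched with a toy dictionary. Everything here is hypothetical and chosen for illustration: the `FACTS` table, the relation names, and both functions. It says nothing about how a real model stores or retrieves information. It only shows how an answer that a single lookup cannot reach becomes reachable once an intermediate result is used as the next key.

```python
# Toy illustration only: a dictionary stands in for what a model "knows".
# The facts and relation names are made up for this sketch.
FACTS = {
    ("capital_of", "France"): "Paris",
    ("river_through", "Paris"): "Seine",
}


def direct_answer(question):
    """Single lookup: succeeds only if the exact question is stored as a key."""
    return FACTS.get(question)


def stepwise_answer(relations, start):
    """Chain lookups: each intermediate result becomes the key for the next."""
    current = start
    steps = []
    for relation in relations:
        current = FACTS.get((relation, current))
        if current is None:
            # The chain breaks if any intermediate fact is missing.
            return None, steps
        steps.append(current)
    return current, steps


# The composite question is not stored anywhere, so a direct lookup fails.
composite = ("river_through_capital_of", "France")
direct = direct_answer(composite)

# Working through the intermediate key ("Paris") reaches the answer.
answer, steps = stepwise_answer(["capital_of", "river_through"], "France")

print(direct)  # None
print(answer)  # Seine
print(steps)   # ['Paris', 'Seine']
```

In this analogy the intermediate steps play the role of the written-out reasoning: the information was present all along, but only reachable through a key that had to be produced first.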