@mttaggart@infosec.exchange
Brutal:
The findings across task, length, and format generalization experiments converge on a conclusion: [Chain-of-Thought reasoning] is not a mechanism for genuine logical inference but rather a sophisticated form of structured pattern matching, fundamentally bounded by the data distribution seen during training. When pushed even slightly beyond this distribution, its performance degrades significantly, exposing the superficial nature of the "reasoning" it produces.
@AAKL@infosec.exchange
@mttaggart@infosec.exchange So it's like the lottery.