@mttaggart@infosec.exchange
LLMs can't reason, part 3348249:
https://arstechnica.com/ai/2025/08/researchers-find-llms-are-bad-at-logical-inference-good-at-fluent-nonsense/
Source paper: https://arxiv.org/pdf/2508.01191
LLMs can't reason, part 3348249:
https://arstechnica.com/ai/2025/08/researchers-find-llms-are-bad-at-logical-inference-good-at-fluent-nonsense/
Source paper: https://arxiv.org/pdf/2508.01191
Brutal:
The findings across task, length, and format generalization experiments converge on a conclusion: [Chain-of-Thought reasoning] is not a mechanism for genuine logical inference but rather a sophisticated form of structured pattern matching, fundamentally bounded by the data distribution seen during training. When pushed even slightly beyond this distribution its performance degrades significantly, exposing the superficial nature of the "reasoningβ it produces.
@mttaggart@infosec.exchange So it's like the lottery.
@mttaggart@infosec.exchange The thing is, this isn't even surprising to anyone that studies these.
It's just a simple fact. You can only force math to look like it's "thinking" on the surface, any deeper glance and it's just a mess.
@NosirrahSec@infosec.exchange You'd think so, but my mentions are filled with professionals in the field who love to tell me the opposite. I'm hoping the eventual preponderance of evidence brings some folks out of the cult.
@NosirrahSec@infosec.exchange You'd think so, but my mentions are filled with professionals in the field who love to tell me the opposite. I'm hoping the eventual preponderance of evidence brings some folks out of the cult.
@mttaggart@infosec.exchange It's very, very, complex. I don't really fault some of us for not knowing as much about how it all works.
Even my understanding is very meager, but I DO try to learn more about it. I link these videos every time I talk about it, and it's still very basic, but seems daunting at first.
https://www.youtube.com/playlist?list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi
He normally focuses on physics/mathematics, but he's EASILY the best learning animator out there and I'll die on that hill.
@mttaggart@infosec.exchange It's very, very, complex. I don't really fault some of us for not knowing as much about how it all works.
Even my understanding is very meager, but I DO try to learn more about it. I link these videos every time I talk about it, and it's still very basic, but seems daunting at first.
https://www.youtube.com/playlist?list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi
He normally focuses on physics/mathematics, but he's EASILY the best learning animator out there and I'll die on that hill.