AI models lag behind AGI-level reasoning despite recent advances
Apple researchers have found that leading AI models still fall short of humanlike reasoning, casting doubt on claims that artificial general intelligence (AGI) is imminent.
In a June paper titled The Illusion of Thinking, Apple’s team evaluated major large reasoning models (LRMs), including ChatGPT and Claude, using custom puzzle games rather than standard coding or math benchmarks.
While recent AI updates show gains on conventional tests, the study found that these benchmarks fail to capture broader reasoning capabilities.
The researchers tested both “thinking” and “non-thinking” model variants and found performance dropped sharply as task complexity increased.
“We found that LRMs have limitations in exact computation: they fail to use explicit algorithms and reason inconsistently across puzzles,” the paper stated.
Additionally, they observed that AI has a propensity to overthink, frequently beginning with accurate solutions before deviating into flawed reasoning as the responses developed.
These patterns suggest current models imitate reasoning without internalising it, lacking the kind of generalisable thinking associated with AGI.
The study concluded that existing approaches may be hitting fundamental limits in replicating human reasoning.
This analysis contrasts with optimistic forecasts from figures like OpenAI CEO Sam Altman and Anthropic CEO Dario Amodei.
“We are now confident we know how to build AGI as we have traditionally understood it,” Altman said in January.
Amodei predicted AGI might exceed human ability by 2026 or 2027.
Apple’s findings suggest such projections may underestimate the complexity of achieving genuine general intelligence in machines.
Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
You may also like
Siebert Financial Seeks $100 Million for Crypto and AI After SEC Nod
SBI Invests $50 Million in Circle in NYSE Debut
AI Overtakes Crypto in Online Chatter, Santiment Reports Growing Debate on Job Displacement
The conversation around artificial intelligence is intensifying across the crypto space, with AI discussions now overshadowing crypto itself in online forums, according to blockchain analytics firm Santiment.

UK Insolvency Service Appoints First Crypto Specialist to Boost Asset Recovery
The UK Insolvency Service has taken a significant step in modernizing its approach to asset recovery by appointing its first cryptocurrency intelligence specialist. This move comes as digital assets, such as Bitcoin and Ethereum, become increasingly prevalent in bankruptcy and criminal investigations.

Trending news
MoreCrypto prices
More








