收听本期播客
阅读正文
Artificial Intelligence (AI) has been making remarkable progress in recent years, with tech companies striving to develop systems that can match human thinking. However, a new study by Apple researchers has raised significant concerns about the limitations of advanced AI, particularly a type known as large reasoning models (LRMs). These models are built to solve complex problems by breaking them down into smaller, logical steps. Yet, the study reveals that when faced with extremely challenging tasks, LRMs experience what experts call a ‘complete accuracy collapse,’ meaning they fail to deliver correct answers even when provided with the right approach.
The research tested LRMs alongside other AI systems from major companies such as OpenAI, Google, and Anthropic. The Apple team used intricate puzzles like the Tower of Hanoi—a classic game involving moving disks between rods under strict rules—to evaluate the models’ logical thinking skills. The results were both unexpected and alarming. While standard AI systems managed simpler tasks reasonably well, both standard models and LRMs struggled terribly with more difficult challenges. Even more troubling, as the tasks grew harder, the LRMs appeared to reduce their efforts to reason through problems, almost as if they were giving up.
Experts in the field have reacted strongly to these findings. Gary Marcus, an academic known for his caution about overhyping AI, described the results as ‘pretty devastating.’ He argues that current AI systems, including those behind popular tools like ChatGPT, are still a long way from achieving artificial general intelligence—the ultimate goal where machines can perform any intellectual task a human can. According to Marcus and others, these systems lack the depth of reasoning needed to truly transform society.
So, what does this mean for the future of AI? The Apple study suggests that the current methods of developing these technologies may have reached their limit. Some researchers warn that the industry might be heading down a dead end, unable to advance reasoning abilities further without a completely new approach. This raises serious questions about whether the dream of machines thinking like humans is achievable with today’s tools and techniques. As the debate continues, it becomes clear that the path to creating truly intelligent machines is far more complicated than many had hoped.
