Home AI GPT-5.2 Outperforms Humans in Exams: OpenAI Cautions About the Risks of Advanced...

GPT-5.2 Outperforms Humans in Exams: OpenAI Cautions About the Risks of Advanced AI Capabilities

0

OpenAI’s latest system, Poetiq (GPT-5.2X-High), has achieved a remarkable 75% accuracy on the ARC-AGI-2 benchmark, significantly surpassing the previous state-of-the-art by 15 percentage points. This benchmark tests AI’s abstract reasoning capabilities rather than just statistical pattern recognition. Poetiq’s architecture emphasizes system-level integration over simple model expansion, highlighting the importance of software design in enhancing AI performance without additional training. OpenAI’s co-founder Greg Brockman noted that while large models show potential, they often remain underutilized, leading to a “capability overhang.” This suggests that the true value of AI lies not only in model improvement but also in effective human-machine collaboration within real-world applications. With the ARC-AGI-2 results, the conversation shifts toward optimizing AI usage and integrating it meaningfully into daily life, signaling a new competitive landscape focused on systems and processes rather than solely model parameters.

For more details, refer to the original 36Kr article.

Source link

NO COMMENTS

Exit mobile version