Research | Aug 6, 2025

OpenAI o3 Sets Record on ARC-AGI Test: Industry Debates Whether AGI Has Arrived

OpenAI's o3 scored 87.5% on the ARC Prize benchmark, surpassing the human average of 85%, a result some researchers consider a milestone for AGI. However, ARC-AGI creator François Chollet pointed out that o3 relied on massive computation, costing $1,000 per problem, which differs fundamentally from efficient human reasoning. The result has reignited academic debate over how AGI should be defined.
