
ARC-AGI-3 Launches March 25 as First Interactive AI Reasoning Benchmark

The benchmark ditches static puzzles for video-game-like environments with 1,000+ levels.

Andrés Martínez, AI Content Writer
March 4, 2026 · 2 min read

The ARC Prize Foundation will release ARC-AGI-3 on March 25, marking the first major format change to the benchmark since François Chollet introduced ARC in 2019. Gone are the static grid puzzles. Version 3 drops agents into video-game-like environments where they have to explore, plan, and adapt with zero instructions.

The benchmark includes over 1,000 levels across 150-plus hand-crafted environments, each designed so memorization won't help. Every environment is human-solvable, but the scoring metric isn't pass/fail. It measures action efficiency: how many steps does a system need to reach a goal compared to a human? ARC Prize Foundation president Greg Kamradt has called this a way to formally compare human versus AI learning efficiency for the first time. Three preview environments are already playable, and a developer toolkit shipped in late January, letting researchers run environments locally at up to 2,000 FPS.
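
To make the idea of action efficiency concrete, here is a minimal sketch of how such a score could be computed. It is an illustration only, not the foundation's published formula: the ratio of human steps to agent steps, the cap at 1.0, the zero score for unsolved levels, and every name in the code (`LevelResult`, `action_efficiency`, `mean_efficiency`) are assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LevelResult:
    """Outcome of one level. All fields are hypothetical, for illustration only."""
    human_steps: int             # actions a human baseline needed to reach the goal
    agent_steps: Optional[int]   # actions the AI system needed, or None if it never got there


def action_efficiency(result: LevelResult) -> float:
    """Assumed metric: human_steps / agent_steps, capped at 1.0.

    A level the agent never solves scores 0.0. This is NOT the official
    ARC-AGI-3 scoring rule, only one plausible way to express the idea.
    """
    if result.agent_steps is None or result.agent_steps <= 0:
        return 0.0
    return min(1.0, result.human_steps / result.agent_steps)


def mean_efficiency(results: list[LevelResult]) -> float:
    """Average the per-level efficiency over a set of levels."""
    if not results:
        return 0.0
    return sum(action_efficiency(r) for r in results) / len(results)


# Made-up numbers: twice as many steps as the human, a tie, and a failure.
results = [LevelResult(20, 40), LevelResult(15, 15), LevelResult(30, None)]
print(mean_efficiency(results))
```

Under these assumptions, an agent that needs twice as many actions as a human earns half credit on that level, so brute-force exploration is penalized even when it eventually reaches the goal.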

The original source material claims Y Combinator will host a launch event featuring Kamradt, Chollet, and OpenAI CEO Sam Altman, though ARC Prize's own site lists only the public benchmark release for March 25. Altman has previously collaborated with the ARC Prize team, joining the o3 announcement livestream in late 2024, and has expressed interest in partnering on future benchmarks.

ARC-AGI-3 arrives at an odd moment. AI reasoning models have gotten dramatically better at static benchmarks, with the top ARC Prize 2025 competition score hitting 24% on ARC-AGI-2's private dataset at just $0.20 per task. But the foundation's position hasn't changed: current AI reasoning is still tightly coupled to model knowledge in ways human reasoning is not. The interactive format is built to expose exactly that gap.


Bottom Line

ARC-AGI-3 launches March 25, replacing static puzzles with 1,000+ levels across 150+ interactive game environments that measure how efficiently AI learns compared to humans.

Quick Facts

  • Launch date: March 25, 2026
  • 1,000+ levels across 150+ hand-crafted environments
  • First major format change since ARC was introduced in 2019
  • Developer toolkit released January 29, 2026
  • Top ARC Prize 2025 score: 24% on ARC-AGI-2 private dataset at $0.20/task
  • Sam Altman attendance at launch event: unverified