Jun 16, 2026 Humans Still Beat Agents in the Long Horizon: Revisiting Test-Time Scaling in the Agent Era