CUSP Benchmark: LLMs Predict AI Benchmark Progress Well but Fail on Biology and Physics Breakthroughs | HACKOBAR_