●researcherChen's framing of the evals crisis and benchmark-maxing problem is worth tracking as it signals where OpenAI sees measurement methodology breaking down.
●founderWorth watching because OpenAI's stated research bets around long-horizon real-world tasks signal where the next capability jumps — and competitive threats — are likely to emerge.