●builderRelevant if building productivity copilots for spreadsheet tools — the benchmark defines the task structure and evaluation protocol you'd need to target.
●researcherFills a data gap for spreadsheet agent research with a realistic online evaluation protocol.