●builderYou should audit multi-agent system designs that assume cross-family model diversity ensures behavioral variance — post-training recipe is the stronger lever.
●researcherThe 940K-chain corpus and hedging metric provide a validated framework for studying interactive multi-LLM behavior beyond offline preference studies.