●builderYou cannot reliably use KLD or perplexity as a cheap proxy to pick between near-baseline quantizations — run actual downstream benchmarks instead.
●researcherWorth watching because it systematically invalidates a common evaluation shortcut across 69 quant configurations and 14 metric variants.