● builder: You can drop PUM into reasoning pipelines as a plug-in prefix scorer for beam search or Best-of-N without task-specific retraining.
● researcher: PUM reframes process reward modeling around outcome-grounded utility rather than step correctness, with released models and data to benchmark against.
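
As a rough illustration of the plug-in usage mentioned for builders, the sketch below shows how any prefix scorer could slot into Best-of-N selection and a single beam-search pruning step. It assumes a hypothetical interface in which the scorer maps a question and a partial or complete solution to one scalar; the names `PrefixScorer`, `best_of_n`, `beam_step`, and `toy_scorer` are illustrative and are not taken from the PUM release. The toy scorer is only a stand-in so the example runs without a model.

```python
from typing import Callable, List, Sequence

# Hypothetical interface (an assumption, not the released PUM API):
# maps (question, partial or complete solution) to a scalar score.
PrefixScorer = Callable[[str, str], float]


def best_of_n(question: str, candidates: Sequence[str], score: PrefixScorer) -> str:
    """Return the highest-scoring candidate; ties go to the earliest one."""
    if not candidates:
        raise ValueError("need at least one candidate")
    return max(candidates, key=lambda c: score(question, c))


def beam_step(
    question: str,
    prefixes: Sequence[str],
    score: PrefixScorer,
    beam_width: int,
) -> List[str]:
    """Keep the `beam_width` best prefixes, ordered from best to worst."""
    ranked = sorted(prefixes, key=lambda p: score(question, p), reverse=True)
    return list(ranked[:beam_width])


def toy_scorer(question: str, prefix: str) -> float:
    # Stand-in for a real model call, NOT PUM: counts occurrences of "42".
    return float(prefix.count("42"))


if __name__ == "__main__":
    samples = ["no idea", "so x = 42", "42, check: 42"]
    print(best_of_n("What is x?", samples, toy_scorer))
    print(beam_step("What is x?", samples, toy_scorer, beam_width=2))
```

Keeping the scorer behind a plain callable means the search code does not depend on how the score is produced, so a real model-backed scorer could replace `toy_scorer` without changes to `best_of_n` or `beam_step`.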