Configurable Safety Reward Model Hits 94.6% F1 on CoSApien Benchmark | HACKOBAR_