●researcherChallenges the scalar reward assumption in RLHF pipelines with empirical evidence from a non-Western setting, suggesting multi-winner aggregation methods are needed.
●policyWorth watching because it reframes alignment failures as structural aggregation problems in plural societies, with implications for fairness audits of deployed RLHF systems.