๐ Paper: https://arxiv.org/abs/2506.15498
๐ค Models: https://huggingface.co/collections/UKPLab/spare-prm
๐ป Code: https://github.com/UKPLab/aaai2026-spare-prm
Follow the authors Imbesat Hassan Rizvi and Iryna Gurevych from the Ubiquitous Knowledge Processing Lab (UKP Lab), Technische Universitรคt Darmstadt and Xiaodan Zhu from the Department of Electrical and Computer Engineering, Smith Engineering and Ingenuity Labs Research Institute at Queen's University.
#AAAI2026 #ProcessSupervision #Reasoning #RewardModelling #ReferenceGuidedEvaluation
























