"Self-Steering Language Models"
This paper introduces DisCIPL, a method for "self-steering" LMs (Language Model) where a Planner model generates a task-specific inference program that is executed by a population of Follower models. Our approach equips LMs with the ability to write recursive search procedures that guide LM inference