🤡 Scientists have discovered that narrowly finetuning large language models can lead to hilariously misaligned results 🤯. Who knew that stretching a rubber band in one place would make the whole thing snap? 🙄 Bravo to the geniuses who spend years fine-tuning #chaos. 👏
https://arxiv.org/abs/2502.17424 #scientificdiscovery #humor #language_models #misalignment #fine_tuning #HackerNews #ngated