#harmfulness

Arie van Deursen 🇪🇺🇳🇱avandeursen@mastodon.acm.org
2025-08-07

GPT5’s “safe completion” was previously called “safe answering”, and is included in the benchmark we developed to assess the “Harmfulness of Applying Off-the-Shelf Large Language Models to Programming Tasks”.

dl.acm.org/doi/abs/10.1145/372

#gpt5 #safecompletion #harmfulness #fse2025

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst