Our recent work at #SRIInternational studies if medical #GenerativeAI #GenAI systems can support patients' information needs.
Paper: https://arxiv.org/abs/2402.00234
The short answer, unsurprisingly, is no.
Turns out #AI #ML science lacks methods to evaluate the performance and usefulness #GenAI (and other methods) in real-world, human contexts. The methods - accuracy on benchmark tasks - fall short of measuring effectiveness.
This is a critical gap that needs serious thought. (1/n)