Introducing #LMEval – a tool that helps AI researchers & developers compare the performance of different #LLMs.
Designed to be accurate, multimodal, and easy to use, LMEval has already been used to evaluate major models in terms of safety and security.
Dive deeper: https://bit.ly/3T7fgfk