#LMEval

2025-06-05

Introducing #LMEval – a tool that helps AI researchers & developers compare the performance of different #LLMs.

Designed to be accurate, multimodal, and easy to use, LMEval has already been used to evaluate major models in terms of safety and security.

Dive deeper: bit.ly/3T7fgfk

#AI #opensource #Google #InfoQ

KINEWS24KiNews
2025-06-02

🔍 Was ist Google LMEval? Entdecke das neue KI-Test-Framework!

Einheitliche Modellbewertung
Multimodal & anbieterübergreifend
Effiziente, inkrementelle Tests

Jetzt LIKEN, teilen, LESEN und FOLGEN!

kinews24.de/google-lmeval-llms

2025-05-15

At Giskard, we've integrated LMEval into our Phare LLM benchmark (phare.giskard.ai) to independently evaluate popular models' security and safety dimensions - through rigorous testing.

Read the announcement: opensource.googleblog.com/2025

#LMEval #AISecurity #LLMEvaluation #OpenSource

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst