#AiModelEvaluation

LBHuston @lbhuston
2025-04-30

Logical Reasoning & Critical Thinking (15%) – Does it demonstrate good reasoning and avoid fallacies?

Read more 👉 lttr.ai/AeNpS

LBHuston @lbhuston
2025-03-03

Logical reasoning was strong on technical and philosophical topics.

Read more 👉 lttr.ai/Ab7cS

LBHuston @lbhuston
2025-02-14

Reduce factual errors (particularly in history and technical explanations).

Read more 👉 lttr.ai/AbYrK

LBHuston @lbhuston
2025-02-07

I wanted to compare this against my earlier review of the same model using the Llama framework. As you can see, I also implemented a more formal testing system.

Read more 👉 lttr.ai/AbKgf

LBHuston @lbhuston
2025-02-03

This wasn’t just a casual test. I ran the model through a structured evaluation framework that assigns letter grades and a final weighted score based on the following

Read more 👉 lttr.ai/AbBZa
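
For readers curious how letter grades can roll up into a final weighted score, here is a minimal sketch in Python. It is not the author's actual framework: the 4.0-scale grade mapping, the function names, and the "other" category are all assumptions made for illustration. Only the 15% weight for Logical Reasoning & Critical Thinking comes from the posts in this feed.

```python
# Hypothetical grade-to-points mapping on a 4.0 scale.
# The original posts do not say how letter grades are converted to numbers.
GRADE_POINTS = {"A": 4.0, "B": 3.0, "C": 2.0, "D": 1.0, "F": 0.0}


def weighted_score(grades, weights):
    """Return the weighted average of per-category letter grades (0.0 to 4.0)."""
    if set(grades) != set(weights):
        raise ValueError("grades and weights must cover the same categories")
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    points = sum(GRADE_POINTS[grades[cat]] * weights[cat] for cat in grades)
    return points / total_weight


def to_letter(score):
    """Map a numeric score back to the nearest letter grade."""
    return min(GRADE_POINTS, key=lambda g: abs(GRADE_POINTS[g] - score))


# Example: the 15% reasoning weight is from the feed; "other" is a placeholder
# standing in for the remaining, unspecified categories.
weights = {"reasoning": 0.15, "other": 0.85}
grades = {"reasoning": "A", "other": "B"}
score = weighted_score(grades, weights)
print(f"{score:.2f} -> {to_letter(score)}")
```

Dividing by the total weight means the weights do not have to sum to exactly 1, which is convenient while a rubric is still being adjusted.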

LBHuston @lbhuston
2025-01-31

Model Review: DeepSeek-R1-Distill-Qwen-7B on M1 Mac (LMStudio API Test): lttr.ai/Aa8Bi
