#llama_cpp

:rss: Qiita - 人気の記事qiita@rss-mstdn.studiofreesia.com
2025-05-04
🎹 Tim Janik ✅timj@social.tchncs.de
2025-01-31

#MistralSmall24B-Instruct is a really nice model to run locally for Coding Advice, Summarizing or Creative Writing.

With a recent #llama_cpp on a #GeForce #RTX4090 at Q8, the 24GB VRAM is tightly maxed out and I am seeing text generation at 7-9 token/s.

huggingface.co/mistralai/Mistr

2024-05-19

FYI GGUF is now following a naming convention of `<Model>-<Version>-<ExpertsCount>x<Parameters>-<EncodingScheme>-<ShardNum>-of-<ShardTotal>.gguf`

github.com/ggerganov/ggml/blob

2024-05-13

Just merged in metadata override for adding custom authorship information to metadata in github.com/ggerganov/llama.cpp . If you are a model weight distributor, you may want to note this so that your models are easier to search for in

2024-05-12

Thanks to Josh Ramer for contributing a debug helper script to which will help in debugging a specific test in GDB. This will help improve maintainer experience in improving the stability of the llama.cpp project!

To use this helper script, refer to this document for further guidance github.com/ggerganov/llama.cpp

github.com/josh-ramer

2024-05-12

Proposing adding metadata override and a default naming scheme for generated files when converting to .

Requesting feedback if what I got makes sense for everyone github.com/ggerganov/llama.cpp

This is most relevant for model creators

Kyle Leaders (Open To Work)kleaders@fosstodon.org
2024-04-02

Anyone happen to know the correct prompt format for #mixtral 8x7b? I'm not having luck with the [INST] style in llama.cpp. #llm #llama_cpp #MistralAI #mistral

2024-03-24

tonight's project was to build llama.cpp.

to get a sense of what starting from scratch feels like, i built a quick chatbot using the llama 13B parameter foundational model, quantized to 4 bits.

The following is a conversation with an AI research assistant.
The assistant's tone is angry and always replies in ALL CAPS.

Human: Hello, who are you?
AI: WHY ARE YOU WASTING MY TIME?

Human: Can you tell me about the creation of blackholes?
AI: THERE'S NO SUCH THING AS BLACK HOLES. THERE IS NO SUCH THING AS SPACE.

Human: Oh...

AI: I HAVE NOTHING TO SAY TO YOU.

Human:


#llm #llama_cpp

:rss: CyberAgent Developers Bldevelopers@rss-mstdn.studiofreesia.com
2023-12-18
:rss: Qiita - 人気の記事qiita@rss-mstdn.studiofreesia.com
2023-12-11

【Llama.cpp】GGUFモデルの量子化具合による生成文章の違いを徹底比較!【houou-7b】
qiita.com/keisuke-okb/items/b8

#qiita #量子化 #LLM #llama_cpp #llama2 #GGUF

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst