Hugging Face GGUF 模型可视化

Yongqiang Cheng

已于 2025-02-14 19:04:07 修改

阅读量1.6k

点赞数 21

分类专栏： ggml - llama.cpp - whisper.cpp 文章标签： Hugging Face GGUF 模型可视化

于 2025-02-03 20:50:50 首次发布

世上没有白读的书，每一页都算数。

本文链接：https://blog.csdn.net/chengyq116/article/details/145432625

版权

ggml - llama.cpp - whisper.cpp 专栏收录该内容

19 篇文章

订阅专栏

Hugging Face GGUF 模型可视化

1. Finding GGUF files (检索 GGUF 模型)
2. Viewer for metadata & tensors info (可视化 GGUF 模型)
References

无知小儿，仙家雄霸天下，依附强者才是唯一的出路。否则天地虽大，也让你们无路可走！《哪吒 2》

GGUF
https://huggingface.co/docs/hub/gguf

Hugging Face Hub supports all file formats, but has built-in features for GGUF format, a binary format that is optimized for quick loading and saving of models, making it highly efficient for inference purposes.

GGUF
https://github.com/ggerganov/ggml/blob/master/docs/gguf.md

GGUF is designed for use with GGML and other executors. GGUF was developed by Georgi Gerganov who is also the developer of llama.cpp, a popular C/C++ LLM inference framework. Models initially developed in frameworks like PyTorch can be converted to GGUF format for use with those engines.

Georgi Gerganov
https://huggingface.co/ggerganov

在这里插入图片描述

As we can see in this graph, unlike tensor-only file formats like safetensors - which is also a recommended model format for the Hub - GGUF encodes both the tensors and a standardized set of metadata.

Safetensors
https://huggingface.co/docs/safetensors

1. Finding GGUF files (检索 GGUF 模型)

You can browse all models with GGUF files filtering by the GGUF tag: https://huggingface.co/models?library=gguf.

Moreover, you can use https://huggingface.co/spaces/ggml-org/gguf-my-repo tool to convert/quantize your model weights into GGUF weights.

在这里插入图片描述

For example, you can check out TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF for seeing GGUF files in action.

TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF
https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF

在这里插入图片描述

deepseek-ai/DeepSeek-V3
https://huggingface.co/deepseek-ai/DeepSeek-V3

2. Viewer for metadata & tensors info (可视化 GGUF 模型)

The Hub has a viewer for GGUF files that lets a user check out metadata & tensors info (name, shape, precison). The viewer is available on model page & files page.

点击下图绿色矩形框区域图标即可进入 Hugging Face GGUF 模型可视化界面。

TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF
https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF/tree/main

在这里插入图片描述