Llama2 Chat 7B - GGUF
- Model creator: Meta
- Original model: Llama 2 7b Chat GGML
<!-- README_GGUF.md-about-gguf start -->
About GGUF
GGUF is a new format introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp.
The key benefit of GGUF is that it is a extensible, future-proof format which stores more information about the model as metadata. It also includes significantly improved tokenization code, including for the first time full support for special tokens. This should improve performance, especially with models that use new special tokens and implement custom prompt templates.