The ollama model for the 4bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-4bit).

8B

5,316 Pulls Updated 4 months ago

2 Tags
b4a46dbb319f • 4.7GB • Updated 4 months ago
94533a9e86f5 • 4.7GB • Updated 4 months ago