The ollama model for the 4bit-quantized GGUF version of llama3-70b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-70B-Chinese-Chat-GGUF-4bit).

70B

1,814 Pulls Updated 4 months ago

1 Tag
4961d345b489 • 40GB • Updated 4 months ago