wangshenzhi / llama3-70b-chinese-chat-ollama-q8

The ollama model for the 8bit-quantized GGUF version of llama3-70b-chinese-chat.

70B

1,808 Pulls Updated 4 months ago

params

50020e23ef83 · 126B

{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>" ], "temperature": 0.6, "top_p": 0.9 }