The ollama model for the 8bit-quantized GGUF version of llama3-70b-chinese-chat.

70B

1,808 Pulls Updated 4 months ago

{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>" ], "temperature": 0.6, "top_p": 0.9 }