🤗 zhengr/MixTAO-7Bx2-MoE-v8.1
13B · 57 Pulls · Updated 5 months ago
5ce295349de7 · 7.8GB
model
arch llama · parameters 12.9B · quantization Q4_K_M · 7.8GB
params · 75B
{"num_ctx": 32768, "stop": ["### Response:", "### Instruction:", "### Input:"]}
template · 123B
{{ if .System }}### Instruction:
{{ .System }}{{ end }}
{{ if .Prompt }}### Input:
{{ .Prompt }}{{ end }}
### Response:
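The params and template above are baked into the model, so a plain request already uses the 32K context window, the three stop strings, and the Alpaca-style Instruction/Input/Response prompt format. As a rough sketch (assuming the ollama Python package is installed and the model has already been pulled under the tag shown on this page), a call could look like:

```python
# Minimal sketch using the ollama Python package ("pip install ollama"); this is an
# illustration, not part of the model card itself.
import ollama

response = ollama.generate(
    model="zhengr/MixTAO-7Bx2-MoE-v8.1",           # tag as shown on this page; adjust if your local tag differs
    system="You are a concise assistant.",          # filled into the "### Instruction:" slot of the template
    prompt="Explain mixture-of-experts routing.",   # filled into the "### Input:" slot of the template
    options={"num_ctx": 32768},                     # matches the default params; shown only to make the setting explicit
)
print(response["response"])
```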
Readme
zhengr/MixTAO-7Bx2-MoE-v8.1
Credits to the Hugging Face user zhengr (https://huggingface.co/zhengr) for the model.
I chose this model for my local Ollama setup because of its Open LLM Leaderboard score (given below).
Original GGUF
https://huggingface.co/zhengr/MixTAO-7Bx2-MoE-v8.1-GGUF
Original model
https://huggingface.co/zhengr/MixTAO-7Bx2-MoE-v8.1
MixTAO-7Bx2-MoE is a Mixture of Experts (MoE) model. It is mainly used for large-model technology experiments; successive, increasingly refined iterations are intended to produce a high-quality large language model.
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
| Metric                            | Value |
|-----------------------------------|-------|
| Avg.                              | 77.50 |
| AI2 Reasoning Challenge (25-Shot) | 73.81 |
| HellaSwag (10-Shot)               | 89.22 |
| MMLU (5-Shot)                     | 64.92 |
| TruthfulQA (0-shot)               | 78.57 |
| Winogrande (5-shot)               | 87.37 |
| GSM8k (5-shot)                    | 71.11 |
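For reference, the Avg. value is the arithmetic mean of the six benchmark scores: (73.81 + 89.22 + 64.92 + 78.57 + 87.37 + 71.11) / 6 = 77.50.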