AlphaMonarch-7B is a new DPO merge that retains all the reasoning abilities of the very best merges and significantly improves its conversational abilities.

7B

452 Pulls Updated 7 months ago

3b1bc934d80a · 74B
{ "num_ctx": 8192, "stop": [ "<|im_start|>", "<|im_end|>" ] }