Experimental model doing a DPO training on top of Kunoichi-DPO-v2-7b, i.e. double-DPO.
7B
152 Pulls Updated 3 months ago
1 Tag
4742d588810c • 7.7GB •
Updated 3 months ago