1,431 3 months ago

3B model that shouldn't be this good - crushes benchmarks through deep chain-of-thought reasoning

b507b9c2f6ca · 13B
{{ .Prompt }}