vanilj

Phi-4

Microsoft's Phi 4 model

83.9K Pulls 5 Tags Updated 1 year ago

phi-4-unsloth

The Phi 4 model with fixed tokenizer from Unsloth

6,829 Pulls 8 Tags Updated 11 months ago

mistral-nemo-12b-celeste-v1.9

2,338 Pulls 6 Tags Updated 1 year ago

supernova-medius

Arcee-SuperNova-Medius is a 14B parameter language model developed by Arcee.ai, built on the Qwen2.5-14B-Instruct architecture.

tools

2,219 Pulls 22 Tags Updated 1 year ago

midnight-miqu-70b-v1.5

Midnight-Miqu-70B-v1.5-GGUF Q4_K_S & Q4_K_M

1,648 Pulls 2 Tags Updated 1 year ago

palmyra-fin-70b-32k

Palmyra-Fin-70B-32K is a model built by Writer specifically to meet the needs of the financial industry. It is a leading LLM on financial benchmarks, outperforming other large language models in various financial tasks and evaluations.

1,358 Pulls 7 Tags Updated 1 year ago

gemma-2-ataraxy-9b

Made from Gemma 2 9B SPPO iter3 and SimPO

689 Pulls 19 Tags Updated 1 year ago

smaug-llama-3-70b-instruct

This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to meta-llama/Meta-Llama-3-70B-Instruct.

563 Pulls 11 Tags Updated 1 year ago

qwen2.5-coder-32b-instruct-iq4_xs

Perfect size for 24GB GPUs!

tools

528 Pulls 1 Tag Updated 1 year ago

llama-3-8b-instruct-coder-v2

Llama-3-8B-Instruct-Coder-v2

522 Pulls 9 Tags Updated 1 year ago

llama-3.1-70b-instruct-lorablated-iq2_xs

tools

506 Pulls 1 Tag Updated 1 year ago

hermes-3-llama-3.1-8b

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board.

tools

435 Pulls 5 Tags Updated 1 year ago

theia-21b-v1

An upscaled NeMo with half its layers trained

360 Pulls 7 Tags Updated 1 year ago

reflection-70b-iq2_xxs

Reflection Llama-3.1 70B is (currently) the world's top open-source LLM, trained with a new technique called Reflection-Tuning that teaches a LLM to detect mistakes in its reasoning and correct course.

335 Pulls 1 Tag Updated 1 year ago

mistral-nemo-gutenberg-12b-v2

axolotl-ai-co/romulus-mistral-nemo-12b-simpo finetuned on jondurbin/gutenberg-dpo-v0.1

tools

311 Pulls 1 Tag Updated 1 year ago

calme-2.4-rys-78b

This model is a fine-tuned version of the dnhkng/RYS-XLarge, pushing the boundaries of natural language understanding and generation even further.

tools

304 Pulls 2 Tags Updated 1 year ago

qwq-32b-iq4_xs

IQ4_XS quant of Qwen/QwQ-32B

tools

247 Pulls 4 Tags Updated 9 months ago

llama-3-8b-instruct-32k-v0.1

Llama 3 8b 32k

234 Pulls 11 Tags Updated 1 year ago

llama-3.1-instruct-bellman-8b-swedish

This version of bellman is finetuned from llama-3.1-instruct-8b. It's finetuned for prompt question answering, based on a dataset created from Swedish wikipedia, with a lot of Sweden-centric questions.

tools

219 Pulls 3 Tags Updated 1 year ago

tess-v2.5-qwen2-72b

Tess-v2.5 (Qwen2-72B) was fine-tuned over the newly released Qwen2-72B base, using the Tess-v2.5 dataset that contain 300K samples spanning multiple topics.

191 Pulls 3 Tags Updated 1 year ago

llama3.1-70b-iquants

Llama 3.1 70b IQs: IQ1_M, IQ2_M, IQ2_S, IQ2_XS, IQ2_XXS, IQ3_XS, IQ4_XS

tools

168 Pulls 8 Tags Updated 1 year ago

llama-3-14b-instruct-v1

Self-merge Llama 3 14B Instruct

156 Pulls 2 Tags Updated 1 year ago

einstein-v6.1-llama3-8b

Weyaxi/Einstein-v6.1-Llama3-8B

144 Pulls 11 Tags Updated 1 year ago

qwen2.5-32b-instruct_iq4_xs

Qwen2.5 32B Instruct IQ4_XS

tools

136 Pulls 1 Tag Updated 1 year ago

llama-3-peach-instruct-4x8b-moe

This is a experimental 4x8B Llama 3 MoE

134 Pulls 2 Tags Updated 1 year ago

command-r-08-2024-q4_k_m

Command R is a Large Language Model optimized for conversational interaction and long context tasks.

98 Pulls 1 Tag Updated 1 year ago

athene-v2-chat-iq3_xs

Athene-V2-Chat-IQ3_XS

tools

93 Pulls 1 Tag Updated 1 year ago

orca-llama-3-8b-instruct

Orca-Llama-3-8B-Instruct-DPO

85 Pulls 2 Tags Updated 1 year ago

una-simplesmaug-34b-v1beta

UNA SimpleSmaug 34b v1beta Q4_K_M GGUF

75 Pulls 1 Tag Updated 1 year ago

qwen2.5-14b-instruct-iq4_xs

Qwen2.5 14B Instruct IQ4_XS

tools

72 Pulls 1 Tag Updated 1 year ago

llama-3-magenta-instruct-4x8b-moe

This is a experimental 4x8B Llama 3 MoE

61 Pulls 1 Tag Updated 1 year ago

mixtral_34bx2_moe_60b

Mixtral_34Bx2_MoE_60B GGUF Q4_K_M

57 Pulls 1 Tag Updated 1 year ago

trinity-2-codestral-22b-v0.2

Trinity is a coding specific Large Language Model series created by Migel Tissera.

54 Pulls 2 Tags Updated 1 year ago

cathallama-70b-i1-iq2_s

Perfect for 24GB cards

tools

42 Pulls 1 Tag Updated 1 year ago

qwen2.5-72b-instruct-iq3_xxs

Qwen2.5 is the latest series of Qwen large language models.

tools

28 Pulls 1 Tag Updated 1 year ago

mistral-large-instruct-2407-iq3_xx

tools

26 Pulls 1 Tag Updated 1 year ago

rys-xlarge-iq3_xs

This is a new kind of model optimization. This model is based on MaziyarPanahi/calme-2.1-qwen2-72b, which was tuned from Qwen2-72B.

26 Pulls 1 Tag Updated 1 year ago

command-r-08-2024:q4_k_m

Command R is a Large Language Model optimized for conversational interaction and long context tasks.

Hi. RDson@🤗

Phi-4

phi-4-unsloth

mistral-nemo-12b-celeste-v1.9

supernova-medius

midnight-miqu-70b-v1.5

palmyra-fin-70b-32k

gemma-2-ataraxy-9b

smaug-llama-3-70b-instruct

qwen2.5-coder-32b-instruct-iq4_xs

llama-3-8b-instruct-coder-v2

llama-3.1-70b-instruct-lorablated-iq2_xs

hermes-3-llama-3.1-8b

theia-21b-v1

reflection-70b-iq2_xxs

mistral-nemo-gutenberg-12b-v2

calme-2.4-rys-78b

qwq-32b-iq4_xs

llama-3-8b-instruct-32k-v0.1

llama-3.1-instruct-bellman-8b-swedish

tess-v2.5-qwen2-72b

llama3.1-70b-iquants

llama-3-14b-instruct-v1

einstein-v6.1-llama3-8b

qwen2.5-32b-instruct_iq4_xs

llama-3-peach-instruct-4x8b-moe

command-r-08-2024-q4_k_m

athene-v2-chat-iq3_xs

orca-llama-3-8b-instruct

una-simplesmaug-34b-v1beta

qwen2.5-14b-instruct-iq4_xs

llama-3-magenta-instruct-4x8b-moe

mixtral_34bx2_moe_60b

trinity-2-codestral-22b-v0.2

cathallama-70b-i1-iq2_s

qwen2.5-72b-instruct-iq3_xxs

mistral-large-instruct-2407-iq3_xx

rys-xlarge-iq3_xs

command-r-08-2024:q4_k_m