Hi. RDson@🤗
-
midnight-miqu-70b-v1.5
Midnight-Miqu-70B-v1.5-GGUF Q4_K_S & Q4_K_M
70B661 Pulls 2 Tags Updated 5 months ago
-
smaug-llama-3-70b-instruct
This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to meta-llama/Meta-Llama-3-70B-Instruct.
70B488 Pulls 11 Tags Updated 4 months ago
-
mistral-nemo-12b-celeste-v1.9
12B456 Pulls 6 Tags Updated 7 weeks ago
-
llama-3-8b-instruct-coder-v2
Llama-3-8B-Instruct-Coder-v2
8B342 Pulls 9 Tags Updated 4 months ago
-
reflection-70b-iq2_xxs
Reflection Llama-3.1 70B is (currently) the world's top open-source LLM, trained with a new technique called Reflection-Tuning that teaches a LLM to detect mistakes in its reasoning and correct course.
70B196 Pulls 1 Tag Updated 2 weeks ago
-
hermes-3-llama-3.1-8b
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board.
Tools 8B192 Pulls 5 Tags Updated 5 weeks ago
-
theia-21b-v1
An upscaled NeMo with half its layers trained
190 Pulls 7 Tags Updated 6 weeks ago
-
llama-3-8b-instruct-32k-v0.1
Llama 3 8b 32k
8B185 Pulls 11 Tags Updated 5 weeks ago
-
llama-3.1-70b-instruct-lorablated-iq2_xs
Tools 70B174 Pulls 1 Tag Updated 5 weeks ago
-
einstein-v6.1-llama3-8b
Weyaxi/Einstein-v6.1-Llama3-8B
8B120 Pulls 11 Tags Updated 4 months ago
-
tess-v2.5-qwen2-72b
Tess-v2.5 (Qwen2-72B) was fine-tuned over the newly released Qwen2-72B base, using the Tess-v2.5 dataset that contain 300K samples spanning multiple topics.
104 Pulls 3 Tags Updated 3 months ago
-
llama-3-peach-instruct-4x8b-moe
This is a experimental 4x8B Llama 3 MoE
103 Pulls 2 Tags Updated 4 months ago
-
gemma-2-ataraxy-9b
Made from Gemma 2 9B SPPO iter3 and SimPO
9B97 Pulls 19 Tags Updated 2 weeks ago
-
llama3.1-70b-iquants
Llama 3.1 70b IQs: IQ1_M, IQ2_M, IQ2_S, IQ2_XS, IQ2_XXS, IQ3_XS, IQ4_XS
Tools 70B69 Pulls 8 Tags Updated 5 weeks ago
-
orca-llama-3-8b-instruct
Orca-Llama-3-8B-Instruct-DPO
8B64 Pulls 2 Tags Updated 5 months ago
-
llama-3-magenta-instruct-4x8b-moe
This is a experimental 4x8B Llama 3 MoE
60 Pulls 1 Tag Updated 4 months ago
-
una-simplesmaug-34b-v1beta
UNA SimpleSmaug 34b v1beta Q4_K_M GGUF
34B58 Pulls 1 Tag Updated 5 months ago
-
llama-3-14b-instruct-v1
Self-merge Llama 3 14B Instruct
55 Pulls 2 Tags Updated 5 months ago
-
mixtral_34bx2_moe_60b
Mixtral_34Bx2_MoE_60B GGUF Q4_K_M
35 Pulls 1 Tag Updated 5 months ago
-
calme-2.4-rys-78b
This model is a fine-tuned version of the dnhkng/RYS-XLarge, pushing the boundaries of natural language understanding and generation even further.
Tools34 Pulls 2 Tags Updated 9 days ago
-
palmyra-fin-70b-32k
Palmyra-Fin-70B-32K is a model built by Writer specifically to meet the needs of the financial industry. It is a leading LLM on financial benchmarks, outperforming other large language models in various financial tasks and evaluations.
70B28 Pulls 7 Tags Updated 5 weeks ago
-
command-r-08-2024-q4_k_m
Command R is a Large Language Model optimized for conversational interaction and long context tasks.
20 Pulls 1 Tag Updated 3 weeks ago
-
cathallama-70b-i1-iq2_s
Perfect for 24GB cards
Tools 70B19 Pulls 1 Tag Updated 5 weeks ago
-
rys-xlarge-iq3_xs
This is a new kind of model optimization. This model is based on MaziyarPanahi/calme-2.1-qwen2-72b, which was tuned from Qwen2-72B.
18 Pulls 1 Tag Updated 2 weeks ago
-
mistral-nemo-gutenberg-12b-v2
axolotl-ai-co/romulus-mistral-nemo-12b-simpo finetuned on jondurbin/gutenberg-dpo-v0.1
Tools 12B16 Pulls 1 Tag Updated 2 weeks ago
-
mistral-large-instruct-2407-iq3_xx
Tools 123B9 Pulls 1 Tag Updated 2 weeks ago
-
trinity-2-codestral-22b-v0.2
Trinity is a coding specific Large Language Model series created by Migel Tissera.
22B5 Pulls 2 Tags Updated 4 days ago
-
qwen2.5-72b-instruct-iq3_xxs
Qwen2.5 is the latest series of Qwen large language models.
Tools 72B1 Pull 1 Tag Updated 2 days ago
-
command-r-08-2024:q4_k_m
Command R is a Large Language Model optimized for conversational interaction and long context tasks.
Updated 3 weeks ago