mannix

llama3.1-8b-abliterated

Abliterated v3 llama-3.1 8b with uncensored prompt

tools

47.8K Pulls 38 Tags Updated 1 year ago

gemma2-9b-simpo

Fine-tuned google/gemma-2-9b-it on princeton-nlp/gemma2-ultrafeedback-armorm with the SimPO objective.

11.3K Pulls 24 Tags Updated 1 year ago

llama3-uncensored

llama3-8b with uncensored GuruBot prompt

9,545 Pulls 1 Tag Updated 2 years ago

defog-llama3-sqlcoder-8b

A capable language model for text to SQL generation for Postgres, Redshift and Snowflake that is on-par with the most capable generalist frontier models.

7,692 Pulls 16 Tags Updated 2 years ago

llama3.1-8b-lexi

This is an uncensored version of Llama 3.1 8B Instruct with an uncensored prompt.

tools

7,373 Pulls 45 Tags Updated 1 year ago

deepseek-coder-v2-lite-instruct

An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.

6,921 Pulls 23 Tags Updated 1 year ago

llamax3-8b-alpaca

LLaMAX is a multilingual language model, developed through continued pre-training on Llama3, and supports over 100 languages

6,764 Pulls 16 Tags Updated 1 year ago

llama3-8b-ablitered-v3

Abliterated v3 llama-3 8b with uncensored prompt

4,259 Pulls 25 Tags Updated 1 year ago

qwen2-57b

Mixture-of-Experts model 57b

3,909 Pulls 18 Tags Updated 1 year ago

hermes-3-llama-3.1-8b

Hermes 3 Llama-3.1 8b Model by NousResearch

tools

2,871 Pulls 22 Tags Updated 1 year ago

dolphin-2.9-llama3-8b

The uncensored Dolphin model based on Mistral that excels at coding tasks. Updated to version 2.9 with llama 3.

2,290 Pulls 2 Tags Updated 2 years ago

llava-phi3

A new small LLaVA model fine-tuned from Phi 3 Mini [I-Quants]

vision

2,014 Pulls 4 Tags Updated 2 years ago

jan-nano

Jan-Nano is a compact 4-billion parameter language model specifically designed and trained for deep research tasks.

tools thinking

1,905 Pulls 4 Tags Updated 11 months ago

qwq-32b-abilterated

QwQ is an experimental research model focused on advancing AI reasoning capabilities. Abliterated with uncensored prompt, i-matrix quants.

tools

1,780 Pulls 17 Tags Updated 1 year ago

qwen2.5-coder

The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing. I-Quant models.

tools

1,485 Pulls 47 Tags Updated 1 year ago

dolphin-2.9.2-qwen2-72b

This model is based on Qwen2-72b. Dolphin-2.9.2 has a variety of instruction, conversational, and coding skills. It also has initial agentic abilities and supports function calling. Dolphin is uncensored.

1,431 Pulls 20 Tags Updated 1 year ago

phi3-mini-4k

Phi-3 Mini is a lightweight 3B state-of-the-art open model by Microsoft. Updated in July 2024.

1,426 Pulls 19 Tags Updated 1 year ago

gemma2-9b-sppo-iter3

This model was developed using Self-Play Preference Optimization at iteration 3, with google/gemma-2-9b-it as the starting point.

1,090 Pulls 24 Tags Updated 1 year ago

smallthinker-abliterated

A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model. I-Quants models, abliterated with uncensored prompt.

789 Pulls 15 Tags Updated 1 year ago

gemma2-9b

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models

786 Pulls 24 Tags Updated 1 year ago

dolphin-mixtral

Uncensored, 8x7b and 8x22b fine-tuned models based on the Mixtral mixture of experts models that excel at coding tasks. Created by Eric Hartford.

781 Pulls 2 Tags Updated 2 years ago

gemma4-98e-v6-coder

The best 20b coding model just got better! Beats the bigger 26b brother in Python and code reasoning

vision tools thinking

660 Pulls 62 Tags Updated yesterday

qwen2-math-7b

Qwen2-Math is a series of specialized math language models built upon the Qwen2 LLMs

653 Pulls 21 Tags Updated 1 year ago

llama3.1-storm

Llama-3.1-Storm-8B outperforms both Llama-3.1-8B-Instruct and Hermes-3-Llama-3.1-8B!

tools

597 Pulls 13 Tags Updated 1 year ago

llama3-groq-tool-8b

A series of models from Groq that represent a significant advancement in open-source AI capabilities for tool use/function calling.

tools

559 Pulls 18 Tags Updated 1 year ago

qwq-32b

QwQ is an experimental research model focused on advancing AI reasoning capabilities. i-matrix quantizations.

tools

552 Pulls 23 Tags Updated 1 year ago

omnimerge-v4-mtp

Qwen/Qwen3.6-27B + 3 Qwen3.6 fine-tunes with MLP-passthrough surgery - MTP quants

551 Pulls 8 Tags Updated 2 weeks ago

internlm2.5-20b

InternLM2.5 has open-sourced a 20 billion parameter base model and a chat model tailored for practical scenarios.

540 Pulls 14 Tags Updated 1 year ago

llama3.1-70b

New state-of-the-art model from Meta available in 8B, 70B and 405B sizes

tools

535 Pulls 34 Tags Updated 1 year ago

deepseek-v2-lite-instruct

A strong, economical, and efficient Mixture-of-Experts language model.

512 Pulls 8 Tags Updated 1 year ago

omnimerge-v4

Qwen/Qwen3.6-27B + 3 Qwen3.6 fine-tunes with MLP-passthrough surgery

498 Pulls 27 Tags Updated 2 weeks ago

llama3-12b

Meta-Llama-3-12B-Instruct is a depth upscaling merge of llama3-8b from M. Labonne

465 Pulls 22 Tags Updated 2 years ago

gemma4-98e-v7-coder

A further improved version of the Gemma-4 98e coder variant, the best 20b coder

vision tools thinking

406 Pulls 57 Tags Updated yesterday

gemma4-98e-v5-coder

Pruned to 98 experts gemma-4 a4b 26b v5-coder. Best 20b coder model overall

tools thinking

394 Pulls 31 Tags Updated 2 weeks ago

llama-3.3

New state-of-the-art 70B model. Llama 3.3 70B offers performance similar to the Llama 3.1 405B model. I-Quants models.

tools

374 Pulls 9 Tags Updated 1 year ago

replete-coder-llama3-8b

Replete-Coder-llama3-8b is a general purpose model that is specially trained in coding in over 100 coding languages.

367 Pulls 10 Tags Updated 1 year ago

smaug-llama3-8b

This model was built using the Smaug recipe for improving performance on real world multi-turn conversations applied to meta-llama/Meta-Llama-3-8B-Instruct.

335 Pulls 22 Tags Updated 1 year ago

hermes-3-llama-3.1-70b

Hermes 3 Llama-3.1 70b Model by NousResearch

tools

330 Pulls 22 Tags Updated 1 year ago

mixtral_7bx2_moe

A high-quality Mixture of Experts (MoE) model with open weights by Mistral AI.

315 Pulls 11 Tags Updated 2 years ago

gemma4-98e-v7-coderx

Gemma-4 98e coder max variant, top notch coding skills at the expense of science knowledge

vision tools thinking

315 Pulls 57 Tags Updated yesterday

gemma2-2b

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models

307 Pulls 11 Tags Updated 1 year ago

smaug-llama3-70b

This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to Meta-Llama-3-70B-Instruct

292 Pulls 9 Tags Updated 1 year ago

qwen2-7b

Qwen2 is the new series of Qwen large language models

291 Pulls 4 Tags Updated 2 years ago

llama3.1-8b

New state-of-the-art model from Meta available in 8B, 70B and 405B sizes.

tools

275 Pulls 41 Tags Updated 1 year ago

llama3-sppo-iter3

Meta Llama-3-8b with Self-Play Preference Optimization for Language Model Alignment at iteration 3

264 Pulls 21 Tags Updated 1 year ago

smaug-llama3-70b-32k

This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to Meta-Llama-3-70B-Instruct, 32k context length

261 Pulls 4 Tags Updated 1 year ago

replete-adapted-llama3-8b

Replete-Adapted-llama3-8b is a general purpose model that is specially trained in coding in over 100 coding languages.

218 Pulls 17 Tags Updated 1 year ago

llama3-gradient

This model extends Llama-3 70B's context length from 8k to over 1m tokens. [I-Quants]

207 Pulls 4 Tags Updated 2 years ago

replete-coder-merged-8b

Replete-Coder-Merged-8b is a general purpose model that is specially trained in coding in over 100 coding languages

203 Pulls 21 Tags Updated 1 year ago

smaug-qwen2-72b

The latest in the Smaug series - a finetune of Qwen2-72B-Instruct

192 Pulls 21 Tags Updated 1 year ago

gemma4-98e-v4

Pruned to 98 experts gemma-4 a4b 26b v4

tools thinking

172 Pulls 30 Tags Updated 2 weeks ago

starling-lm-10.7b

Starling-LM-10.7B-beta, an open large language model (LLM) trained by Reinforcement Learning from AI Feedback (RLAIF)

161 Pulls 13 Tags Updated 2 years ago

eurus-2-7b-prime

Eurus-2-7B-PRIME is trained using the PRIME (Process Reinforcement through IMplicit rEward) method, an open-source solution for online reinforcement learning (RL) with process rewards, to advance the reasoning abilities of language models.

151 Pulls 23 Tags Updated 1 year ago

qwen2-math-1.5b

Qwen2-Math is a series of specialized math language models built upon the Qwen2 LLMs

143 Pulls 7 Tags Updated 1 year ago

gemma4-31b-he1

Pruned/masked heads version of gemma4-31b

tools thinking

131 Pulls 26 Tags Updated 2 weeks ago

nous-hermes2-solar-10.7b

The powerful Solar based model by Nous Research that excels at scientific discussion and coding tasks.

130 Pulls 7 Tags Updated 1 year ago

qwen2-1.5b

Qwen2 is the new series of Qwen large language models

125 Pulls 4 Tags Updated 2 years ago

smallthinker

A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model. I-Quants models.

113 Pulls 14 Tags Updated 1 year ago

wizardlm2

State of the art large language model from Microsoft AI with improved performance on complex chat, multilingual, reasoning and agent use cases. More quantizations.

108 Pulls 2 Tags Updated 2 years ago

discopop-zephyr-7b-gemma

A 7B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets using DiscoPOP

93 Pulls 5 Tags Updated 1 year ago

qwen2-0.5b

Qwen2 is the new series of Qwen large language models

91 Pulls 7 Tags Updated 2 years ago

phi3-mini-cpo-simpo

Phi-3-mini-4K-instruct with CPO-SimPO

90 Pulls 16 Tags Updated 1 year ago

gemma4-98e

Pruned to 98 experts gemma-4 a4b 26b v3

tools thinking

70 Pulls 27 Tags Updated 2 weeks ago

llamax3-8b

LLaMAX is a multilingual language model, developed through continued pre-training on Llama3, and supports over 100 languages.

38 Pulls

alchemistcoder-7b

AlchemistCoder is a series of coding models by InternLM. Tuned from Llama 2.

24 Pulls 3 Tags Updated 2 years ago

llama3-8b-v0.9

MaziyarPanahi/Llama-3-8B-Instruct-v0.9

3 Pulls 2 Tags Updated 2 years ago

Crazy Overclocker, Amateur Coder, AI Pragmatist, Non-sense Lover. osync developer
