-
wizardcoder
State-of-the-art code generation model
Code 7B 13B 33B 34B 88.7K Pulls 67 Tags Updated 8 months ago
-
adrienbrault/nous-hermes2pro-llama3-8b
NousResearch/Hermes-2-Pro-Llama-3-8B
8B 87.7K Pulls 5 Tags Updated 4 months ago
-
stable-code
Stable Code 3B is a coding model with instruct and code completion variants on par with models such as Code Llama 7B that are 2.5x larger.
Code 87.6K Pulls 36 Tags Updated 5 months ago
-
openhermes
OpenHermes 2.5 is a 7B model fine-tuned by Teknium on Mistral with fully open datasets.
7B 86.2K Pulls 35 Tags Updated 8 months ago
-
bakllava
BakLLaVA is a multimodal model consisting of the Mistral 7B base model augmented with the LLaVA architecture.
Vision 7B 84.5K Pulls 17 Tags Updated 9 months ago
-
stablelm2
Stable LM 2 is a state-of-the-art 1.6B and 12B parameter language model trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.
1.6B 12B 77K Pulls 84 Tags Updated 4 months ago
-
qwen2-math
Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperform open-source models and even closed-source models (e.g., GPT-4o) in mathematical capability.
1.5B 7B 72B 74.9K Pulls 52 Tags Updated 2 weeks ago
-
wizard-math
Model focused on math and logic problems
7B 13B 74.2K Pulls 64 Tags Updated 9 months ago
-
neural-chat
A fine-tuned model based on Mistral with good coverage of domain and language.
7B 69.5K Pulls 50 Tags Updated 5 months ago
-
llama3-gradient
This model extends Llama 3 8B's context length from 8k to over 1M tokens.
8B 70B 68.6K Pulls 35 Tags Updated 4 months ago
-
deepseek-llm
An advanced language model crafted with 2 trillion bilingual tokens.
7B 67B 65.1K Pulls 64 Tags Updated 9 months ago
-
phind-codellama
Code generation model based on Code Llama.
Code 34B 62.9K Pulls 49 Tags Updated 8 months ago
-
nous-hermes
General use models based on Llama and Llama 2 from Nous Research.
7B 13B 61.9K Pulls 63 Tags Updated 10 months ago
-
xwinlm
Conversational model based on Llama 2 that performs competitively on various benchmarks.
7B 13B 61.3K Pulls 80 Tags Updated 10 months ago
-
sqlcoder
SQLCoder is a code completion model fine-tuned on StarCoder for SQL generation tasks.
Code 7B 15B 70B 61.3K Pulls 48 Tags Updated 10 months ago
-
dolphincoder
7B and 15B uncensored variants of the Dolphin model family that excel at coding, based on StarCoder2.
Code 7B 60.8K Pulls 35 Tags Updated 5 months ago
-
yarn-llama2
An extension of Llama 2 that supports a context of up to 128k tokens.
7B 13B 58.5K Pulls 67 Tags Updated 10 months ago
-
mistral-large
Mistral Large 2 is Mistral's new flagship model, significantly more capable in code generation, mathematics, and reasoning, with a 128k context window and support for dozens of languages.
Tools 123B 57.3K Pulls 17 Tags Updated 8 weeks ago
-
Duggles/meta-llama3.1-instruct-uncensored
8B 57K Pulls 1 Tag Updated 6 weeks ago
-
wizardlm
General use model based on Llama 2.
7B 13B 30B 56.2K Pulls 73 Tags Updated 5 months ago
-
llama3-chatqa
A model from NVIDIA based on Llama 3 that excels at conversational question answering (QA) and retrieval-augmented generation (RAG).
8B 70B 55.4K Pulls 35 Tags Updated 4 months ago
-
starling-lm
Starling is a large language model trained with reinforcement learning from AI feedback, focused on improving chatbot helpfulness.
7B 53.3K Pulls 36 Tags Updated 9 months ago
-
smollm
🪐 A family of small models with 135M, 360M, and 1.7B parameters, trained on a new high-quality dataset.
52.5K Pulls 94 Tags Updated 4 weeks ago
-
falcon Archive
A large language model built by the Technology Innovation Institute (TII) for use in summarization, text generation, and chatbots.
7B 40B 180B 50.8K Pulls 38 Tags Updated 11 months ago
-
moondream
moondream2 is a small vision language model designed to run efficiently on edge devices.
Vision 50.7K Pulls 18 Tags Updated 4 months ago