-
everythinglm
Uncensored Llama2 based model with support for a 16K context window.
13B26.8K Pulls 18 Tags Updated 8 months ago
-
aiden_lu/minicpm-v2.6
MiniCPM-V 2.6 is the latest and most capable model in the MiniCPM-V series. It exhibits a significant performance improvement over MiniCPM-Llama3-V 2.5
Vision 7B26.5K Pulls 1 Tag Updated 5 weeks ago
-
tyllama/kevin
Kevin is an advanced AI-powered app designed to offer a personalized conversational experience. By leveraging state-of-the-art technologies in natural language processing, machine learning, and speech recognition, Kevin aims to understand and learn
8B25.9K Pulls 1 Tag Updated 5 months ago
-
magicoder
🎩 Magicoder is a family of 7B parameter models trained on 75K synthetic instruction data using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets.
Code 7B24.2K Pulls 18 Tags Updated 9 months ago
-
llama3-groq-tool-use
A series of models from Groq that represent a significant advancement in open-source AI capabilities for tool use/function calling.
Tools 8B 70B23.8K Pulls 33 Tags Updated 2 months ago
-
stablelm-zephyr
A lightweight chat model allowing accurate, and responsive output without requiring high-end hardware.
23.6K Pulls 17 Tags Updated 9 months ago
-
codebooga
A high-performing code instruct model created by merging two existing code models.
Code 34B23.1K Pulls 16 Tags Updated 10 months ago
-
mistrallite
MistralLite is a fine-tuned model based on Mistral with enhanced capabilities of processing long contexts.
7B21.9K Pulls 17 Tags Updated 10 months ago
-
znbang/bge
BAAI General Embedding
Embedding 33M21.1K Pulls 16 Tags Updated 6 months ago
-
falcon2
Falcon2 is an 11B parameters causal decoder-only model built by TII and trained over 5T tokens.
11B20.9K Pulls 17 Tags Updated 4 months ago
-
wizard-vicuna
Wizard Vicuna is a 13B parameter model based on Llama 2 trained by MelodysDreamj.
13B20.2K Pulls 17 Tags Updated 11 months ago
-
duckdb-nsql
7B parameter text-to-SQL model made by MotherDuck and Numbers Station.
Code 7B20.2K Pulls 17 Tags Updated 7 months ago
-
megadolphin
MegaDolphin-2.2-120b is a transformation of Dolphin-2.2-70b created by interleaving the model with itself.
18.9K Pulls 19 Tags Updated 8 months ago
-
notux
A top-performing mixture of experts model, fine-tuned with high-quality data.
8x7B18.1K Pulls 18 Tags Updated 8 months ago
-
goliath
A language model created by combining two fine-tuned Llama 2 70B models into one.
17.8K Pulls 16 Tags Updated 10 months ago
-
salmatrafi/acegpt
AceGPT: Aligning Large Language Models with Local (Arabic) Values
7B 13B17.7K Pulls 2 Tags Updated 5 months ago
-
open-orca-platypus2
Merge of the Open Orca OpenChat model and the Garage-bAInd Platypus 2 model. Designed for chat and code generation.
13B17.6K Pulls 17 Tags Updated 13 months ago
-
notus
A 7B chat model fine-tuned with high-quality data and based on Zephyr.
7B17.4K Pulls 18 Tags Updated 8 months ago
-
shaw/dmeta-embedding-zh
https://huggingface.co/DMetaSoul/Dmeta-embedding-zh
Embedding15.2K Pulls 1 Tag Updated 5 months ago
-
CognitiveComputations/dolphin-llama3.1
Dolphin is an uncensored multilingual chat tuned model by Eric Hartford, conformant to system prompt, good at coding and various tasks
8B14.9K Pulls 15 Tags Updated 6 weeks ago
-
bge-m3
BGE-M3 is a new model from BAAI distinguished for its versatility in Multi-Functionality, Multi-Linguality, and Multi-Granularity.
Embedding14.8K Pulls 3 Tags Updated 6 weeks ago
-
dbrx
DBRX is an open, general-purpose LLM created by Databricks.
132B14.3K Pulls 7 Tags Updated 5 months ago
-
mathstral
MathΣtral: a 7B model designed for math reasoning and scientific discovery by Mistral AI.
7B14.3K Pulls 17 Tags Updated 2 months ago
-
CognitiveComputations/dolphin-2.9.2-qwen2-7b
14.2K Pulls 13 Tags Updated 3 months ago
-
wangshenzhi/llama3-8b-chinese-chat-ollama-q8
The ollama model for the 8bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit).
8B12.7K Pulls 2 Tags Updated 4 months ago