222 Downloads Updated yesterday
ollama run openbmb/minicpm5:fp16
Updated yesterday
yesterday
661f549292bc · 2.2GB ·
MiniCPM Tech Report | GitHub Repo | UltraData | MiniCPM Desk Pet | Online Demo
English | 中文
We are releasing MiniCPM5-1B, the first model in the MiniCPM5 series. It is a dense 1B Transformer built for on-device, local deployment, and resource-constrained scenarios, reaching 1B-class open-source SOTA.
🏆 1B-class open-source SOTA: compared with strong open-source models in the same size class, MiniCPM5-1B reaches SOTA within this comparison set. Its advantage is most visible in agentic tool use, code generation, and difficult reasoning.

🧠 Hybrid Reasoning: built-in <think> chat template, switch via enable_thinking. The same checkpoint serves as both a fast assistant and a deliberate reasoner.
🛠️ Deployment / Fine-tuning Resources: the MiniCPM GitHub repo provides single-page cookbooks and Agent Skills for major inference backends and fine-tuning frameworks.
🐱 Desktop Pet: a local-LLM desktop pet driven by MiniCPM5-1B.
Use this directory to choose the model format that matches your runtime:
MiniCPM5-1B has the following features:
LlamaForCausalLMMiniCPM5-1B is the first checkpoint in the MiniCPM5 series. It is designed for local assistants, coding agents, tool-use workflows, and reasoning scenarios where a compact model is preferred. The model keeps a small deployment footprint while providing native long-context support and both Think / No Think chat modes through the same checkpoint.
Note: If you want to use local deployment, you can refer to this document.