-
minicpm-o2.6
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
vision 8b 26.6K Pulls 13 Tags Updated 11 months ago
-
minicpm-v4.5
A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
vision 8b 17.6K Pulls 11 Tags Updated 9 months ago
-
minicpm-v4.6
A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone
vision 1b 6,952 Pulls 13 Tags Updated 6 hours ago
-
minicpm-o4.5
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
vision 8b 6,901 Pulls 12 Tags Updated 3 months ago
-
minicpm-v2.6
A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
vision 8b 2,477 Pulls 12 Tags Updated 11 months ago
-
minicpm-v4
A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
vision 4b 1,826 Pulls 12 Tags Updated 9 months ago
-
minicpm4.1
Highly efficient large language models (LLMs) designed explicitly for end-side devices
1,279 Pulls 1 Tag Updated 8 months ago
-
minicpm-v2.5
A GPT-4V Level Multimodal LLM on Your Phone
vision 8b 433 Pulls 13 Tags Updated 11 months ago
-
minicpm5
Highly efficient large language models (LLMs) designed explicitly for end-side devices
222 Pulls 4 Tags Updated yesterday