6,975 9 hours ago

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

vision 1b

13 models