1,122 1 week ago

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

vision 1b

13 models