Lightweight and fast vision model, does a decent job captioning photos.
Vision
3B
687 Pulls Updated 6 months ago
53a7eb15ae08 · 37B
{
"stop": [
"Q:",
"A:"
],
"temperature": 0
}