Fixes memory prediction issues and limits the number of layers loaded onto the GPU.

16B

46 Pulls · Updated 3 months ago

732caedf08d1 · 112B
{{ if .System }}{{ .System }} {{ end }}{{ if .Prompt }}User: {{ .Prompt }} {{ end }}Assistant: {{ .Response }}
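
To see what this template expands to, the following is a minimal sketch that renders it with Go's standard `text/template` package. It assumes the template is evaluated as an ordinary Go template, which the `{{ if }}` / `{{ end }}` syntax suggests. The `promptVars` struct and the `renderPrompt` helper are hypothetical names introduced only for this illustration; they are not part of the model or of any runtime's API.

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

// promptTemplate is the template string from the model page, copied verbatim.
const promptTemplate = `{{ if .System }}{{ .System }} {{ end }}{{ if .Prompt }}User: {{ .Prompt }} {{ end }}Assistant: {{ .Response }}`

// promptVars is a hypothetical container for the three fields the template reads.
type promptVars struct {
	System   string
	Prompt   string
	Response string
}

// renderPrompt (hypothetical helper) parses the template and fills it in.
func renderPrompt(v promptVars) (string, error) {
	tmpl, err := template.New("prompt").Parse(promptTemplate)
	if err != nil {
		return "", err
	}
	var sb strings.Builder
	if err := tmpl.Execute(&sb, v); err != nil {
		return "", err
	}
	return sb.String(), nil
}

func main() {
	// Response is left empty, as it would be before the model has generated anything.
	out, err := renderPrompt(promptVars{
		System: "You are a helpful assistant.",
		Prompt: "Hello",
	})
	if err != nil {
		panic(err)
	}
	// %q makes the trailing space after "Assistant:" visible.
	fmt.Printf("%q\n", out)
}
```

Under these assumptions, an empty `System` or `Prompt` drops its segment entirely, so a request with only a prompt renders as `User: <prompt> Assistant: `, with the model's reply appended after the final space.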