Starling-LM-10.7B-beta, an open large language model (LLM) trained by Reinforcement Learning from AI Feedback (RLAIF)

115 7 months ago

5e6cbd573f8f · 105B
{
"stop": [
"<|endoftext|>",
"<|end_of_turn|>",
"Human:",
"Assistant:"
],
"temperature": 0.1
}