Hermes 3 Llama-3.1 70b Model by NousResearch
105 Pulls Updated 2 months ago
Updated 2 months ago
2 months ago
3bffd0bdf21c · 40GB
Readme
- Quantization from
fp32
- Using i-matrix
calibration_datav3.txt
temperature
set to0.2
: beware higher value will destroy reasoning and math capabilities of the model
Model Description
Hermes 3 is the latest version of our flagship Hermes series of LLMs by Nous Research.
For more details on new capabilities, training results, and more, see the Hermes 3 Technical Report.
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board.
The ethos of the Hermes series of models is focused on aligning LLMs to the user, with powerful steering capabilities and control given to the end user.
The Hermes 3 series builds and expands on the Hermes 2 set of capabilities, including more powerful and reliable function calling and structured output capabilities, generalist assistant capabilities, and improved code generation skills.
Benchmarks
Hermes 3 is competitive, if not superior, to Llama-3.1 Instruct models at general capabilities, with varying strengths and weaknesses attributable between the two.
Full benchmark comparisons below: