Hermes 3 Llama-3.1 70b Model by NousResearch

tools

105 2 months ago

Readme

  • Quantization from fp32
  • Using i-matrix calibration_datav3.txt
  • temperature set to 0.2: beware higher value will destroy reasoning and math capabilities of the model

Model Description

Hermes 3 is the latest version of our flagship Hermes series of LLMs by Nous Research.

For more details on new capabilities, training results, and more, see the Hermes 3 Technical Report.

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board.

The ethos of the Hermes series of models is focused on aligning LLMs to the user, with powerful steering capabilities and control given to the end user.

The Hermes 3 series builds and expands on the Hermes 2 set of capabilities, including more powerful and reliable function calling and structured output capabilities, generalist assistant capabilities, and improved code generation skills.

Benchmarks

Hermes 3 is competitive, if not superior, to Llama-3.1 Instruct models at general capabilities, with varying strengths and weaknesses attributable between the two.

Full benchmark comparisons below:

image.png