Saved in:
Bibliographic Details
Main Authors: Teknium, Ryan, Quesnelle, Jeffrey, Guang, Chen
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.11857
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Instruct (or "chat") tuned models have become the primary way in which most people interact with large language models. As opposed to "base" or "foundation" models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally-aligned generalist instruct and tool use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state of the art performance among open weight models on several public benchmarks.