Table of Contents: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Teknium, Ryan, Quesnelle, Jeffrey, Guang, Chen
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2408.11857
Tags:	Add Tag No Tags, Be the first to tag this record!

Table of Contents:

Instruct (or "chat") tuned models have become the primary way in which most people interact with large language models. As opposed to "base" or "foundation" models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally-aligned generalist instruct and tool use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state of the art performance among open weight models on several public benchmarks.

Similar Items