Staff View: Optimizing Token Usage on Large Language Model Conversations Using the Design Structure Matrix :: Library Catalog

Bibliographic Details
Main Authors:	Alarcia, Ramon Maria Garcia; Golkar, Alessandro
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2410.00749

_version_	1866914961773559808
author	Alarcia, Ramon Maria Garcia; Golkar, Alessandro
author_facet	Alarcia, Ramon Maria Garcia; Golkar, Alessandro
contents	As Large Language Models become ubiquitous in many sectors and tasks, there is a need to reduce token usage, overcoming challenges such as short context windows, limited output sizes, and costs associated with token intake and generation, especially in API-served LLMs. This work brings the Design Structure Matrix (DSM) from the engineering design discipline into LLM conversation optimization. Applied to a use case in which the LLM conversation is about the design of a spacecraft and its subsystems, the DSM, with its analysis tools such as clustering and sequencing, proves to be an effective tool to organize the conversation, minimizing the number of tokens sent to or retrieved from the LLM at once, as well as grouping chunks that can be allocated to different context windows. Hence, this work broadens the current set of methodologies for token usage optimization and opens new avenues for the integration of engineering design practices into LLMs.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_00749
institution	arXiv
publishDate	2024
record_format	arxiv
title	Optimizing Token Usage on Large Language Model Conversations Using the Design Structure Matrix
topic	Computation and Language
url	https://arxiv.org/abs/2410.00749

Similar Items