Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Shi, Xiangyu, Chiesa, Marco, Maguire Jr., Gerald Q., Kostic, Dejan
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence Multiagent Systems
Online Access:	https://arxiv.org/abs/2510.03346
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910028496109568
author	Shi, Xiangyu Chiesa, Marco Maguire Jr., Gerald Q. Kostic, Dejan
author_facet	Shi, Xiangyu Chiesa, Marco Maguire Jr., Gerald Q. Kostic, Dejan
contents	Large Language Models (LLMs) are increasingly deployed in multi-agent systems, where effective inter-model communication is crucial. Existing communication protocols either rely on natural language, incurring high inference costs and information loss, or on hidden states, which suffer from information concentration bias and inefficiency. To address these limitations, we propose KVComm, a novel communication framework that enables efficient communication between LLMs through selective sharing of KV pairs. KVComm leverages the rich information encoded in the KV pairs while avoiding the pitfalls of hidden states. We introduce a KV layer-wise selection strategy based on attention importance scores with a Gaussian prior to identify the most informative KV pairs for communication. Extensive experiments across diverse tasks and model pairs demonstrate that KVComm achieves comparable performance to the upper-bound method, which directly merges inputs to one model without any communication, while transmitting as few as 30\% of layers' KV pairs. Our study highlights the potential of KV pairs as an effective medium for inter-LLM communication, paving the way for scalable and efficient multi-agent systems.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_03346
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	KVComm: Enabling Efficient LLM Communication through Selective KV Sharing Shi, Xiangyu Chiesa, Marco Maguire Jr., Gerald Q. Kostic, Dejan Machine Learning Artificial Intelligence Multiagent Systems Large Language Models (LLMs) are increasingly deployed in multi-agent systems, where effective inter-model communication is crucial. Existing communication protocols either rely on natural language, incurring high inference costs and information loss, or on hidden states, which suffer from information concentration bias and inefficiency. To address these limitations, we propose KVComm, a novel communication framework that enables efficient communication between LLMs through selective sharing of KV pairs. KVComm leverages the rich information encoded in the KV pairs while avoiding the pitfalls of hidden states. We introduce a KV layer-wise selection strategy based on attention importance scores with a Gaussian prior to identify the most informative KV pairs for communication. Extensive experiments across diverse tasks and model pairs demonstrate that KVComm achieves comparable performance to the upper-bound method, which directly merges inputs to one model without any communication, while transmitting as few as 30\% of layers' KV pairs. Our study highlights the potential of KV pairs as an effective medium for inter-LLM communication, paving the way for scalable and efficient multi-agent systems.
title	KVComm: Enabling Efficient LLM Communication through Selective KV Sharing
topic	Machine Learning Artificial Intelligence Multiagent Systems
url	https://arxiv.org/abs/2510.03346

Similar Items