Staff View: Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance :: Library Catalog

Bibliographic Details
Main Authors:	Devatine, Nicolas; Abraham, Louis
Format:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2412.17321

_version_	1866915076071489536
author	Devatine, Nicolas; Abraham, Louis
author_facet	Devatine, Nicolas; Abraham, Louis
contents	Assessing the extent of human edits on texts generated by Large Language Models (LLMs) is crucial to understanding human-AI interactions and improving the quality of automated text generation systems. Existing edit distance metrics, such as Levenshtein, BLEU, ROUGE, and TER, often fail to accurately measure the effort required for post-editing, especially when edits involve substantial modifications, such as block operations. In this paper, we introduce a novel compression-based edit distance metric grounded in the Lempel-Ziv-77 algorithm, designed to quantify the amount of post-editing applied to LLM-generated texts. Our method leverages the properties of text compression to measure the informational difference between the original and edited texts. Through experiments on real-world datasets of human edits, we demonstrate that our proposed metric is highly correlated with actual edit time and effort. We also show that LLMs exhibit an implicit understanding of editing speed that aligns well with our metric. Furthermore, we compare our metric with existing ones, highlighting its advantages in capturing complex edits with linear computational efficiency. Our code and data are available at: https://github.com/NDV-tiime/CompressionDistance
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_17321
institution	arXiv
publishDate	2024
record_format	arxiv
title	Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance
topic	Computation and Language; Artificial Intelligence
url	https://arxiv.org/abs/2412.17321
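
The abstract in the contents field describes an edit distance built on Lempel-Ziv-77 compression. As a rough illustration of that general idea only, the sketch below counts how many greedy LZ77-style factors are needed to encode the edited text when the original text is available as a dictionary. The function name lz77_factor_count, the greedy matching rule, and the use of a raw factor count as the score are assumptions made for this example; they are not taken from the paper or its repository, and this naive version is not linear-time.

```python
def lz77_factor_count(source: str, target: str) -> int:
    """Count greedy LZ77-style factors needed to encode `target`.

    Hypothetical illustration: the dictionary is `source` followed by the
    part of `target` already encoded. Each factor is either a copy of the
    longest substring found in that dictionary or a single new character.
    """
    factors = 0
    i = 0
    while i < len(target):
        # Dictionary available at this step (assumption: plain concatenation).
        window = source + target[:i]
        length = 0
        # Grow the match while the next longer substring still occurs.
        while i + length < len(target) and target[i:i + length + 1] in window:
            length += 1
        # A zero-length match means a literal character is emitted.
        i += max(length, 1)
        factors += 1
    return factors


if __name__ == "__main__":
    original = "hello world"
    print(lz77_factor_count(original, original))       # unchanged text
    print(lz77_factor_count(original, "world hello"))  # block swap
```

In this sketch a block move costs only a few factors because each moved block is copied in one step, whereas a character-level metric such as Levenshtein charges for every shifted character.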

Similar Items