Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Haining, Clark, Jason, McKelvey, Hannah, Sterman, Leila, Gao, Zheng, Tian, Zuoyu, Kübler, Sandra, Liu, Xiaozhong
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Artificial Intelligence Computers and Society
Online Access:	https://arxiv.org/abs/2410.17088
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915244535709696
author	Wang, Haining Clark, Jason McKelvey, Hannah Sterman, Leila Gao, Zheng Tian, Zuoyu Kübler, Sandra Liu, Xiaozhong
author_facet	Wang, Haining Clark, Jason McKelvey, Hannah Sterman, Leila Gao, Zheng Tian, Zuoyu Kübler, Sandra Liu, Xiaozhong
contents	A vast amount of scholarly work is published daily, yet much of it remains inaccessible to the general public due to dense jargon and complex language. To address this challenge in science communication, we introduce a reinforcement learning framework that fine-tunes a language model to rewrite scholarly abstracts into more comprehensible versions. Guided by a carefully balanced combination of word- and sentence-level accessibility rewards, our language model effectively substitutes technical terms with more accessible alternatives, a task which models supervised fine-tuned or guided by conventional readability measures struggle to accomplish. Our best model adjusts the readability level of scholarly abstracts by approximately six U.S. grade levels -- in other words, from a postgraduate to a high school level. This translates to roughly a 90% relative boost over the supervised fine-tuning baseline, all while maintaining factual accuracy and high-quality language. An in-depth analysis of our approach shows that balanced rewards lead to systematic modifications in the base model, likely contributing to smoother optimization and superior performance. We envision this work as a step toward bridging the gap between scholarly research and the general public, particularly younger readers and those without a college degree.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_17088
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Science Out of Its Ivory Tower: Improving Accessibility with Reinforcement Learning Wang, Haining Clark, Jason McKelvey, Hannah Sterman, Leila Gao, Zheng Tian, Zuoyu Kübler, Sandra Liu, Xiaozhong Computation and Language Artificial Intelligence Computers and Society A vast amount of scholarly work is published daily, yet much of it remains inaccessible to the general public due to dense jargon and complex language. To address this challenge in science communication, we introduce a reinforcement learning framework that fine-tunes a language model to rewrite scholarly abstracts into more comprehensible versions. Guided by a carefully balanced combination of word- and sentence-level accessibility rewards, our language model effectively substitutes technical terms with more accessible alternatives, a task which models supervised fine-tuned or guided by conventional readability measures struggle to accomplish. Our best model adjusts the readability level of scholarly abstracts by approximately six U.S. grade levels -- in other words, from a postgraduate to a high school level. This translates to roughly a 90% relative boost over the supervised fine-tuning baseline, all while maintaining factual accuracy and high-quality language. An in-depth analysis of our approach shows that balanced rewards lead to systematic modifications in the base model, likely contributing to smoother optimization and superior performance. We envision this work as a step toward bridging the gap between scholarly research and the general public, particularly younger readers and those without a college degree.
title	Science Out of Its Ivory Tower: Improving Accessibility with Reinforcement Learning
topic	Computation and Language Artificial Intelligence Computers and Society
url	https://arxiv.org/abs/2410.17088

Similar Items