Saved in:
Bibliographic Details
Main Authors: Luo, Ling, Ning, Jinzhong, Zhao, Yingwen, Wang, Zhijun, Ding, Zeyuan, Chen, Peng, Fu, Weiru, Han, Qinyu, Xu, Guangtao, Qiu, Yunzhi, Pan, Dinghao, Li, Jiru, Li, Hao, Feng, Wenduo, Tu, Senbo, Liu, Yuqi, Yang, Zhihao, Wang, Jian, Sun, Yuanyuan, Lin, Hongfei
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2311.11608
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913254100434944
author Luo, Ling
Ning, Jinzhong
Zhao, Yingwen
Wang, Zhijun
Ding, Zeyuan
Chen, Peng
Fu, Weiru
Han, Qinyu
Xu, Guangtao
Qiu, Yunzhi
Pan, Dinghao
Li, Jiru
Li, Hao
Feng, Wenduo
Tu, Senbo
Liu, Yuqi
Yang, Zhihao
Wang, Jian
Sun, Yuanyuan
Lin, Hongfei
author_facet Luo, Ling
Ning, Jinzhong
Zhao, Yingwen
Wang, Zhijun
Ding, Zeyuan
Chen, Peng
Fu, Weiru
Han, Qinyu
Xu, Guangtao
Qiu, Yunzhi
Pan, Dinghao
Li, Jiru
Li, Hao
Feng, Wenduo
Tu, Senbo
Liu, Yuqi
Yang, Zhihao
Wang, Jian
Sun, Yuanyuan
Lin, Hongfei
contents Objective: Most existing fine-tuned biomedical large language models (LLMs) focus on enhancing performance in monolingual biomedical question answering and conversation tasks. To investigate the effectiveness of the fine-tuned LLMs on diverse biomedical NLP tasks in different languages, We present Taiyi, a bilingual fine-tuned LLM for diverse biomedical tasks. Materials and Methods: We first curated a comprehensive collection of 140 existing biomedical text mining datasets (102 English and 38 Chinese datasets) across over 10 task types. Subsequently, a two-stage strategy is proposed for supervised fine-tuning to optimize the model performance across varied tasks. Results: Experimental results on 13 test sets covering named entity recognition, relation extraction, text classification, question answering tasks demonstrate that Taiyi achieves superior performance compared to general LLMs. The case study involving additional biomedical NLP tasks further shows Taiyi's considerable potential for bilingual biomedical multi-tasking. Conclusion: Leveraging rich high-quality biomedical corpora and developing effective fine-tuning strategies can significantly improve the performance of LLMs within the biomedical domain. Taiyi shows the bilingual multi-tasking capability through supervised fine-tuning. However, those tasks such as information extraction that are not generation tasks in nature remain challenging for LLM-based generative approaches, and they still underperform the conventional discriminative approaches of smaller language models.
format Preprint
id arxiv_https___arxiv_org_abs_2311_11608
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks
Luo, Ling
Ning, Jinzhong
Zhao, Yingwen
Wang, Zhijun
Ding, Zeyuan
Chen, Peng
Fu, Weiru
Han, Qinyu
Xu, Guangtao
Qiu, Yunzhi
Pan, Dinghao
Li, Jiru
Li, Hao
Feng, Wenduo
Tu, Senbo
Liu, Yuqi
Yang, Zhihao
Wang, Jian
Sun, Yuanyuan
Lin, Hongfei
Computation and Language
Artificial Intelligence
Objective: Most existing fine-tuned biomedical large language models (LLMs) focus on enhancing performance in monolingual biomedical question answering and conversation tasks. To investigate the effectiveness of the fine-tuned LLMs on diverse biomedical NLP tasks in different languages, We present Taiyi, a bilingual fine-tuned LLM for diverse biomedical tasks. Materials and Methods: We first curated a comprehensive collection of 140 existing biomedical text mining datasets (102 English and 38 Chinese datasets) across over 10 task types. Subsequently, a two-stage strategy is proposed for supervised fine-tuning to optimize the model performance across varied tasks. Results: Experimental results on 13 test sets covering named entity recognition, relation extraction, text classification, question answering tasks demonstrate that Taiyi achieves superior performance compared to general LLMs. The case study involving additional biomedical NLP tasks further shows Taiyi's considerable potential for bilingual biomedical multi-tasking. Conclusion: Leveraging rich high-quality biomedical corpora and developing effective fine-tuning strategies can significantly improve the performance of LLMs within the biomedical domain. Taiyi shows the bilingual multi-tasking capability through supervised fine-tuning. However, those tasks such as information extraction that are not generation tasks in nature remain challenging for LLM-based generative approaches, and they still underperform the conventional discriminative approaches of smaller language models.
title Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2311.11608