Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Mirza, Paramita, Sudhi, Viju, Sahoo, Soumya Ranjan, Bhat, Sinchana Ramakanth
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2403.17536
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913284603510784
author	Mirza, Paramita Sudhi, Viju Sahoo, Soumya Ranjan Bhat, Sinchana Ramakanth
author_facet	Mirza, Paramita Sudhi, Viju Sahoo, Soumya Ranjan Bhat, Sinchana Ramakanth
contents	State-of-the-art intent classification (IC) and slot filling (SF) methods often rely on data-intensive deep learning models, limiting their practicality for industry applications. Large language models on the other hand, particularly instruction-tuned models (Instruct-LLMs), exhibit remarkable zero-shot performance across various natural language tasks. This study evaluates Instruct-LLMs on popular benchmark datasets for IC and SF, emphasizing their capacity to learn from fewer examples. We introduce ILLUMINER, an approach framing IC and SF as language generation tasks for Instruct-LLMs, with a more efficient SF-prompting method compared to prior work. A comprehensive comparison with multiple baselines shows that our approach, using the FLAN-T5 11B model, outperforms the state-of-the-art joint IC+SF method and in-context learning with GPT3.5 (175B), particularly in slot filling by 11.1--32.2 percentage points. Additionally, our in-depth ablation study demonstrates that parameter-efficient fine-tuning requires less than 6% of training data to yield comparable performance with traditional full-weight fine-tuning.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_17536
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler Mirza, Paramita Sudhi, Viju Sahoo, Soumya Ranjan Bhat, Sinchana Ramakanth Computation and Language State-of-the-art intent classification (IC) and slot filling (SF) methods often rely on data-intensive deep learning models, limiting their practicality for industry applications. Large language models on the other hand, particularly instruction-tuned models (Instruct-LLMs), exhibit remarkable zero-shot performance across various natural language tasks. This study evaluates Instruct-LLMs on popular benchmark datasets for IC and SF, emphasizing their capacity to learn from fewer examples. We introduce ILLUMINER, an approach framing IC and SF as language generation tasks for Instruct-LLMs, with a more efficient SF-prompting method compared to prior work. A comprehensive comparison with multiple baselines shows that our approach, using the FLAN-T5 11B model, outperforms the state-of-the-art joint IC+SF method and in-context learning with GPT3.5 (175B), particularly in slot filling by 11.1--32.2 percentage points. Additionally, our in-depth ablation study demonstrates that parameter-efficient fine-tuning requires less than 6% of training data to yield comparable performance with traditional full-weight fine-tuning.
title	ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler
topic	Computation and Language
url	https://arxiv.org/abs/2403.17536

Similar Items