Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Yang, Seungwon
Format:	Recurso educativo Open Access
Language:	en
Published:	2013
Subjects:	Information Science Library Science Electronic Libraries Information Retrieval Indexing Natural Language Processing Internet Usability Identification Navigation (Information Systems)
Online Access:	https://eric.ed.gov/?id=ED557892
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1867181914350157824
author	Yang, Seungwon
author_facet	Yang, Seungwon Yang, Seungwon
collection	Education Resources Information Center
contents	Automatic Identification of Topic Tags from Texts Based on Expansion-Extraction Approach Yang, Seungwon Information Science Library Science Electronic Libraries Information Retrieval Indexing Natural Language Processing Internet Usability Identification Navigation (Information Systems) Identifying topics of a textual document is useful for many purposes. We can organize the documents by topics in digital libraries. Then, we could browse and search for the documents with specific topics. By examining the topics of a document, we can quickly understand what the document is about. To augment the traditional manual way of topic tagging tasks, which is labor-intensive, solutions using computers have been developed. This dissertation describes the design and development of a topic identification approach, in this case applied to disaster events. In a sense, this study represents the marriage of research analysis with an engineering effort in that it combines inspiration from Cognitive Informatics with a practical model from Information Retrieval. One of the design constraints, however, is that the Web was used as a universal knowledge source, which was essential in accessing the required information for inferring topics from texts. Retrieving specific information of interest from such a vast information source was achieved by querying a search engine's application programming interface. Specifically, the information gathered was processed mainly by incorporating the Vector Space Model from the Information Retrieval field. As a proof of concept, we subsequently developed and evaluated a prototype tool, Xpantrac, which is able to run in a batch mode to automatically process text documents. A user interface of Xpantrac also was constructed to support an interactive semi-automatic topic tagging application, which was subsequently assessed via a usability study. Throughout the design, development, and evaluation of these various study components, we detail how the hypotheses and research questions of this dissertation have been supported and answered. We also present that our overarching goal, which was the identification of topics in a human-comparable way without depending on a large training set or a corpus, has been achieved. [The dissertation citations contained here are published with the permission of ProQuest LLC. Further reproduction is prohibited without permission. Copies of dissertations may be obtained by Telephone (800) 1-800-521-0600. Web page: http://www.proquest.com/en-US/products/dissertations/individuals.shtml.]
format	Recurso educativo Open Access
id	eric_ED557892
institution	ERIC Institute of Education Sciences
language	en
publishDate	2013
record_format	eric
spellingShingle	Automatic Identification of Topic Tags from Texts Based on Expansion-Extraction Approach Yang, Seungwon Information Science Library Science Electronic Libraries Information Retrieval Indexing Natural Language Processing Internet Usability Identification Navigation (Information Systems) Automatic Identification of Topic Tags from Texts Based on Expansion-Extraction Approach Yang, Seungwon Information Science Library Science Electronic Libraries Information Retrieval Indexing Natural Language Processing Internet Usability Identification Navigation (Information Systems) Identifying topics of a textual document is useful for many purposes. We can organize the documents by topics in digital libraries. Then, we could browse and search for the documents with specific topics. By examining the topics of a document, we can quickly understand what the document is about. To augment the traditional manual way of topic tagging tasks, which is labor-intensive, solutions using computers have been developed. This dissertation describes the design and development of a topic identification approach, in this case applied to disaster events. In a sense, this study represents the marriage of research analysis with an engineering effort in that it combines inspiration from Cognitive Informatics with a practical model from Information Retrieval. One of the design constraints, however, is that the Web was used as a universal knowledge source, which was essential in accessing the required information for inferring topics from texts. Retrieving specific information of interest from such a vast information source was achieved by querying a search engine's application programming interface. Specifically, the information gathered was processed mainly by incorporating the Vector Space Model from the Information Retrieval field. As a proof of concept, we subsequently developed and evaluated a prototype tool, Xpantrac, which is able to run in a batch mode to automatically process text documents. A user interface of Xpantrac also was constructed to support an interactive semi-automatic topic tagging application, which was subsequently assessed via a usability study. Throughout the design, development, and evaluation of these various study components, we detail how the hypotheses and research questions of this dissertation have been supported and answered. We also present that our overarching goal, which was the identification of topics in a human-comparable way without depending on a large training set or a corpus, has been achieved. [The dissertation citations contained here are published with the permission of ProQuest LLC. Further reproduction is prohibited without permission. Copies of dissertations may be obtained by Telephone (800) 1-800-521-0600. Web page: http://www.proquest.com/en-US/products/dissertations/individuals.shtml.]
title	Automatic Identification of Topic Tags from Texts Based on Expansion-Extraction Approach
topic	Information Science Library Science Electronic Libraries Information Retrieval Indexing Natural Language Processing Internet Usability Identification Navigation (Information Systems)
url	https://eric.ed.gov/?id=ED557892

Similar Items