Saved in:
Bibliographic Details
Main Author: Greenfield, Rich
Format: Recurso educativo Open Access
Language:en
Published: 1997
Subjects:
Online Access:https://eric.ed.gov/?id=ED406996
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867181005340672000
author Greenfield, Rich
author_facet Greenfield, Rich
Greenfield, Rich
collection Education Resources Information Center
contents Do We Still Need Controlled Vocabulary? Of Course, We Do! But How Do We Get It: The Roles for Text Analysis Softwares. Greenfield, Rich Access to Information Authority Control (Information) Automatic Indexing Bibliographic Records Cataloging Comparative Analysis Computer Software Evaluation Electronic Libraries Information Networks Information Technology Internet Library Catalogs Library Development Online Catalogs Standards Technological Advancement World Wide Web The author argues that traditional library cataloging (MARC) and the online public access catalog (OPAC) are in collision with the world of the Internet because items in electronic formats undergo MARC cataloging only on a very selective basis. Also the library profession initially isolated itself from World Wide Web development by predicting no real need for universal access, by ignoring large areas of human creativity, and by de-emphasizing "ephemeral" resources. This paper recommends a constructive merger of the best of both worlds--the full text analysis provided by web search engines and the controlled vocabularies found in library OPACs. The Congressional Research Service (CRS) is being used as a testbed to examine relevant techniques. Three of the major text analysis technologies are natural language processing, case-based reasoning, and adaptive learning. As part of "the new OPAC," the Experimental Search System (ESS) is one of the Library of Congress' first efforts to make selected cataloging and digital library resources available over the World Wide Web by means of a single, point-and-click interface. Perhaps even more promising is the idea of using large MARC databases to generate word clusters associated with controlled vocabulary terms and classifications. Six commercial text analysis software products are reviewed in the Appendix, using a comparative table. These tools, many of them associated with major search engine vendors, may support automatic classification and document analysis, thereby increasing cataloger productivity. (AEF)
format Recurso educativo Open Access
id eric_ED406996
institution ERIC Institute of Education Sciences
language en
publishDate 1997
record_format eric
spellingShingle Do We Still Need Controlled Vocabulary? Of Course, We Do! But How Do We Get It: The Roles for Text Analysis Softwares.
Greenfield, Rich
Access to Information
Authority Control (Information)
Automatic Indexing
Bibliographic Records
Cataloging
Comparative Analysis
Computer Software Evaluation
Electronic Libraries
Information Networks
Information Technology
Internet
Library Catalogs
Library Development
Online Catalogs
Standards
Technological Advancement
World Wide Web
Do We Still Need Controlled Vocabulary? Of Course, We Do! But How Do We Get It: The Roles for Text Analysis Softwares. Greenfield, Rich Access to Information Authority Control (Information) Automatic Indexing Bibliographic Records Cataloging Comparative Analysis Computer Software Evaluation Electronic Libraries Information Networks Information Technology Internet Library Catalogs Library Development Online Catalogs Standards Technological Advancement World Wide Web The author argues that traditional library cataloging (MARC) and the online public access catalog (OPAC) are in collision with the world of the Internet because items in electronic formats undergo MARC cataloging only on a very selective basis. Also the library profession initially isolated itself from World Wide Web development by predicting no real need for universal access, by ignoring large areas of human creativity, and by de-emphasizing "ephemeral" resources. This paper recommends a constructive merger of the best of both worlds--the full text analysis provided by web search engines and the controlled vocabularies found in library OPACs. The Congressional Research Service (CRS) is being used as a testbed to examine relevant techniques. Three of the major text analysis technologies are natural language processing, case-based reasoning, and adaptive learning. As part of "the new OPAC," the Experimental Search System (ESS) is one of the Library of Congress' first efforts to make selected cataloging and digital library resources available over the World Wide Web by means of a single, point-and-click interface. Perhaps even more promising is the idea of using large MARC databases to generate word clusters associated with controlled vocabulary terms and classifications. Six commercial text analysis software products are reviewed in the Appendix, using a comparative table. These tools, many of them associated with major search engine vendors, may support automatic classification and document analysis, thereby increasing cataloger productivity. (AEF)
title Do We Still Need Controlled Vocabulary? Of Course, We Do! But How Do We Get It: The Roles for Text Analysis Softwares.
topic Access to Information
Authority Control (Information)
Automatic Indexing
Bibliographic Records
Cataloging
Comparative Analysis
Computer Software Evaluation
Electronic Libraries
Information Networks
Information Technology
Internet
Library Catalogs
Library Development
Online Catalogs
Standards
Technological Advancement
World Wide Web
url https://eric.ed.gov/?id=ED406996